HomeCompaniesSepal AI

Data Development for Advanced AI

Sepal is a data research company on a mission to advance human knowledge and capabilities through safe AI. We partner with the world’s leading AI labs and enterprises to help their models get better at the tasks people actually want them to do. We’ve built a Cloud-Native Agent Dataset Factory which turns the process of generating evaluation and training data from manual, inconsistent, and labor-intensive into something automated, standardized, and scalable. Sepal AI was founded in 2024 by engineers and operators from Vercel and Turing. We went through Y Combinator, raised several million dollars from leading investors, and already count multiple Fortune 500s and top AI research labs as paying customers.
Active Founders
Robi Lin
Robi Lin
Founder
Co-Founder @ Sepal AI Built the enterprise workflow products and fulfillment strategy at Turing.com. Scaled Turing’s LLM trainer business line from 50 to 800+ onboarded developers in 5 months for foundational LLM and enterprise customers. Previously was at Bain & Co.
Kat Hu
Kat Hu
Founder
Cofounder @ Sepal AI Built Turing’s Foundational LLM trainer business GTM. Ran orgs of 500+ AI trainers & built corresponding operations for scale. Previously was at McKinsey.
Company Launches
🌱 Sepal AI - Confidently deploy your AI models
See original launch post

Tl;dr: Sepal provides frontier data and tooling for advancing responsible AI development.

__________________________________________________________

Sepal AI is on a mission to advance human knowledge and capabilities with the responsible development of artificial intelligence.

🧐 Responsibly advance human knowledge with AI? What does that mean?

We believe in a world where AI advances scientific research and empowers economic growth.

To achieve that future, AI product & model builders need:

  1. Golden Datasets and Frontier Benchmarking: To iteratively measure model performance on specific use cases.
  2. Training Data: To improve model capabilities using fine-tuning and RLHF.
  3. Safety / Red-teaming: To measure and forecast the safety of LLMs before putting them out in the wild.

__________________________________________________________

⚠️ Okay, well why does it matter?

Frontier data for AI development is vital for safe deployment & scaling. However, developing this data is difficult.

Most frontier data requires domain knowledge that can be hard to source and curate (e.g., finance, medical, physics, biology, etc.). Publicly available benchmarks (e.g., MMLU, GPQA, MATH, etc.) are contaminated and too general to be useful to actual product & model builders.

__________________________________________________________

🌱 How do we do this?

We’ve built Sepal AI - the data development platform that enables you to curate useful datasets.

The Platform: We bring data generation tooling, human experts, synthetic data augmentation, and rigorous quality control into one platform so you can manage the production of high-quality datasets.

uploaded image

Our Expert Network: We’ve built a network of 20k+ experts across STEM and professional services (think academic PhDs, business analysts, medical professionals, marketing and finance consultants) to support campaign design & data development.

Sample engagements we’ve run:

  • 🧬 Cell and Molecular Biology Benchmark: An original benchmark to evaluate complex reasoning across models. Produced by a team of PhD biologists from top institutions in the US.
  • 💼 Finance Q&A + SQL Eval: A Golden Dataset to test the ability of an AI agent to query a database and produce human-expert-level answers to complex finance questions.
  • 📏 Uplift Trials & Human Baselining: End to end support for conducting secure in-person evaluations on model performance.
  • …. [insert your custom use case next?]

__________________________________________________________

🙏 Asks:

  1. If you are building an AI application and need to measure or improve your model, or
  2. If you are a researcher at an AI lab building or evaluating models for new capabilities / risk areas, or
  3. If you’re passionate about the development of AI, AI safety, or evals in general…

Let’s chat — please check us out at www.sepalai.com

__________________________________________________________

👪 Our team:

uploaded image

Meet Kat (on the left), Robi (in the middle), Fedor (on the right)!

Robi and Kat previously built the technical LLM training business for Turing. Kat on the go-to-market & operations side. Robi on the product & fulfillment side. Fedor is a long-time close friend - he was an early engineer at Vercel & Newfront where he built out foundational infrastructure.

Say hi: founders@sepalai.com.

Jobs at Sepal AI
San Francisco, CA, US / Remote (US)
$130 - $180
0.20% - 0.90%
3+ years
San Francisco, CA, US
$130 - $180
0.20% - 0.90%
3+ years
San Francisco, CA, US / Remote (US)
$20 - $40 / hourly
Any
San Francisco, CA, US / New York, NY, US
$120K - $150K
0.30% - 0.70%
3+ years
Sepal AI
Founded:2024
Batch:Summer 2024
Team Size:15
Status:
Active
Location:San Francisco
Primary Partner:Tyler Bosmeny