Data Engineering Startups funded by Y Combinator (YC) 2025

December 2025

Browse 72 of the top Data Engineering startups funded by Y Combinator.

We also have a Startup Directory where you can search through over 5,000 companies.

  • Fivetran
    Fivetran
    Y Combinator LogoW2013
    Active • 1,200 employees • Oakland, CA, USA
    Fivetran automates data movement out of, into and across cloud data platforms. We automate the most time-consuming parts of the ELT process from extracts to schema drift handling to transformations, so data engineers can focus on higher-impact projects with total pipeline peace of mind. With 99.9% uptime and self-healing pipelines, Fivetran enables hundreds of leading brands across the globe, including Autodesk, Conagra Brands, JetBlue, Lionsgate, Morgan Stanley, and Ziff Davis, to accelerate data-driven decisions and drive business growth. Fivetran is headquartered in Oakland, California, with offices around the world. 
    data-engineering
    saas
    analytics
    b2b
  • Captain
    Captain
    Y Combinator LogoF2025
    Active • 2 employees • San Francisco, CA, USA
    Captain delivers the most accurate general-purpose retrieval engine for unstructured data. Connect file stores and effortlessly retrieve knowledge with much higher accuracy than RAG (Avg: 78% → 95% + citations).
    developer-tools
    data-engineering
    big-data
  • sieve
    sieve
    Y Combinator LogoX2025
    Active • 2 employees • New York, NY, USA
    sieve solves data cleaning for hedge funds and investment firms by letting them get clean data in four lines of code. Currently, their data pipelines have conditions that raise for human review, which literally send an email to engineers with data that needs to be reviewed. We provide an API that integrates directly into their existing pipeline - instead of raising for human review, they can send all the same information to our API and get clean, high-quality data back. By using our AI agents built specifically for financial data collection, along with expert-in-the-loop review, we provide our clients with clean, validated data at a scale and level of quality that wasn't achievable before.
    apis
    investing
    data-engineering
  • Labric
    Labric
    Y Combinator LogoX2025
    Active • 2 employees • San Francisco, CA, USA
    Labric is building the data layer that makes AI work for science. We capture messy lab data from instruments and tools, clean it, and connect it— so researchers can actually use AI to accelerate discovery.
    advanced-materials
    biotech
    nanotechnology
    data-engineering
    ai
  • Melder
    Melder
    Y Combinator LogoF2024
    Active • 2 employees • New York, NY, USA
    Melder is an Excel add-in that brings AI functions and document support into your spreadsheets. Upload files directly into cells, use smart formulas like =GEN, and build automations—all without leaving Excel. Core features: - File-to-Sheet: Drop PDFs directly into cells, then reference them in formulas. - AI-Powered Functions: Write formulas like =GEN() or =EXTRACT() to summarize, classify, and analyze content. - Chat Assistant: Use our AI assistant to help build your sheets or answer questions from your data, live in the workbook. Business users use Melder to: - Accelerate diligence by extracting insights from data rooms - Review contracts by identifying key terms and clauses instantly - Run market research by pulling information from competitor websites - Synthesize transcripts by generating summaries from interviews and calls Melder brings the power of structured spreadsheet logic to the messy, unstructured data world—no coding needed.
    data-engineering
    generative-ai
    artificial-intelligence
  • Coblocks
    Coblocks
    Y Combinator LogoF2024
    Active • 2 employees • New York, NY, USA
    Coblocks is a thoughtfully-designed data platform that helps teams write queries and automate workflows faster. We understand the columns, tables, and relationships in your data and use them to help anyone on your team build pipelines with AI, SQL and Python. Think of us like Zapier plus Cursor for data engineering. Here’s how we’re different: • All-in-one: You can get started in 2 minutes – no setup or configuration required. We have one-click integrations, warehousing, transformation, and schedules all built in. • Seamless integrations: Plug in your Postgres database, Stripe transactions, Hubspot leads, or any other data source, without writing code to keep things in sync. • Thoughtful AI: We love Cursor and we love data – we combined the two to help you write accurate queries. We use existing metadata to help you create new datasets, connect sources, fix errors, or edit in place. • Collaborative: Easily share data and discover what others in your org have built as a starting place for your analysis. Wrap common blocks of logic with templates so your team never has to start from zero. • Resilient and Scalable: Our compute engine is lightning-fast for queries and builds. Git and branching are built-in for both code and data, so you can time-travel backwards when things break. You can start with GBs and grow to TBs.
    data-engineering
    big-data
    analytics
    data-science
    ai
  • Snowpilot
    Snowpilot
    Y Combinator LogoS2024
    Active • 2 employees • San Francisco, CA, USA
    Snowpilot combines a spreadsheet UI with a federated data engine. We get live data from tools like Salesforce, Gong, and Mixpanel, enabling PMs, marketers, and salespeople to run high-impact workflows with data at any scale. Ben and Dom met at a Sequoia & a16z-backed data startup, Census. Together, we built the first real-time, warehouse-native customer data platform. Prior to that, Dom led 20+ ML engineers at Adobe to build their internal ad optimization platform, which allocates $1B in annual spend. Ben built the microservices stack powering the new Microsoft Edge, scaling from 0 to hundreds of millions of DAUs. We started coding Snowpilot in mid-August '24, and we already have a live app that can run sub-second queries on millions of rows, entirely in the user's browser. The data warehouse market is $10B/yr, growing 23% YOY. We will disrupt incumbents and significantly expand this market by enabling non-engineers to use big data on a daily basis.
    b2b
    data-engineering
    databases
    big-data
    ai
  • Sensei
    Sensei
    Y Combinator LogoS2024
    Active • 2 employees • San Francisco, CA, USA
    Sensei helps robotics companies scale and outsource their training data collection. Our hardware platform enables the collection of human-demonstration data at a tenth of the cost and twice the speed of current teleop approaches. Our software platform acts like Scale AI for robotics data: a large network of paid human operators use our low-cost collection platform to fulfill data-generation requests.
    robotics
    hard-tech
    artificial-intelligence
    marketplace
    data-engineering
  • Trellis AI
    Trellis AI
    Y Combinator LogoW2024
    Active • 25 employees
    Trellis helps healthcare providers treat more patients, faster—while eliminating pre-service paperwork. We automate document intake, prior authorizations, and appeals at scale to streamline operations and accelerate care. Our AI agent is trained on millions of clinical data points and converts messy, unstructured documents into clean, structured data directly in your EHR. With Trellis, leading healthcare providers and pharmaceutical companies were able to: 1. Reduce time to treatment by over 90% 2. Improve prior authorization approval and reimbursement rates 3. Leverage structured data to enhance drug program performance and clinical decision-making Administrative costs account for over 20% of U.S. healthcare spending—delaying care, draining revenue, and driving staff burnout while having less visibility into patient care than ever before. We built Trellis to tackle this head on.
    b2b
    data-engineering
    databases
    infrastructure
    ai
  • kater.ai
    kater.ai
    Y Combinator LogoW2024
    Active • 3 employees • San Francisco, CA, USA
    1. You explain your problem. 2. Kater identifies the most important data questions to ask. 3. Kater writes the code. 4. You get insights in seconds rather than weeks. Kater.ai flips the script on enterprise analytics by making every user an expert analyst. It uses a continuous classification engine to turn a single business question into a contextualized package of questions that is specific to your needs. Kater puts the power of data into the hands of business experts while ensuring they use trusted data that is specific to their persona. No more waiting for data analysts. No more wasted time on analysis misfires and rework. Yvonne was a data engineer and analyst who built the entire data stack at CREXi. Robin led engineering in Microsoft. Data is the new oil. Companies are data-rich, insight-poor. We're helping companies become insight-rich. This is the future of data.
    data-engineering
    analytics
    artificial-intelligence
  • Ocular AI
    Ocular AI
    Y Combinator LogoW2024
    Active • 6 employees • San Francisco, CA, USA
    Ocular AI is the data annotation engine for Generative AI, Computer Vision, and Enterprise AI models. We help you transform unstructured, multi-modal data into golden datasets to power generative AI, frontier models, and computer vision. Ocular Foundry is the most intuitive, data-centric, and fastest platform that lets you label, annotate, version, and deploy your data for training models. It also orchestrates your annotation jobs, improving collaboration with members and annotators. With Ocular Bolt, shift from humans in the loop to experts in the loop to supercharge your data labeling and annotation projects. Our global expert workforce ensures fast, accurate results—no matter the scale or complexity of your data. Companies spend huge amounts on training data, but Foundry and Bolt are AI-native tools that lower costs, reduce manual effort, and accelerate high-quality data collection. We’re replacing outdated, clunky, and expensive data software!
    artificial-intelligence
    data-engineering
    machine-learning
    computer-vision
    developer-tools
  • Reducto
    Reducto
    Y Combinator LogoW2024
    Active • 18 employees • San Francisco, CA, USA
    Reducto offers robust and reliable document ingestion for any workflow. Our API allows you to convert complex, unstructured documents into structured outputs that are perfect for RAG, process automation, and more.
    artificial-intelligence
    documents
    data-engineering
    search
    enterprise-software
  • Buster
    Buster
    Y Combinator LogoW2024
    Active • 3 employees • Salt Lake City, UT, USA
    Buster is an AI agent platform built for analytics engineering. It provides data teams with AI agents that keep their dbt projects reliable, documented, and consistent — automatically.
    data-engineering
    data-visualization
    databases
    data-science
    generative-ai
  • DataShare
    DataShare
    Y Combinator LogoS2023
    Active • 1 employees • Austin, TX, USA
    DataShare is a data-as-a-service platform that lets you embed charts, dashboards and exports directly into your product. For example, if you run an accounting startup, DataShare would enable you to embed a full profit and loss dashboard, with downloadable statements. DataShare is backed by an enterprise-grade data warehouse, and can be implemented in fewer than 20 lines of code.
    data-engineering
    databases
    analytics
  • Cedalio
    Cedalio
    Y Combinator LogoS2023
    Active • 6 employees • San Francisco, CA, USA
    Track and control emissions, energy, water, gas, and more—across all sites and countries, from a single place. Cedalio automates utility bill processing, detects anomalies, and centralizes your data into one source of truth. Save hours of manual work, make faster decisions, and enhance your sustainability efforts.​
    data-engineering
    artificial-intelligence
    energy
  • Artie
    Artie
    Y Combinator LogoS2023
    Active • 10 employees • San Francisco, CA, USA
    Artie is software that streams data from databases to data warehouses in real-time. Today, most companies run their ETL process every few hours or overnight, so their data warehouse is always out of date; with Artie, the warehouse always has live production data.
    data-engineering
    developer-tools
    enterprise-software
    saas
  • Ohm
    Ohm
    Y Combinator LogoW2023
    Active • 6 employees • San Francisco, CA, USA
    Ohm supports battery design & manufacturing teams globally. Our customers include battery manufacturers (traditional and next-generation chemistries) and Fortune 100 technology companies that use batteries in their products.
    data-engineering
    ai
  • TableFlow
    TableFlow
    Y Combinator LogoW2023
    Active • 2 employees • San Francisco, CA, USA
    TableFlow builds AI teammates for data tasks, helping operations and data teams automate the messy, manual tasks buried in PDFs, spreadsheets, images, and emails.
    ai
    automation
    documents
    data-engineering
    saas
  • Lume
    Lume
    Y Combinator LogoW2023
    Active • 5 employees
    Lume speeds up customer implementation with AI. Lume helps teams analyze, map, and ingest customer data up to 87% faster, accelerating time-to-revenue.
    data-engineering
    b2b
    saas
    infrastructure
    ai
  • Honeydew
    Honeydew
    Y Combinator LogoW2023
    Active • 6 employees • Tel Aviv-Yafo, Israel
    The way people use data is constantly changing. Data teams must support every new context without breaking the shared truth. Honeydew’s semantic layer does it automatically. We validate each change and update every data flow. Using Honeydew, data teams can support 10x more data users - without more engineers or compromising integrity.
    saas
    data-engineering
    analytics
    b2b
  • Versori
    Versori
    Y Combinator LogoW2023
    Active • 16 employees • Manchester, UK
    Orchestrate custom integrations, workflows & agents in hours, not months. Take control of your integration strategy and breathe easy with maintenance on AI Autopilot. For Product Teams: Build better integration libraries. Build a feature-rich integration library, for your users to enjoy. Offer out-of-the-box integrations that work for you and your customers. Embedded IPaaS typically locks you into connector or endpoint limitations. Versori gives you to tools for limitless customisation. Proactive, self healing agents, scan your connected apps for endpoint or schema changes. You get alerted, Versori AI fixes the change. Embed Versori built integrations into your app with the Versori SDK. Flexible to your development approach with advanced user management. For Operations Teams: Get your internal systems speaking the same language. Deliver integrations for new software in days, not months—so you can start unlocking value immediately. Versori’s speed to value reduces typical deployment fees by half—or more. Low code for speed. Full code for control. No more limits from inflexible integration platforms. For GTM & Sales Teams: Say yes to any prospect's integration request. Stop bouncing between teams to get integrations built. With Versori, Sales can go straight to yes. No more escalations or delays. Versori offer fully managed custom-builds, so your customers get exactly what they need, without compromise.
    api
    b2b
    data-engineering
    no-code
    saas
  • Sunpia
    Sunpia
    Y Combinator LogoS2022
    Active • 3 employees • San Jose, CA, USA
    Sunpia lets developers easily experience the cost and speed benefits of serverless infrastructure, without having to rewrite their code. Developers annotate their code and Sunpia automatically designs a microservice version of it they can deploy on their own cloud.
    data-engineering
    developer-tools
    kubernetes
  • Findly
    Findly
    Y Combinator LogoS2022
    Active • 11 employees • London, UK
    Findly.ai is the co-pilot for Business Intelligence that revolutionizes how businesses understand and interact with their data. By creating an engaging chat environment, it empowers decision-makers to gain insights, request reports, and generate visualizations based on their company's metrics. This seamless interaction is made possible by integrating a metric layer that comprehends all your company's metrics. The chat-based exploration simplifies complex data analysis, allowing users to generate comprehensive summaries with a single click, which can be exported to various formats. Furthermore, with the introduction of scheduled chats and action-triggered automations, Findly.ai enhances the autonomy and efficiency of decision-makers. It's more than a tool; it's a decision-making operational system aiming to facilitate decision-makers in achieving their KPIs while spending less time waiting for data.
    generative-ai
    data-engineering
    b2b
    chatbot
    artificial-intelligence
  • IvyCheck
    IvyCheck
    Y Combinator LogoS2022
    Active • 2 employees • Berlin, Germany
    IvyCheck helps you extract hidden insights from your data and ensures high data quality and consistency. Use Generative AI in your data warehouse to transform data at scale.
    b2b
    data-engineering
    databases
    ai
    generative-ai
  • MovingLake
    MovingLake
    Y Combinator LogoS2022
    Active • 3 employees • Mexico City, CDMX, Mexico
    MovingLake is Fivetran for event-driven architectures. Companies such as Casai use our product to obtain orders and price changes in real time.
    analytics
    api
    data-engineering
    b2b
    saas
  • Lamin
    Lamin
    Y Combinator LogoS2022
    Active • 6 employees • Munich, Germany
    Manage data & analyses with an open-source framework. Collaborate across dry & wetlab in a distributed hub. Enable learning at scale through API-first access.
    biotech
    data-engineering
    machine-learning
    developer-tools
    open-source
  • LanceDB
    LanceDB
    Y Combinator LogoW2022
    Active • 20 employees • San Francisco, CA, USA
    LanceDB is a new open-source vector database that can support low-latency billion-scale vector search on a single node. Built around a new columnar data format, LanceDB makes it incredibly easy to build applications for generative AI, recsys, search engines, content moderation, and more.
    aiops
    data-engineering
    machine-learning
    open-source
  • Elementary
    Elementary
    Y Combinator LogoW2022
    Active • 12 employees • Tel Aviv-Yafo, Israel
    Elementary enables data teams to detect problems in their data before their users do. An open-source solution that any data engineer can deploy in minutes without sharing sensitive data.
    data-engineering
    analytics
    developer-tools
    open-source
  • Dynamo AI
    Dynamo AI
    Y Combinator LogoW2022
    Active • 40 employees • San Francisco, CA, USA
    End-to-end privacy, security, and compliance solutions to prepare your organization for emerging AI regulations.
    privacy
    data-engineering
    machine-learning
  • Sieve
    Sieve
    Y Combinator LogoW2022
    Active • 12 employees • San Francisco, CA, USA
    Sieve is the only AI research lab exclusively focused on video data. Video already makes up 80% of internet traffic and has become the dominant medium driving creativity, communication, gaming, AR/VR, and robotics. Unlocking the ability to truly model video is the key to breakthroughs across all of these domains but progress has been bottlenecked by one thing: high-quality training data. That’s where Sieve comes in. We bring together exabyte-scale video infrastructure, novel video understanding techniques, and dozens of diverse data sources to create datasets that push the frontier of video modeling. This unique combination allows us to deliver data with unmatched precision, quality, and speed which has earned the trust of frontier AI labs, Fortune 100 companies, and fast-growing generative AI startups.
    video
    developer-tools
    artificial-intelligence
    data-engineering
    data-labeling
  • Versable
    Versable
    Y Combinator LogoW2022
    Active • 3 employees • Los Angeles, CA, USA
    Auto parts retailers get product data from hundreds of manufacturers that is inaccurate and inconsistent, often with big gaps in key values. They currently have a team of "catalog managers" who are required to process and enhance this data line by line, resulting in a week to months long lag between receiving product data and actually being able to start generating revenue from those products. Versable leverages AI to scan the web for tens of millions of auto parts listings, and uses a fine-tuned LLM with RAG to instantly process, enhance, and transform data. With just a part number, Versable is able to generate market-ready titles, product descriptions, and specs, in any format that's needed.
    automotive
    manufacturing
    data-engineering
    ai
  • Hydra
    Hydra
    Y Combinator LogoW2022
    Active • 6 employees • San Francisco, CA, USA
    Hydra is a real-time analytics database management system for Postgres. We seperate compute from storage to offer software engineers serverless analytics with autoscale, write isolation, automatic caching, and more. Shipping scalable projects on time series and event data has never been easier. Hydra is available for local development, cloud, and bare metal deployment.
    open-source
    data-engineering
    developer-tools
    analytics
  • Trackingplan
    Trackingplan
    Y Combinator LogoW2022
    Active • 17 employees • Barcelona, Spain
    Trackingplan automatically discovers and monitors all the information your applications and websites are collecting, ensuring that you can trust your BI, analytics, marketing, and sales tools. You can think of us as Segment Protocols but totally transparent, where developers can keep using Google Analytics, Amplitude, Hubspot, Intercom, Braze, etc. as they are used to. Installed in minutes in using your Tag Manager or adding just one line of code to your web or apps, we model all the data being sent to third parties. Since Trackingplan understands what each piece of data means, it identifies patterns, detects anomalies, and automatically connects the dots to create value from data that was hidden in plain sight: - An always up-to-date single source of truth and data governance tool. To discover, understand and document your data and improve communication across teams. - Automated notifications when something breaks or changes. To make sure that integrations are always well implemented: Schema errors, traffic anomalies, rogue events... - Easy to understand, customizable, cross-service alerts. To detect trends, insights, and problems without using complex, engineer-oriented solutions.
    data-engineering
    analytics
    saas
  • Pipekit
    Pipekit
    Y Combinator LogoS2021
    Active • 9 employees
    Our app manages Argo Workflows for data teams, enabling complex data & CI pipelines in half the time while saving companies hundreds of thousands of dollars annually. We maintain Argo Workflows, an open-source pipeline framework for Kubernetes that’s used in production by Bloomberg, Intuit, Adobe, New Relic, NVIDIA, and many other open-source early adopters.
    open-source
    developer-tools
    data-engineering
    devops
  • Evidence
    Evidence
    Y Combinator LogoS2021
    Active • 6 employees • Toronto, ON, Canada
    Evidence is an open source, code-based alternative to drag-and-drop BI tools. Build polished data products with just SQL and markdown.
    b2b
    data-engineering
    developer-tools
    analytics
    data-visualization
  • Patterns
    Patterns
    Y Combinator LogoS2021
    Active • 2 employees • San Francisco, CA, USA
    Patterns revolutionizes financial analysis by making it easy and accessible through natural language. We are seeking passionate individuals excited about simplifying financial analytics and transforming business intelligence. If you're interested in joining an innovative team in the finance space, explore our job openings and become part of our mission. Our advanced AI transforms financial data workflows and reporting, surpassing traditional spreadsheets and inflexible SaaS solutions. By integrating state-of-the-art LLMs with autonomous querying and financial reasoning, Patterns empowers practitioners to perform complex analyses effortlessly via a natural language interface.
    data-engineering
    analytics
    data-science
    data-visualization
  • Whaly
    Whaly
    Y Combinator LogoS2021
    Active • 3 employees • Paris, France
    Whaly helps data teams save time on maintenance and analysis building while making business users more autonomous on the analysis they want to improve their decision making. We do this by providing a self service data platform where both data and business teams can work together. We understood that most data teams were ending up being a bottleneck for the rest of the company and needed to give more autonomy to business teams to back their decisions with data. Emilien, Florian and Pierre were the minds behind the Data advertising platforms of the major media and e-commerce companies in France in their earlier position as Product Manager and head of Customer Success, giving them an edge on how to execute successfully a data project.
    data-engineering
  • Whalesync
    Whalesync
    Y Combinator LogoS2021
    Active • 7 employees • Miami, FL, USA
    Whalesync makes data syncing easy. Our automation platform syncs data between key business tools like Webflow/Wix/WordPress and Airtable/Notion/Google Sheets. We give marketing teams two-way, real-time sync, so they can manage their website from their favorite collaboration tools. Whalesync launched during Y Combinator’s S21 cohort. Since then we’ve raised from some of the world’s top investors. We’re now trusted by hundreds of companies like [Ramp](https://ramp.com/), [Webflow](https://webflow.com/), and [Alchemy](https://www.alchemy.com/), and process millions of transactions every day. Many of our customers enjoy the product so much they [tell all their friends](https://whalesync.com/customers).
    data-engineering
    no-code
    saas
    remote-work
    web-development
  • authzed
    authzed
    Y Combinator LogoW2021
    Active • 31 employees • New York, NY, USA
    We build the tools companies need to provide performant and scalable authorization for their applications. We’re founded by 3 successful entrepreneurs with expertise in enterprise software, most recently as leaders at Red Hat. Jake and Joey met on the APIs team at Google in 2010. They went on to found Quay, where Jimmy joined as their first hire. Over the past decade, they’ve changed the landscape for building and deploying software.
    security
    data-engineering
    developer-tools
    saas
    open-source
  • Clear
    Clear
    Y Combinator LogoW2021
    Active • 2 employees • London, UK
    Clear is the free mobile app that helps you track and share your skincare routine. We are fuelling innovation and empowering consumers in the skincare industry via data, technology and community. We were also the 2022 L'Oréal Beauty Tech for Good winners, and were featured under "Best New Apps and Updates" on the App Store in 2023. The skincare industry is worth $200B and social commerce is going to drive the future growth of every brand in the industry. We're going to be fuelling that growth.
    consumer
    digital-health
    marketplace
    data-engineering
  • Waydev
    Waydev
    Y Combinator LogoW2021
    Active • 10 employees • Menlo Park, CA, USA
    Waydev is the Engineering Productivity Intelligence platform that helps companies understand, measure, and improve the performance of their software teams. It connects to the tools engineers already use, then transforms code, work items, reviews, deployments, and AI-assisted activity into a unified source of truth for engineering effectiveness. Built for modern engineering leaders, Waydev delivers deep visibility across delivery speed, code quality, collaboration patterns, and real business impact. With automated insights, conversational analytics, and industry benchmarks, teams can make decisions with clarity, reduce bottlenecks, and continuously improve how software is shipped. Waydev replaces manual reporting, subjective evaluations, and static dashboards. It gives executives, managers, and ICs the context they need to optimize workflows, forecast outcomes, and align engineering output to business results. Waydev powers engineering intelligence for fast-growing startups and global enterprises by turning raw activity into actionable strategy.
    enterprise
    data-engineering
    data-visualization
    artificial-intelligence
    ai-enhanced-learning
  • Prequel
    Prequel
    Y Combinator LogoW2021
    Active • 9 employees • New York, NY, USA
    Prequel makes it easy for companies to share data with their customers. It helps you export data directly to your customer's Snowflake, Redshift, BigQuery, Databricks, or other data warehouse on an ongoing basis.
    data-engineering
    saas
    analytics
  • Polytomic
    Polytomic
    Y Combinator LogoW2020
    Active • 7 employees • San Francisco, CA, USA
    Polytomic is a no-code web app to sync data between your internal databases, business systems (e.g. Stripe, Salesforce, etc), data warehouses, spreadsheets, and even HTTP APIs.
    saas
    b2b
    data-engineering
  • Chaos Genius
    Chaos Genius
    Y Combinator LogoW2020
    Active • 10 employees • San Francisco, CA, USA
    Chaos Genius is a DataOps Observability platform for Snowflake. Enable Snowflake Observability to reduce Snowflake costs and optimize query performance.
    cloud-workload-protection
    machine-learning
    data-engineering
    analytics
    open-source
  • Datafold
    Datafold
    Y Combinator LogoS2020
    Active • 30 employees • New York, NY, USA
    Datafold automates manual work in data engineering. We leverage agentic AI to automate both day-to-day tasks, such as testing and code reviews, and massive one-off projects, such as data platform code migrations. Companies from Perplexity to Disney use Datafold to unlock more value from their data by freeing up their data teams from manual work, accelerating developer velocity, and ensuring data quality.
    data-engineering
    saas
    analytics
    ai
  • Mozart Data
    Mozart Data
    Y Combinator LogoS2020
    Active • 24 employees • San Francisco, CA, USA
    Mozart Data provides an out-of-the-box modern data stack that empowers anyone to easily consolidate, organize, and prepare their data for analysis. Spin up a data stack that’s built on a best-in-class data warehouse and ETL tool in hours, without any engineering. You can finally spend more time on generating insights and less time wrangling your data.
    saas
    b2b
    data-engineering
  • Jitsu
    Jitsu
    Y Combinator LogoS2020
    Active • 4 employees • San Francisco, CA, USA
    Jitsu is the fastest, most durable way to collect event data from every source - web, app, email, chatbot, CRM - into your data warehouse. 100% open-source. Purpose built, secure and ready in minutes.
    data-engineering
    saas
    b2b
    open-source
  • Supabase
    Supabase
    Y Combinator LogoS2020
    Active • 120 employees • San Francisco, CA, USA
    Supabase is the easiest way to get started with Postgres. Each project within Supabase is an isolated Postgres cluster, allowing customers to scale independently, while still providing the features that you need to build: instant database setup, auth, row level security, realtime data streams, auto-generating APIs, and a simple to use web interface. We are 100% remote.
    open-source
    databases
    data-engineering
    big-data
    developer-tools
  • Dataland
    Dataland
    Y Combinator LogoS2020
    Active • 2 employees • New York, NY, USA
    Our AI auto-resolves customer issues with deep accuracy, by plugging into your internal systems, knowledge base, and past ticket resolutions. Works with your existing helpdesk & channels.
    b2b
    data-engineering
    data-visualization
    ai
  • Airbyte
    Airbyte
    Y Combinator LogoW2020
    Active • 90 employees • San Francisco, CA, USA
    Airbyte is the leading open data movement platform that empowers data teams in the AI era by transforming raw data into actionable intelligence. With the largest catalog of over 350 connectors, it offers low-code, no-code, and AI-powered connector development, and provides flexible deployment options across self-hosted, cloud, and hybrid environments. https://github.com/airbytehq/airbyte
    developer-tools
    open-source
    data-engineering
    artificial-intelligence
Loading more companies...