Zep AI

Agent Context Is Hard. We Fixed It.

Lead Forward Deployed Engineer

$175K - $250K | 0.50% - 1.50% | San Francisco, United States / Remote (US)
Job type: Full-time
Role: Engineering, Full stack
Experience: 6+ years
Visa: US citizen/visa only
Skills: Python, TypeScript, LLMs, AI Agents
Daniel Chalef
Founder

About the role

Zep is the memory and context layer for AI agents. As Lead Forward Deployed Engineer, you'll embed with customer engineering teams to integrate Zep into their production agent systems: diagnosing context-quality failures, designing memory architectures around their data, and shipping the integrations that make their agents actually work in the wild.

This is an applied AI engineering role with a customer surface. We're not looking for ML researchers or data scientists. We're looking for engineers who have already lived through the messy reality of taking an agent from demo to production.

How we work

We're a small, distributed team that works closely together. We pair on hard problems, review each other's designs, and treat learning as part of the job rather than something that happens after hours. We ask a lot of questions: of customers, of teammates, of our own assumptions. When we find pain, we go fix it.

We expect the same back: ask questions early, push back when you disagree, and care about the people on the other end of the API.

What you'll do

  • Own end-to-end delivery for strategic deployments: scope, design, build, rollout, stabilize.
  • Embed with customer engineers to integrate Zep into real systems: data, APIs, auth, infra.
  • Ship production code: integrations, reference implementations, performance and reliability fixes.
  • Help level up the FDE function: coach newer FDEs on execution, review designs and code when useful, and capture repeatable patterns.

What we're looking for

  • 6+ years of production engineering. You can own both architecture and implementation, and you've shipped systems that real customers depend on.
  • Hands-on AI agent / LLM application experience. You've shipped a non-trivial agentic system to production: not a prototype, and not a thin wrapper over a chat-completion API. We expect concrete examples: multi-turn agent loops with tool calling, retrieval and context pipelines you tuned against real failures, eval harnesses you built to catch regressions, or production memory and state systems for agents.
  • Working familiarity with the agent ecosystem: at least one of LangChain / LlamaIndex / model-provider SDKs, vector stores (pgvector, Pinecone, Weaviate), and eval tooling (Braintrust, LangSmith, custom harnesses).
  • Experience across diverse customer technology stacks and cloud platforms (AWS or GCP). Proficiency with Docker and networking fundamentals.
  • Fast debugging and strong operational instincts in complex, real-world environments.
  • Leadership through hands-on work; excellent communication for customer sessions and coaching junior engineers.

Tech stack: Python, TypeScript, AWS or GCP, Docker.

This role is probably NOT a fit if:

  • Your LLM experience is single-turn chat completions or RAG-as-a-feature.
  • You're an ML researcher or model trainer looking to move into agents — this role is for engineers already deep in agent production.
  • You haven't worked directly with customers on integration or delivery.

About the interview

We respect your time and keep our interview process tight and focused.

Screening Call (w/ Daniel, our Founder) → Team Calls (2-3 hours back-to-back, may include a presentation) → Decision Call (Daniel, again)

About Zep AI

Zep is the context engineering platform for AI agents. We solve one of the hardest problems in production AI: getting the right context to agents at the right time. Our platform builds context graphs from conversations and business data, then assembles personalized, token-efficient context with sub-200ms retrieval. Three lines of code to production.

We're a seed-stage company (YC W24) with 50% month-over-month ARR growth, 240+ customers including Fortune 500s, and a well-capitalized balance sheet. Graphiti, our open-source temporal context graph engine, has 24,000+ GitHub stars and is becoming foundational infrastructure for agent memory.

Zep AI
Founded: 2023
Batch: W24
Team Size: 5
Status: Active
Location: San Francisco
Founders: Daniel Chalef