
Agent Context Is Hard. We Fixed It.
Zep is the memory and context layer for AI agents. As a Senior AI Engineer, you'll build low-latency backend systems, operate them in production on AWS, and ship LLM-powered capabilities our customers depend on.
You'll have the opportunity to work on Graphiti (25K+ GitHub stars), Zep’s popular open-source context graph framework.
This is a senior backend role centered on running LLM workloads at significant scale. We're not hiring ML researchers or data scientists. We're hiring engineers who have already lived through the messy reality of taking an LLM application from demo to production.
How we work
We're a small, distributed team that works closely together. We pair on hard problems, review each other's designs, and treat learning as part of the job rather than something that happens after hours. We ask a lot of questions: of customers, of teammates, of our own assumptions. When we find pain, we go fix it.
We expect the same back: ask questions early, push back when you disagree, and care about the people on the other end of the API.
What you'll do
What we're looking for
Nice to have
Tech stack: Go, Python, TypeScript, AWS.
This role is probably NOT a fit if:
We respect your time and keep our interview process tight and focussed.
Screening Call (w/ Daniel, our Founder) → Team Calls (2-3 hours back-to-back, may include a presentation) → Decision Call (Daniel, again)
Zep is the context engineering platform for AI agents. We solve one of the hardest problems in production AI: getting the right context to agents at the right time. Our platform builds context graphs from conversations and business data, then assembles personalized, token-efficient context with sub-200ms retrieval. Three lines of code to production.
We're a seed-stage company (YC W24) with 50% month-over-month ARR growth, 240+ customers including Fortune 500s, and a well-capitalized balance sheet. Graphiti, our open-source temporal context graph engine, has 24,000+ GitHub stars and is becoming foundational infrastructure for agent memory.