
Subagents and tools that improve coding agents
Goal: 99.99% uptime
We serve custom inference stacks that have irregular GPU load.
We're looking for people who have done genuinely amazing infrastructure work and want a challenge: working with both traditional infrastructure (load balancers such as NLBs, etc.) and very different infrastructure built around inference engines and GPU loads.
This role requires deep experience with inference engines.
Contributions to vLLM, SGLang, TensorRT-LLM, or other inference frameworks are a plus.
Every role at Morph comes with unlimited tokens on Claude Code and Codex.
MorphLLM is building Fast Apply models: get changes from Claude/Gemini into your code FAST.