{"id":83213,"title":"Weavel - Ape 🐒 Your first AI Prompt Engineer","tagline":"Get fast (7 hours → 10 minutes🔻) \u0026 efficient (70% → 93%🔺)","body":"![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/8f68d0d9-5e48-46ee-852f-a6a841232f11)\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/d1d98093-7792-4954-bd9d-a588f0658a53)\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/85f4ffe2-bf2f-4cb2-966c-2501c2b4fbe2)\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/9cf399d9-e818-4bfb-9da4-42ac9ed3736f)\n\n# **☕️ TL;DR**\n\n* **Ape** is the ultimate **AI prompt engineer** 🐒, designed to optimize your prompts by reducing cost and latency while increasing performance.\n* Ape achieves an **impressive 94.5%** on the GSM8K benchmark, surpassing Vanilla (54.5%), CoT (87.5%) and DSPy (90.0%).\n* **Easy to set up evaluation**: Ape can auto-generate evaluation code and use LLMs as a judge, or you can use your own eval metrics.\n* Get set up in less than 15 minutes and see the difference.\n  * [Schedule a meeting](https://cal.com/team/weavel/weavel-onboarding) to discover more. Let's chat! 🙂\n\n# **🔒 Problem**\n\nYou’re an engineer of an LLM app, trying to get the prompts just right. Every time you type something in, the output changes—so you tweak a word here and there, and it changes again. Sometimes the outputs looks better, sometimes not. But you’re never sure. Hours go by, all spent on prompt engineering.\n\nGetting the outputs you want can feel like an endless game of trial and error. And you’re not alone. Over the past few weeks, we’ve talked to over 100 YC companies, and a lot of them are facing the same challenges:\n\n* **Measuring output quality is hard** (You’re heavily relying on manual evaluations at the moment.)\n* **Prompt engineering does not work as you want** (You hate spending 5-7 hours a day searching for that one great prompt.)\n\n# **🔑 Solution**\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/ddf5778d-78d0-4ff2-a4aa-b84624aeebc0)\n\nWe solve the problem with one simple formula:\n\n```\ngood input + right guidance = better prompts\n```\n\nToday, we launch Ape, your first **AI Prompt Engineer**. Inspired by DSPy, Reflexion, Expel and other research papers, Ape iteratively improves your prompts. Here’s how Ape works:\n\n1️⃣ Log your inputs and outputs to Weavel (with a single line of code!)\n\n2️⃣ Let Ape filter the logs into datasets.\n\n3️⃣ Ape then generates evaluation code and uses LLMs as judges for complex tasks.\n\n4️⃣ \u001dAs more production data is added, Ape continues refining and improving prompt performance.\n\n### **How to use**\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/0c1eeedd-ac08-40e7-ad35-acff6766c2c1)\n\n**Create a Dataset**\n\nChange just one line of code to start logging LLM calls with the Weavel Python SDK. The SDK supports sync/async OpenAI chat completions and OpenAI structured outputs.\n\nYou can also import existing data or manually create a dataset.\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/bdfc79bd-edeb-4dbf-8c9b-7d432007fa3a)\n\n**Create a Prompt**\n\nWrite a prompt that corresponds to your dataset. You can add an existing prompt as the base version, or if you prefer, create a blank prompt and provide a brief description for Ape to create a prompt from scratch.\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/1bfacea3-c2d9-4ac8-90d7-9a6d3e37fd60)\n\n**Optimize Prompts**\n\nTo optimize your prompt using Ape, fill in the necessary information (e.g. JSON schema as you want) and then run the optimization process. An enhanced version of your prompt will be created and available soon.\n\nTa-da! It’s that easy. Ape outperforms with a remarkable 94.5% score on the GSM8K benchmark, surpassing Vanilla (54.5%), CoT (87.5%) and DSPy (90.0%). With Ape, you can optimize the prompt engineering process, saving tons of time and cost while increasing performance.\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/2002273/45b5ec84-f89d-4d57-9e20-ad349f521c97)\n\nApe is **open source**. [Check out our repository on GitHub.](https://github.com/weavel-ai/Ape) (We’d appreciate a star 🌟)\n\n# **🚀 The Team**\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/201595e0-e85e-4931-81aa-a9947754b1b4)\n\nFrom left to right: [**Jun**](https://www.linkedin.com/in/jun-park314/), [**Andrew**](https://www.linkedin.com/in/sounhochung/), and [**HyunJie**](https://www.linkedin.com/in/hyunjiejung/) — together we’re building Weavel.\n\n**Andrew** and **Jun** built 10+ LLM-based products, open-sourced a prompt engineering platform, and co-authored a paper at a NeurIPS workshop last year. **HyunJie** worked on data analytics and optimization at Chartmetric and DevRev, and focused on growth marketing at Liner.\n\n# **🙏 Ask**\n\n* Try Ape! [Schedule](https://cal.com/team/weavel/weavel-onboarding) a walkthrough with the Weavel team or email [hyunjie@weavel.ai](mailto:hyunjie@weavel.ai).\n* Share thoughts on our [Discord](https://weavel.ai/discord) or DM us on [Twitter](https://x.com/weaveldotai).\n* If you know anyone struggling with prompt engineering or evaluations for LLM apps, connect them with us!\n  * Copy \u0026 paste blurb: A YC company named Weavel has developed an AI prompt engineer (Ape in short) which continuously improves your prompts. It’ll save tons of time for you. You can grab a time [here](https://cal.com/team/weavel/weavel-onboarding) for a demo from the founders.\n\n![uploaded image](/media/?type=post\u0026id=83213\u0026key=user_uploads/2002273/d937ab74-d39f-44cb-82f8-8454bf017ed1)\n\n","slug":"Le9-weavel-ape-your-first-ai-prompt-engineer","created_at":"2024-08-20T14:56:52.551Z","updated_at":"2026-05-25T03:16:18.683Z","total_vote_count":111,"url":"https://www.ycombinator.com/launches/Le9-weavel-ape-your-first-ai-prompt-engineer","share_image_url":"https://www.ycombinator.com/media/?type=post\u0026id=83213\u0026key=user_uploads/1564365/bf9fc4fc-d63d-4745-be93-aae22413b64d","company":{"id":29830,"name":"Typa","slug":"typa","url":"https://typa.ai","logo":"https://bookface-images.s3.amazonaws.com/small_logos/9922edca2847f43aae8a7cd38ca5819ed482590a.png","batch":"Summer 2024","industry":"B2B","tags":["Artificial Intelligence","Generative AI","Social Media","Marketing"],"search_path":"https://bookface.ycombinator.com/company/29830"}}