and all popular Inference APIs
Compose Prompts like a Pro
Promptmetheus breaks prompts down into LEGO-like blocks for better composability, e.g. Context ⇢ Task ⇢ Instructions ⇢ Samples (shots) ⇢ Primer. You can play with different variations for each section and systematically fine-tune your prompts for minimal cost and maximum performance.
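The block idea can be illustrated with a minimal sketch. This is not Promptmetheus code: the section names follow the order above, while the `BLOCKS` dictionary, the sample texts, and the `compose` and `all_variations` helpers are hypothetical and exist only for illustration.

```python
from itertools import product

# Hypothetical block variants, keyed by section in the order
# Context -> Task -> Instructions -> Samples -> Primer.
BLOCKS = {
    "context": ["You are a support assistant for an online bookstore."],
    "task": ["Classify the customer message by intent."],
    "instructions": [
        "Answer with a single word.",
        "Answer with one lowercase label.",
    ],
    "samples": ["Message: Where is my order?\nIntent: shipping"],
    "primer": ["Intent:"],
}


def compose(parts):
    """Join one chosen text per section into a single prompt string."""
    return "\n\n".join(parts)


def all_variations(blocks):
    """Yield every prompt built by picking one variant per section."""
    for combo in product(*blocks.values()):
        yield compose(combo)


prompts = list(all_variations(BLOCKS))
print(len(prompts))  # one prompt per combination of variants
```

Adding a second variant to any section multiplies the number of candidate prompts, which is why it helps to vary and compare one section at a time.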
Test Prompt Reliability
The Prompt IDE includes a range of tools to evaluate your prompts under various conditions. For instance, Datasets enable rapid iteration with different inputs, while completion Ratings and the respective visual statistics help gauge output quality.
Optimize Prompt Performance
End-to-end performance and reliability of prompt chains (agents) depend heavily on the accuracy of each prompt in the sequence. Errors can compound and compromise the final output. Promptmetheus can help you optimize each prompt in the chain to consistently generate great completions.
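To see how quickly errors compound, here is a rough back-of-the-envelope sketch. It assumes that each prompt in the chain succeeds independently of the others, which real chains may not satisfy; the `chain_success_rate` helper and the accuracy figures are hypothetical.

```python
def chain_success_rate(step_accuracies):
    """Probability that every step succeeds, assuming independent steps."""
    rate = 1.0
    for accuracy in step_accuracies:
        rate *= accuracy
    return rate


# Five prompts in sequence, each assumed to be correct 95% of the time.
five_steps = chain_success_rate([0.95] * 5)
print(f"{five_steps:.3f}")  # prints 0.774
```

Under these assumptions, a chain of five prompts that are each 95% accurate produces a fully correct result only about 77% of the time, so small gains per prompt add up across the chain.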
Traceability
Track the complete history of the prompt design process.
Cost Estimation
Calculate inference costs under different configurations.
Data Export
Export prompts and completions in different file formats.
Analytics
View prompt performance statistics, charts, and insights.

Prompt Chaining
Chain prompts together for advanced tasks and workflows.
Prompt Endpoints
Deploy prompts to dedicated AIPI endpoints.
Data Loaders
Inject external data sources directly into prompts.
Vector Embeddings
Add more context to prompts via vector search.
Models
Claude 3.7 Sonnet
Claude 3.5 Sonnet
Claude 3.5 Haiku
Claude 3 Opus
Claude 3 Sonnet
Claude 3 Haiku
Gemini 2.0 Pro
Gemini 2.0 Flash-Lite
Gemini 2.0 Flash Thinking
Gemini 2.0 Flash
Gemini 1.5 Pro
Gemini 1.5 Flash
DeepSeek R1 Distill Llama 70B
DeepSeek R1 Distill Qwen 32B
Alibaba Qwen QwQ 32B
Alibaba Qwen 2.5 32B
Mistral Saba 24B
Meta Llama 3.3
Meta Llama 3.2
Meta Llama 3.1
Meta Llama 3
Google Gemma 2
DeepSeek R1
DeepSeek V3
Alibaba Qwen 2.5
Alibaba Qwen 2
Meta Llama 3.3
Meta Llama 3.2
Meta Llama 3.1
Meta Llama 2 HF
Mistral Nemo
Google Gemma 2
Microsoft WizardLM
Microsoft Phi 4
“The hottest new programming language is English.”
— Andrej Karpathy
Pricing
Playground
- Forge
- Single user
- Local data storage
- OpenAI LLMs
- Stats & Insights
- Data import / export
- Community support
Single
- IDE
- Single user
- Cloud sync
- All APIs and LLMs
- Stats & Insights
- Projects
- History and full traceability
- Data export
- Standard support
Team
- IDE
- Multiple users
- All Single features, plus
- Shared projects
- Shared prompt library
- Real-time collaboration
- Business support
Pro
- IDE
- Multiple users
- All Team features, plus
- Deploy prompts to AIPI endpoints
- AIPI versioning and monitoring
- Premium support
You can cancel subscriptions at any time.
Subscriptions do not include LLM completion costs; you need to provide your own API keys.
Special pricing is available for students and startups.
What is Prompt Engineering?
What is a Prompt IDE?
How is Promptmetheus different from the OpenAI and Anthropic playgrounds?
How is Promptmetheus different from other prompt engineering tools?
Is there an API or SDK?
Can I use Promptmetheus together with LangChain, LangFlow, and other AI agent builders?
What is the difference between Forge and Archery?
What is an AIPI?
Does Promptmetheus integrate with automation tools like Make, Zapier, IFTTT, and n8n?