

LLMCap
Like
LLMCap is a reverse proxy that enforces real-time, dollar-based spending caps on LLM API calls. When you hit your cap, the next request returns 429 — the token is never consumed and money is never spent.
Cost / License
- Freemium (Subscription)
- Proprietary
Platforms
- Online
- Software as a Service (SaaS)
Features
- AI-Powered
LLMCap News & Activities
Highlights All activities
Recent activities
LLMCap information
No comments or reviews, maybe you want to be first?
What is LLMCap?
LLMCap is a reverse proxy that enforces real-time, dollar-based spending caps on LLM API calls. When you hit your cap, the next request returns 429 — the token is never consumed and money is never spent.
Supports Anthropic, OpenAI, Google Gemini, Mistral, Cohere, and AWS Bedrock. One line of code change: just update your base_url.
Key features:
- Hard caps (not alerts) — enforced at request level, not billing lag
- <35ms added latency
- Per-key, per-provider, per-model caps
- Streaming support
- VS Code extension, CLI, and desktop tray app available
Free 3-day trial. Starter $19/mo, Pro $49/mo.






