Simple, Honest Pricing

Start free with the open-source edition. Upgrade when you need managed hosting, support, or enterprise compliance features.

Monthly Annual Save 20%
OPEN SOURCE
Free forever
Self-hosted. Apache 2.0 license. You own the deployment and all the data. No call-home, no usage tracking.
  • All 13 pipeline stages
  • 17 LLM providers
  • PHI/PII redaction (15 entity types)
  • Prompt injection & guardrails
  • Semantic caching + compression
  • 6 routing strategies + circuit breaker
  • Agentic session governance
  • Prometheus + OpenTelemetry
  • Immutable audit logging
  • MCP agent support
  • Managed hosting
  • Managed upgrades
  • HIPAA BAA
  • Priority support
Get on GitHub →
ENTERPRISE
Custom
Unlimited scale, HIPAA BAA, on-premise support, and a dedicated engineering relationship.
  • Everything in Professional
  • Unlimited requests
  • HIPAA BAA included
  • On-premise or private VPC deployment
  • Data residency enforcement
  • 99.9% uptime SLA
  • Dedicated support engineer
  • Custom PHI entity types
  • Custom guardrail rules
  • SSO / SAML integration
  • Security review & penetration testing
  • Annual compliance reporting
Contact Sales
HEALTHCARE COMPLIANCE ADD-ON
+$199/month
Add-on to Professional or Enterprise. Purpose-built for HIPAA-covered entities handling patient data through AI workflows.
HIPAA BAA signed
PHI audit trail (7-year retention)
Clinical output scanning
Data residency enforcement
HL7/FHIR-aware patterns
Quarterly compliance review
Learn More

Full Feature Comparison

Feature Open Source Professional Enterprise

Frequently Asked Questions

Can I self-host for free? +
Yes. The full source code is available under the Apache 2.0 license on GitHub. You can deploy it anywhere — your own servers, Docker, Kubernetes, or any cloud provider — completely free, with no restrictions for commercial use.
Does the Professional plan include a HIPAA BAA? +
The Professional plan does not include a HIPAA BAA by default. You need the Enterprise plan or the Healthcare Compliance Add-On for a signed BAA. If you are a covered entity handling PHI, contact us to discuss your specific requirements.
What counts as a "request"? +
One request is one call to the gateway's /v1/chat/completions endpoint. Streaming responses count as one request. Cache hits count as one request (but save the LLM cost entirely). There is no per-token billing from us — you pay your LLM provider directly.
Can I switch from self-hosted to managed without downtime? +
Yes. Because the gateway is a drop-in proxy, you simply point your base_url to the new managed endpoint. All your API keys, configuration, and provider integrations migrate with a config file import — no code changes in your applications.
Do you store my LLM responses? +
In the managed plans, audit logs are stored (encrypted at rest) to provide the immutable trail needed for compliance. The semantic cache stores a hash of the request and the response — never the raw PHI. You can disable caching for PHI traffic in your config. On self-hosted, you control all storage.
What LLM providers do you support? +
17 providers: OpenAI, Anthropic, Azure OpenAI, AWS Bedrock, Google Gemini, Groq, Mistral, DeepSeek, Cohere, Fireworks AI, HuggingFace, Ollama (local), Perplexity, Replicate, Together AI, xAI (Grok), and AI21. New providers are added regularly — open a GitHub issue to request one.

Talk to Us

Evaluating the gateway for your organization? We're happy to run a proof of concept, answer compliance questions, or demo PHI redaction on your actual use case.

Contact Information

✉️
⏱️
Response Time
Within 1 business day for all inquiries
🐙
Open Source Issues
GitHub Issues

WHY TEAMS CHOOSE ENTERPRISE

HIPAA BAA signed within 48 hours
Dedicated Slack channel with your team
On-site or video onboarding
Custom PHI patterns for your workflows
99.9% SLA with credits for downtime