AI Control Plane Gateway

Capabilities

Everything a governed gateway needs

Built for teams that need control, not just a proxy.

HIPAA PHI Redaction

15 entity types including MRN, NPI, ICD-10, CPT codes, drug names, and provider identities. Full round-trip: redact before the LLM sees it, restore in the response. Session-scoped deterministic tokenization keeps the same entity mapped to the same token across a 50-turn agent run.

Healthcare-Grade

INPUT Patient John Smith, MRN 123456789, prescribed Lisinopril by Dr. Sarah Johnson NPI 1234567890

↓

TO LLM Patient [NAME_1], MRN [MRN_1], prescribed [DRUG_1] by [PROVIDER_1] NPI [NPI_1]

Guardrails: Regex + ML

35+ regex patterns for jailbreaks, token smuggling, and prompt extraction, plus an optional pluggable ML classifier layer for paraphrased attacks that regex misses.

New

20+ LLM Providers

OpenAI, Anthropic, Azure OpenAI, AWS Bedrock, Google Gemini, Groq, Mistral, DeepSeek, Cohere, Fireworks, Together AI, Ollama, xAI, Perplexity, and more. Switch providers with zero code changes.

Multi-Provider

Live Model & Pricing Registry

Pricing auto-refreshes from a maintained source with a local override for negotiated rates — no more hand-editing a stale price table when a provider ships a new model.

New

Semantic Caching + Embeddings

/v1/embeddings now runs through the full governance pipeline, and backs an embedding-aware cache layer that catches true paraphrases, not just near-identical strings — 40%+ cost reduction on repetitive workloads.

Cost Optimization

FinOps Cost Attribution & Showback

Per-team, per-user, per-feature, and per-agent-run cost breakdowns with CSV export and anomaly attribution — see exactly which workflow caused the spend spike.

New

Shadow-MCP Detection & Kill Switch

Discover tool servers your agents are calling that were never registered, block unsanctioned calls automatically, and cut off a runaway agent session or MCP server with one action.

New

Agentic Session Governance

Sub-agent depth limiting, per-run cost caps, infinite-loop detection via fingerprinting, and consistent PHI redaction across every turn of an agent session.

AI Agents

Anomaly Detection & Observability

Prometheus metrics, OpenTelemetry traces, and a hash-chained tamper-evident audit log. EMA-based anomaly detection on spend and error-rate spikes, delivered via Slack, MS Teams, PagerDuty, or email.

Observability

Six Routing Strategies

Round-robin, weighted, least-latency, cost-optimized, quality-scored, and fallback — with a circuit breaker per provider and data-residency enforcement for PHI traffic.

Reliability

Azure Entra ID SSO

RS256/ES256 JWT validation against Microsoft's JWKS endpoint, App Role enforcement, and multi-tenant support — no separate API keys required for org members.

Enterprise Auth

Prompt Management + Evals

Versioned prompt templates served by the gateway, with built-in A/B testing and lightweight LLM-as-judge evals reusing the same experiments engine.

New

Architecture

Runs inside your perimeter

Self-host on your own infrastructure, or let us run it — your data path is the same either way.

Clients

OpenAI SDK

REST / HTTP

MCP Agents

Admin Console

CI/CD Pipeline

Any OpenAI-compatible SDK

AI Control Plane Gateway

Auth · Rate Limit · Policy

Budget · Guardrails · PHI/PII · A/B

Compression · Semantic Cache

Routing · LLM Call · Output · Audit

Standard HTTPS

LLM Providers

OpenAI

Anthropic

Azure

Bedrock

Google

Groq

Mistral

DeepSeek

+ 9 more

Infrastructure

Prometheus

OpenTelemetry

Hash-Chained Audit

Semantic Cache

Budget Store

Alerting

Use Cases

Built for regulated and agentic workloads

Healthcare

HIPAA-grade AI workflows with PHI redaction across 15 entity types. Output guardrails scan LLM responses for PHI leakage before they reach your application.

PHI never reaches the LLM
Output content guardrails
Data residency enforcement
Hash-chained audit trail

Financial Services

PII redaction for customer data, budget controls per team and product line, a policy engine to block regulated content, and a full audit trail.

PII/PCI data redaction
Per-product budget caps
Regulatory policy enforcement
Audit logs for SOC 2

Legal & Professional Services

Prevent confidential client data from reaching public LLM APIs. Route sensitive matters to on-premise models while using cloud LLMs for non-sensitive work.

Data classification routing
On-premise model support
Client confidentiality
Matter-level budget caps

Agentic AI Platforms

MCP tool-call governance, shadow-server detection, and a per-session kill switch for teams running fleets of autonomous agents against real infrastructure.

Unsanctioned MCP server detection
Agent inventory + spend per run
Sub-agent depth & loop limits
One-click session kill switch

Three Ways to Run It

One engine, three editions

Same governance pipeline underneath. Choose who manages the infrastructure.

Open Source

Self-Hosted

Free forever

Apache 2.0. Run the full pipeline on your own infrastructure — you manage everything.

All 13 pipeline stages
20+ LLM providers
PHI/PII redaction & guardrails
Full source, no telemetry

Get on GitHub

Cloud

Managed, Self-Serve

From $0 /month

We run the infrastructure. Sign up, get an API key, start routing traffic in minutes.

Free tier, no credit card
Hosted portal & dashboards
FinOps showback per team
Usage-based billing via Stripe

Start Free →

Enterprise

On-Prem / VPC

Custom

Install inside your own VPC or data center. No usage billing — flat annual license.

Full admin portal, self-hosted
Entra ID SSO on day one
HIPAA BAA available
Dedicated deployment support

Talk to Sales

One gateway.
Complete AI control.

Thirteen stages, one request

Everything a governed gateway needs

HIPAA PHI Redaction

Guardrails: Regex + ML

20+ LLM Providers

Live Model & Pricing Registry

Semantic Caching + Embeddings

FinOps Cost Attribution & Showback

Shadow-MCP Detection & Kill Switch

Agentic Session Governance

Anomaly Detection & Observability

Six Routing Strategies

Azure Entra ID SSO

Prompt Management + Evals

Governance is nearly free

Runs inside your perimeter

Built for regulated and agentic workloads

Healthcare

Financial Services

Legal & Professional Services

Agentic AI Platforms

One engine, three editions

Self-Hosted

Managed, Self-Serve

On-Prem / VPC

See it live

Priced against the market, not against you

One gateway. Complete AI control.

Thirteen stages, one request

Everything a governed gateway needs

HIPAA PHI Redaction

Guardrails: Regex + ML

20+ LLM Providers

Live Model & Pricing Registry

Semantic Caching + Embeddings

FinOps Cost Attribution & Showback

Shadow-MCP Detection & Kill Switch

Agentic Session Governance

Anomaly Detection & Observability

Six Routing Strategies

Azure Entra ID SSO

Prompt Management + Evals

Governance is nearly free

Runs inside your perimeter

Built for regulated and agentic workloads

Healthcare

Financial Services

Legal & Professional Services

Agentic AI Platforms

One engine, three editions

Self-Hosted

Managed, Self-Serve

On-Prem / VPC

See it live

Priced against the market, not against you

One gateway.
Complete AI control.