Top 5 Open-Source MCP Gateways for Self-Hosted AI Infrastructure

Compare the top open-source MCP gateways for self-hosted AI infrastructure on performance, governance, token efficiency, and deployment flexibility in 2026.

Teams running production AI agents in 2026 are increasingly choosing open-source MCP gateways for self-hosted AI infrastructure rather than managed services. The reasons are practical: data residency requirements, latency sensitivity in agentic workflows, the need to keep tool credentials inside a private network, and the operational reality that gateway overhead compounds across every LLM call. Bifrost, the open-source AI gateway by Maxim AI, leads this category by delivering microsecond-class latency, native MCP support, and enterprise governance in a single self-hostable binary.

The Model Context Protocol (MCP) ecosystem has scaled past 97 million monthly SDK downloads and over 13,000 public servers as of early 2026, according to Anthropic's announcement of MCP's transfer to the Linux Foundation. The protocol itself, governed under the official MCP specification, defines a JSON-RPC interface for tools, resources, and prompts that any compliant client can call. At that scale, ungoverned tool access stops being a developer-experience problem and becomes a production risk. A self-hosted MCP gateway is the control plane that addresses it.
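To make the JSON-RPC interface concrete, here is a minimal sketch of the request a compliant client sends to invoke a tool. The framing follows the MCP specification's `tools/call` method; the tool name and arguments are invented for illustration.

```python
import json

# Illustrative MCP tool-call request using the JSON-RPC 2.0 framing
# defined by the MCP specification. The tool name and arguments are
# made up for this example.
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "search_docs",
        "arguments": {"query": "rate limits"},
    },
}

payload = json.dumps(request)
print(payload)
```

Any compliant server answers with a JSON-RPC response carrying the tool result, which is what lets a gateway sit transparently between clients and many upstream servers.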

Why Self-Hosting Matters for MCP Gateways

Self-hosted MCP gateways give engineering teams full control over where tool calls execute, where credentials live, and how data crosses network boundaries. Hosted MCP services optimize for onboarding speed but introduce three structural constraints: third-party network hops on every agent call, vendor-side visibility into tool arguments and results, and dependency on someone else's compliance posture for SOC 2, HIPAA, or GDPR audits.

Open-source MCP gateways for self-hosted AI infrastructure address each of those constraints directly:

  • Data sovereignty: tool arguments, results, and audit trails stay inside the team's network or VPC.
  • Predictable latency: removing the hop to a managed service eliminates a variable contributor to P99 latency in multi-step agent workflows.
  • License transparency: an open-source core means the gateway code is auditable and the upgrade path is not gated by vendor pricing.
  • Operational fit: the gateway runs on the same Kubernetes, Docker, or bare-metal infrastructure already in use, alongside existing identity and observability stacks.

Key Criteria for Evaluating Self-Hosted MCP Gateways

Before listing the top open-source MCP gateways for self-hosted AI infrastructure, here is the evaluation framework production teams use:

  • Gateway overhead: how many microseconds the gateway adds per request, measured under sustained load.
  • MCP feature completeness: support for STDIO, HTTP, and SSE transports; OAuth 2.0 with token refresh; tool filtering; and code-execution-based token optimization.
  • Governance depth: virtual keys or equivalent scoped credentials, per-consumer budgets and rate limits, RBAC, and audit logs that satisfy SOC 2 and HIPAA reviewers.
  • Deployment model: Docker, Kubernetes, in-VPC, and air-gapped support for regulated workloads.
  • License posture: a permissive open-source license for the core, with optional enterprise add-ons for advanced governance.
  • Ecosystem fit: integrations with Claude Code, Cursor, Codex CLI, Gemini CLI, and other MCP-compatible clients teams already use.

1. Bifrost

Bifrost is a high-performance, open-source AI gateway built in Go by Maxim AI. Among the open-source MCP gateways for self-hosted AI infrastructure, it is the only option that operates as both an LLM gateway and an MCP gateway in a single binary. One self-hosted deployment handles model routing, tool discovery, governance, execution, and exposure to clients like Claude Desktop, Cursor, Claude Code, and custom agents.

Bifrost is available on GitHub under Apache 2.0 and installs in 30 seconds:

npx -y @maximhq/bifrost

Bifrost MCP gateway capabilities

  • Acts as both an MCP client and server, connecting to upstream tool servers via STDIO, HTTP, and SSE, then exposing them through a single /mcp endpoint.
  • Code Mode replaces the entire tool catalog with four meta-tools and lets the model write Starlark (a Python dialect) to orchestrate work in a sandbox. The pattern follows Anthropic's published engineering work on code execution with MCP, which documented context dropping from 150,000 to 2,000 tokens on a representative agent workflow. Bifrost benchmarks show input token reductions of up to 92% at 500-tool scale with no accuracy loss.
  • Agent Mode handles autonomous tool execution with configurable auto-approval, so agents can chain calls without per-step prompts.
  • OAuth 2.0 with PKCE, automatic token refresh, and dynamic client registration for upstream MCP server connections.
  • Tool filtering per virtual key, so each consumer sees only the MCP tools they are authorized to use.
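The arithmetic behind catalog-level token savings is simple: a model that sees four meta-tool schemas instead of hundreds of full tool schemas carries far less context per request. A back-of-the-envelope sketch with illustrative numbers (the per-schema token figures are assumptions, not Bifrost's benchmark methodology):

```python
# Back-of-the-envelope comparison of advertising a full tool catalog
# vs. exposing a few meta-tools, as in code-execution-based designs.
# Token figures are illustrative assumptions, not measured values.
TOOLS = 500
TOKENS_PER_SCHEMA = 300         # assumed average tokens per tool schema
META_TOOLS = 4
TOKENS_PER_META_SCHEMA = 400    # meta-tools carry richer descriptions

full_catalog = TOOLS * TOKENS_PER_SCHEMA            # 150,000 tokens
meta_catalog = META_TOOLS * TOKENS_PER_META_SCHEMA  # 1,600 tokens

reduction = 1 - meta_catalog / full_catalog
print(f"{reduction:.1%}")  # ~98.9% with these assumed figures
```

Real savings depend on schema sizes and how much tool output the orchestration code keeps out of the model's context, which is why published numbers vary by workload.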

Performance and governance

Published benchmarks report 11 microseconds of overhead at 5,000 RPS, sub-3ms latency on MCP operations, and stateless operation that scales horizontally without sticky sessions. The Bifrost MCP gateway resource page covers the full architecture in detail.

Governance is built around virtual keys: scoped credentials that combine access control, rate limits, budgets, and tool-level permissions. Every tool execution is a first-class log entry with tool name, server, arguments, result, latency, and the triggering virtual key. Bifrost Enterprise adds clustering, in-VPC deployments, vault support, and audit logs aligned with SOC 2, GDPR, and HIPAA requirements.
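To make the virtual-key model concrete, here is a hypothetical sketch of the kind of scoped credential a gateway might enforce. The field names and checks are illustrative, not Bifrost's actual schema.

```python
from dataclasses import dataclass

@dataclass
class VirtualKey:
    # Hypothetical scoped credential; fields are illustrative only.
    key_id: str
    allowed_tools: set
    rpm_limit: int            # requests per minute
    monthly_budget_usd: float
    spent_usd: float = 0.0
    calls_this_minute: int = 0

    def authorize(self, tool: str, est_cost_usd: float) -> bool:
        """Allow the call only if tool, rate, and budget checks all pass."""
        if tool not in self.allowed_tools:
            return False
        if self.calls_this_minute >= self.rpm_limit:
            return False
        if self.spent_usd + est_cost_usd > self.monthly_budget_usd:
            return False
        self.calls_this_minute += 1
        self.spent_usd += est_cost_usd
        return True

key = VirtualKey("vk_demo", {"search_docs"}, rpm_limit=60, monthly_budget_usd=10.0)
print(key.authorize("search_docs", 0.01))  # True: in scope and under limits
print(key.authorize("delete_repo", 0.01))  # False: tool not in allowlist
```

The point of folding access control, rate limits, and budgets into one credential is that every tool execution can be attributed to a single accountable consumer in the audit log.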

The full Bifrost MCP gateway launch post walks through access control, cost governance, and the 92% token reduction benchmark in production.

Best for: Teams that need both MCP and LLM traffic flowing through one self-hosted gateway, with code-execution-based token optimization, enterprise governance, and a fully open-source core.

2. Docker MCP Gateway

Docker MCP Gateway is Docker's open-source project for orchestrating MCP servers as containerized workloads. It runs each MCP server inside an isolated container with limited privileges, network access, and resource quotas, which appeals to security-focused teams that already standardize on Docker.

Capabilities

  • CLI-driven configuration via docker mcp commands, with profiles that scope which tools are visible to each client.
  • Generic interceptors for cross-cutting security: signature verification on container images, secret scanning on inbound and outbound payloads, and call logging.
  • Available out of the box in Docker Desktop and as a standalone open-source project on GitHub.
  • Integrates with Docker Compose, so MCP server fleets can be defined declaratively alongside the rest of an application stack.
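As an illustration of what a payload-scanning interceptor does, here is a generic sketch: reject tool payloads that look like they contain credentials before forwarding them. The patterns are illustrative and do not reflect Docker MCP Gateway's actual rules.

```python
import re

# Generic secret-scanning interceptor sketch: flag payloads that appear
# to contain credentials before they reach an upstream tool. Patterns
# are illustrative, not Docker MCP Gateway's actual implementation.
SECRET_PATTERNS = [
    re.compile(r"AKIA[0-9A-Z]{16}"),                   # AWS access key shape
    re.compile(r"ghp_[A-Za-z0-9]{36}"),                # GitHub token shape
    re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----"), # PEM private key
]

def contains_secret(payload: str) -> bool:
    """Return True if the payload matches any known secret pattern."""
    return any(p.search(payload) for p in SECRET_PATTERNS)

print(contains_secret('{"query": "list open issues"}'))      # False
print(contains_secret('{"token": "ghp_' + "a" * 36 + '"}'))  # True
```

Running this kind of check on both inbound arguments and outbound results is what lets a gateway catch a compromised tool exfiltrating credentials, not just a careless prompt.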

Limitations for AI infrastructure

Docker MCP Gateway focuses on container orchestration and security. It does not include LLM routing, virtual-key governance, or code-execution-based token optimization. Teams that adopt it typically pair it with a separate LLM gateway and identity layer. Performance characteristics depend on the underlying Docker runtime rather than the gateway itself.

Best for: Container-native teams that want strong MCP server isolation and image provenance guarantees, and that are willing to compose Docker MCP Gateway with other infrastructure for LLM routing and enterprise governance.

3. Obot

Obot, maintained by Acorn Labs, is an open-source platform that combines an MCP gateway with hosting, a registry, and a built-in chat client. It targets organizations that want a single self-hosted package to cover the full MCP lifecycle, from server publishing to access control to end-user consumption.

Capabilities

  • Deploys on Docker or Kubernetes with a single container image.
  • Manages MCP server hosting alongside gateway functions, so internal teams can publish servers to a private catalog.
  • Enforces authentication, authorization, and full audit logging for every request, with enterprise support for Okta and Microsoft Entra in the paid tier.
  • Network egress policies on MCP servers reduce the risk of data exfiltration from compromised tools.
  • Includes a chat UI that lets non-developers interact with approved MCP tools without setting up a separate client.

Trade-offs

The breadth of the platform is its strength and its cost. Teams that only need a gateway and already have hosting figured out will inherit components they do not use. The ecosystem is younger than Docker's or Kong's, so community plugins are still maturing. Performance is acceptable for most enterprise workloads but is not the differentiator.

Best for: Mid-sized organizations that want a unified self-hosted MCP environment (hosting plus gateway plus chat) with built-in governance, and that prefer one vendor for the full stack.

4. MCPJungle

MCPJungle is a lightweight, self-hosted MCP gateway written for developers and teams that want a single registry and gateway without heavy infrastructure prerequisites. It runs as a single binary or via Docker Compose, with a development mode for local use and an enterprise mode for shared team deployments.

Capabilities

  • Single endpoint at /mcp that aggregates every registered MCP server, accessible to any MCP-compatible client.
  • Tool Groups: curated subsets of tools exposed via group-specific endpoints, so teams can reduce client-side tool counts when registries grow large.
  • Enterprise mode adds RBAC, access control, and OpenTelemetry metrics out of the box.
  • Multi-transport support for STDIO and Streamable HTTP, covering the most common MCP server topologies.

Trade-offs

MCPJungle prioritizes simplicity over feature breadth. There is no LLM gateway, no semantic caching, and no code-execution-based token optimization, so token costs scale linearly with tool count. The project's documentation has matured significantly through 2025 and 2026, but the contributor base is smaller than Docker's or IBM's.

Best for: Small to mid-sized teams that want a self-hosted MCP gateway with minimal operational overhead and are comfortable composing additional infrastructure for LLM routing and advanced governance.

5. IBM ContextForge

ContextForge is IBM's open-source MCP gateway framework, hosted under IBM's GitHub organization and built on FastAPI. It targets large enterprises with multi-region requirements, where multiple gateway instances need to discover each other and federate tool registries across geographic boundaries.

Capabilities

  • Multi-cluster federation: gateway instances automatically share tool registries, supporting distributed enterprise deployments without manual synchronization.
  • Multi-protocol support including MCP, A2A, REST-to-MCP, and gRPC-to-MCP, which lets teams expose existing internal APIs as MCP tools without rewriting them.
  • Built-in LLM proxy supporting major providers, plus a plugin architecture covering more than 40 transports and integrations.
  • OpenTelemetry observability with native support for Phoenix, Jaeger, and Zipkin.
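The REST-to-MCP idea can be sketched as deriving an MCP-style tool definition from an existing HTTP endpoint description, so the API becomes callable by any MCP client without a rewrite. This is a conceptual sketch with invented names, not ContextForge's actual API.

```python
# Conceptual REST-to-MCP wrapping: build an MCP-style tool definition
# from a REST endpoint description. Function and field names are
# invented for illustration and do not reflect ContextForge's API.
def rest_to_mcp_tool(name: str, method: str, path: str, params: dict) -> dict:
    return {
        "name": name,
        "description": f"{method} {path}",
        "inputSchema": {
            "type": "object",
            "properties": {p: {"type": t} for p, t in params.items()},
            "required": list(params),
        },
    }

tool = rest_to_mcp_tool("get_order", "GET", "/v1/orders/{id}", {"id": "string"})
print(tool["inputSchema"]["required"])  # ['id']
```

At call time the gateway would map the tool's arguments back onto the HTTP request (path parameters, query string, body), which is the translation layer that protocol-bridging gateways provide.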

Trade-offs

ContextForge is the most architecturally ambitious gateway in this list, which translates to deployment complexity. Production rollouts typically require dedicated platform engineering teams and weeks to months of integration work. Enterprise governance features are less complete than the federation capabilities.

Best for: Global enterprises with mature platform engineering teams, multi-region Kubernetes infrastructure, and the resource budget to invest in a federation-first MCP architecture.

How These Self-Hosted MCP Gateways Compare

For teams evaluating open-source MCP gateways for self-hosted AI infrastructure, the practical differences come down to four questions:

  • Is one gateway handling both LLM routing and MCP, or are they separate planes? Bifrost is the only option in this list that unifies both. The others handle MCP only or include a basic LLM proxy as a secondary feature.
  • Does the gateway address token cost at the protocol level? Code Mode is unique to Bifrost among open-source self-hosted gateways at the time of writing, with a public 92% token reduction benchmark at 500-tool scale.
  • What governance ships with the open-source core? Bifrost ships virtual keys, tool filtering, audit logs, and rate limits in the open-source build. The other gateways either gate governance behind enterprise tiers or expect teams to assemble it from external components.
  • What does deployment day one look like? Bifrost and MCPJungle both stand up in minutes. Docker MCP Gateway is fast on existing Docker infrastructure. Obot adds a few more components but is still single-image. ContextForge requires real Kubernetes investment.

For a deeper feature matrix and capability comparison, the LLM Gateway Buyer's Guide covers the criteria enterprise procurement teams use when evaluating gateways for production agentic AI infrastructure.

Getting Started with Bifrost

Among open-source MCP gateways for self-hosted AI infrastructure, Bifrost is the default choice for teams that need microsecond-class latency, native MCP support with Code Mode, enterprise governance, and unified LLM routing in a single self-hostable binary. Installation takes under 30 seconds (npx -y @maximhq/bifrost or a single Docker container), MCP servers register through the built-in web UI, and existing OpenAI, Anthropic, AWS Bedrock, or LiteLLM SDK code works as a drop-in by changing only the base URL.

To see Bifrost handling production agent traffic at scale, book a Bifrost demo.