Bifrost pricing

Compare Bifrost OSS and Enterprise pricing for an open-source AI gateway with drop-in SDK compatibility, automatic fallbacks, observability, MCP Gateway, and enterprise governance.

OSS and Enterprise

Pricing Plans

Bifrost pricing is split into a free open-source plan and a custom-priced enterprise plan.

Bifrost OSS is free forever for developers, small teams, and self-managed deployments. Bifrost Enterprise is for teams running production AI systems at scale with private deployment, governance, identity, compliance, and support requirements.

  • Free foreverOSS. Self-hosted with Docker, Kubernetes, or Go binary. Includes drop-in replacement, OpenTelemetry-compatible metrics and traces, built-in observability, budget management, virtual keys, routing rules, Bifrost CLI, automatic fallbacks, caching, MCP Gateway with Code Mode, prompt repository, custom plugins, Maxim AI integration, docs, and community support. [GitHub] [Docs]
  • Custom pricingEnterprise. Includes everything in OSS plus guardrails, cluster mode, adaptive load balancing, enterprise SSO via SAML and OIDC, vault support, MCP with federated auth, log exports, audit logs, role-based access control, SLA-backed enterprise support, and enterprise-ready VPC, on-prem, and air-gapped deployment. [Book a Demo] [Enterprise]

Free forever

Bifrost OSS

The OSS plan is meant for developers, small teams, and self-managed deployments that want a high-performance LLM gateway without a commercial license.

It includes the core routing, fallback, observability, virtual key, budget, cache, MCP, CLI, plugin, and drop-in replacement capabilities needed to run Bifrost locally or in self-managed infrastructure. [GitHub] [Quickstart docs] [Bifrost CLI resource]

Custom pricing

Bifrost Enterprise

The Enterprise plan adds production controls for organizations standardizing AI usage across teams, applications, and regulated environments.

Enterprise capabilities focus on private deployment, identity provider integration, RBAC, auditability, guardrails, adaptive load balancing, clustering, vault-backed secrets, log exports, federated MCP authentication, and commercial support.

  • Private deployment. Deploy Bifrost in VPC, on-premise, air-gapped, or multi-cloud environments. [Enterprise deployment]
  • Governance. Use SSO, RBAC, virtual keys, budgets, audit logs, and policy controls for production AI access. [Governance resource]
  • Guardrails. Add enterprise safety controls for content moderation, PII handling, and policy enforcement. [Guardrails resource]

Capability matrix

OSS vs Enterprise Feature Comparison

OSS includes the core open-source gateway capabilities. Enterprise adds private deployment, identity, governance, compliance, reliability, and support controls for production organizations.

CapabilityOSSEnterprise
Single OpenAI-compatible APIIncludedIncluded
Drop-in ReplacementIncludedIncluded
OpenTelemetryIncludedIncluded
Built-in ObservabilityIncludedIncluded
Budget and Rate LimitsIncludedIncluded
Virtual KeysIncludedIncluded
Custom Routing Rules and FlowsIncludedIncluded
Bifrost CLIIncludedIncluded
MCP GatewayIncludedIncluded
MCP Code ModeIncludedIncluded
Semantic CachingIncludedIncluded
Prometheus MetricsIncludedIncluded
Prompt RepositoryIncludedIncluded
Custom Plugin DevelopmentIncludedIncluded
GuardrailsNot includedIncluded
ClusteringNot includedIncluded
Adaptive Load BalancingNot includedIncluded
SAML, OpenID, and Audit ReportsNot includedIncluded
Vault SupportNot includedIncluded
Log ExportsNot includedIncluded
In-VPC DeploymentsNot includedIncluded
Identity Providers like Okta and EntraNot includedIncluded
Role-Based Access ControlNot includedIncluded
MCP with Federated AuthNot includedIncluded
Audit LogsNot includedIncluded
External OTel ConnectorsNot includedIncluded
Commercial onboarding and production supportCommunityIncluded

Production requirements

When Enterprise Is The Right Fit

  • Private networking and deployment. Choose Enterprise when Bifrost must run inside private cloud infrastructure with network isolation and stronger security boundaries.
  • Advanced governance. Choose Enterprise when identity provider integration, Okta or Entra support, RBAC, audit logs, vault-backed key management, and compliance workflows are required.
  • Enterprise-grade reliability. Choose Enterprise when production AI systems need guardrails, adaptive load balancing, clustering, zero-downtime deployment support, log exports, and SLA-backed support.

Open Source & Enterprise

OSS Features

  • 01Model Catalog. Access 8+ providers and 1000+ AI models through a unified interface. Also supports custom deployed models.
  • 02Budgeting. Set spending limits and track costs across teams, projects, and models.
  • 03Provider Fallback. Automatic failover between providers ensures 99.99% uptime for your applications.
  • 04MCP Gateway. Centralize all MCP tool connections, governance, security, and auth. Your AI can safely use MCP tools with centralized policy enforcement. [MCP Gateway resource]
  • 05Virtual Key Management. Create different virtual keys for different use cases with independent budgets and access control.
  • 06Unified Interface. One consistent API for all providers. Switch models without changing code.
  • 07Drop-in Replacement. Replace your existing SDK with just one line change. Compatible with OpenAI, Anthropic, LiteLLM, Google GenAI, LangChain, and more. [Drop-in replacement docs]
  • 08Built-in Observability. Out-of-the-box OpenTelemetry support. Built-in dashboard for quick visibility without complex setup.
  • 09Community Support. Active Discord community with responsive support and regular updates.

Enterprise Features

  • 01Governance. SAML support for SSO and role-based access control with policy enforcement for team collaboration. [Governance resource]
  • 02Adaptive Load Balancing. Automatically optimizes traffic distribution across provider keys and models based on real-time performance metrics.
  • 03Cluster Mode. High availability deployment with automatic failover and load balancing. Peer-to-peer clustering where every instance is equal.
  • 04Alerts. Real-time notifications for budget limits, failures, and performance issues on Email, Slack, PagerDuty, Teams, Webhook, and more.
  • 05Log Exports. Export and analyze request logs, traces, and telemetry data from Bifrost with enterprise-grade data export for compliance, monitoring, and analytics.
  • 06Audit Logs. Comprehensive logging and audit trails for compliance and debugging.
  • 07Vault Support. Secure API key management with HashiCorp Vault, AWS Secrets Manager, Google Secret Manager, and Azure Key Vault integration.
  • 08VPC Deployment. Deploy Bifrost within your private cloud infrastructure with VPC isolation, custom networking, and enhanced security controls. [Enterprise deployment resource]
  • 09Guardrails. Automatically detect and block unsafe model outputs with real-time policy enforcement and content moderation across all agents. [Guardrails resource]

Drop-in replacement for compatible AI SDKs

Change one line of code to point compatible SDKs at Bifrost. Works with OpenAI, Anthropic, LiteLLM, Google GenAI, LangChain, and Vercel AI SDK. [Gateway setup docs] [Drop-in replacement docs]

OpenAIopenai.py
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ.get("OPENAI_API_KEY"),
    base_url="https://<bifrost_url>/openai",
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}],
)
Anthropicanthropic.py
import os
from anthropic import Anthropic

anthropic = Anthropic(
    api_key=os.environ.get("ANTHROPIC_API_KEY"),
    base_url="https://<bifrost_url>/anthropic",
)

message = anthropic.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello, Claude"}],
)
LiteLLMlitellm.py
import litellm

# Set the base URL to your Bifrost deployment
litellm.api_base = "https://<bifrost_url>"

response = litellm.completion(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}],
)
Google GenAIgenai.py
import google.generativeai as genai

genai.configure(
    api_key="YOUR_API_KEY",
    transport="rest",
    client_options={"api_endpoint": "<bifrost_url>/google"},
)

model = genai.GenerativeModel("gemini-pro")
response = model.generate_content("Hello!")
  • Point the SDK base URL at your Bifrost deployment.
  • Keep API keys in your environment or secret manager.
  • See the docs for provider-specific configuration and deployment steps.

Trust

  • Open Source. Bifrost is open source under the Apache 2.0 License. [GitHub]
  • Publisher. Bifrost is published by H3 Labs Inc. and Maxim AI.
  • Compliance. The site references SOC 2 Type II, GDPR, HIPAA, and ISO 27001 signals. [Enterprise deployment]
  • Deployment. Enterprise resources cover VPC, on-premise, air-gapped, and multi-cloud use. [Enterprise deployment]

FAQ

What is Bifrost?

Bifrost is an open-source LLM gateway that introduces 11 microseconds of overhead at 5K RPS on a t3.xlarge machine. It provides a unified layer for model access, guardrails, and governance across AI systems. [Docs] [GitHub]

How is my data protected?

Bifrost offers zero-touch in-VPC deployments, so no data ever leaves your environment or passes through Bifrost/Maxim servers. [Governance] [Enterprise deployment]

Can Bifrost integrate with my existing AI stack?

Yes. Bifrost works with major LLM SDKs and frameworks. Compatible SDKs include OpenAI, Anthropic, Mistral, LangChain, LangGraph, and LiteLLM. [Drop-in replacement docs]

How much does Bifrost cost?

Pricing is based on the number of devices Edge runs on. Bifrost Edge is currently available in early access, and we are offering it at no cost to our existing enterprise customers. Full pricing will be released soon. [Pricing]

How can I get started with Bifrost?

You can get started with the open-source version in seconds: npx @maximhq/bifrost [Docs] [GitHub]