The fastest LLM gateway in the world

Access GPT, Gemini, Claude, Mistral, and more through one gateway: configuring providers in Bifrost

When building AI-powered applications, chances are you don’t want to rely on a single provider:

  • Claude for reasoning-heavy tasks
  • GPT-4o for multimodal inputs
  • Gemini for Google ecosystem integrations
  • Mistral for fast, cost-effective completions

Each provider has different SDKs, auth methods, rate limits, and response formats. Maintaining them quickly becomes messy.

Bifrost solves this by acting as a unified AI gateway: one API surface, multiple providers behind it. In this post, we’ll walk through configuring providers in Bifrost so you can switch (or mix) GPT, Claude, Gemini, and Mistral with almost no extra code.


Why Use a Gateway?

Imagine you’re building an AI support assistant:

  • For routine queries, you want Mistral (cheap + fast).
  • For escalations, you switch to Claude Sonnet (better reasoning).
  • For multimodal inputs, you need GPT-4o.
  • And if you’re on GCP, Gemini integrates best.

Instead of coding against four different SDKs, Bifrost gives you a single /v1/chat/completions API that works across all of them.


Run Bifrost

Install and run Bifrost using Docker:

# Pull and run Bifrost HTTP API
docker pull maximhq/bifrost
docker run -p 8080:8080 maximhq/bifrost

By default, the dashboard runs at:

👉 http://localhost:8080
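
The provider configs below reference keys as env.OPENAI_API_KEY, env.ANTHROPIC_API_KEY, and so on, so those environment variables need to be visible to the Bifrost process. A minimal sketch, assuming Bifrost resolves env.* references from the container's environment and that the keys are already exported on your host:

# Pass provider API keys into the container (variable names match the examples below)
docker run -p 8080:8080 \
  -e OPENAI_API_KEY \
  -e ANTHROPIC_API_KEY \
  -e MISTRAL_API_KEY \
  maximhq/bifrost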

Configure Providers

You can add providers via the Web UI, API, or a config.json file. Below are API examples.

OpenAI (GPT)

curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
  "provider": "openai",
  "keys": [
    {
      "value": "env.OPENAI_API_KEY",
      "models": ["gpt-4o", "gpt-4o-mini"],
      "weight": 1.0
    }
  ]
}'

Anthropic (Claude)

curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
  "provider": "anthropic",
  "keys": [
    {
      "value": "env.ANTHROPIC_API_KEY",
      "models": ["claude-3-5-sonnet", "claude-3-opus"],
      "weight": 1.0
    }
  ]
}'

Google Vertex (Gemini)

curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
  "provider": "vertex",
  "keys": [
    {
      "value": "env.VERTEX_API_KEY",
      "models": ["gemini-pro", "gemini-pro-vision"],
      "weight": 1.0,
      "vertex_key_config": {
        "project_id": "env.VERTEX_PROJECT_ID",
        "region": "us-central1",
        "auth_credentials": "env.VERTEX_CREDENTIALS"
      }
    }
  ]
}'

Mistral

curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
  "provider": "mistral",
  "keys": [
    {
      "value": "env.MISTRAL_API_KEY",
      "models": ["mistral-tiny", "mistral-medium"],
      "weight": 1.0
    }
  ]
}'
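
If you'd rather keep provider setup in version control, the same configuration can live in a config.json file instead of being posted to the API. A minimal sketch, assuming the file follows the providers/keys shape shown in the Advanced Routing section below:

{
  "providers": {
    "openai": {
      "keys": [
        {"value": "env.OPENAI_API_KEY", "models": ["gpt-4o", "gpt-4o-mini"], "weight": 1.0}
      ]
    },
    "anthropic": {
      "keys": [
        {"value": "env.ANTHROPIC_API_KEY", "models": ["claude-3-5-sonnet"], "weight": 1.0}
      ]
    }
  }
}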

Make a Request

Once configured, you can query any provider through the same endpoint:

curl --location 'http://localhost:8080/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data '{
  "model": "anthropic/claude-3-5-sonnet",
  "messages": [
    {"role": "user", "content": "Summarize this log file in 3 bullet points"}
  ]
}'

Bifrost handles the provider-specific API calls and returns a normalized response.
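
Switching providers is just a change to the model string - for example, sending the same request to GPT-4o instead (assuming the same provider/model naming convention as above):

curl --location 'http://localhost:8080/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data '{
  "model": "openai/gpt-4o",
  "messages": [
    {"role": "user", "content": "Summarize this log file in 3 bullet points"}
  ]
}'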


Advanced Routing

Say you want to split load between two OpenAI keys (70/30):

{
  "providers": {
    "openai": {
      "keys": [
        {
          "value": "env.OPENAI_API_KEY_1",
          "weight": 0.7
        },
        {
          "value": "env.OPENAI_API_KEY_2",
          "weight": 0.3
        }
      ]
    }
  }
}

This is useful for rate limit management or cost control across accounts.
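
The same split can also be set up through the API rather than config.json - a sketch that follows the payload shape used in the provider examples above:

curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
  "provider": "openai",
  "keys": [
    {"value": "env.OPENAI_API_KEY_1", "models": [], "weight": 0.7},
    {"value": "env.OPENAI_API_KEY_2", "models": [], "weight": 0.3}
  ]
}'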


Managing Retries Gracefully

Retries are tricky: too aggressive and you waste tokens and money; too conservative and users see errors. The example below configures exponential backoff with up to 5 retries, starting at a 1 ms delay and capping at 10 seconds - well suited to transient network issues.

Example:

curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
    "provider": "openai",
    "keys": [
        {
            "value": "env.OPENAI_API_KEY",
            "models": [],
            "weight": 1.0
        }
    ],
    "network_config": {
        "max_retries": 5,
        "retry_backoff_initial_ms": 1,
        "retry_backoff_max_ms": 10000
    }
}'

Concurrency and Buffer Size

When you scale from dozens to thousands of requests, concurrency control saves you from provider bans.

This example gives OpenAI higher limits (100 workers, 500-request queue) for high throughput, while Anthropic gets more conservative settings to stay within its rate limits.

# OpenAI with high throughput settings
curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
    "provider": "openai",
    "keys": [
        {
            "value": "env.OPENAI_API_KEY",
            "models": [],
            "weight": 1.0
        }
    ],
    "concurrency_and_buffer_size": {
        "concurrency": 100,
        "buffer_size": 500
    }
}'

# Anthropic with conservative settings
curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
    "provider": "anthropic",
    "keys": [
        {
            "value": "env.ANTHROPIC_API_KEY",
            "models": [],
            "weight": 1.0
        }
    ],
    "concurrency_and_buffer_size": {
        "concurrency": 25,
        "buffer_size": 100
    }
}'

Setting Up a Proxy

Route requests through proxies for compliance, security, or geographic requirements. This example configures an HTTP proxy for OpenAI and an authenticated SOCKS5 proxy for Anthropic, which is useful for corporate environments or regional access.

# HTTP proxy for OpenAI
curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
    "provider": "openai",
    "keys": [
        {
            "value": "env.OPENAI_API_KEY",
            "models": [],
            "weight": 1.0
        }
    ],
    "proxy_config": {
        "type": "http",
        "url": "<http://localhost:8000>"
    }
}'

# SOCKS5 proxy with authentication for Anthropic
curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
    "provider": "anthropic",
    "keys": [
        {
            "value": "env.ANTHROPIC_API_KEY",
            "models": [],
            "weight": 1.0
        }
    ],
    "proxy_config": {
        "type": "socks5",
        "url": "<http://localhost:8000>",
        "username": "user",
        "password": "password"
    }
}'

Now all calls to LLMs will be routed through the proxy you’ve specified.


Returning Raw Responses

By default, Bifrost normalizes responses from all providers into a common schema served at /v1/chat/completions.

But sometimes you want the raw response (for logging, debugging, or preserving model-specific metadata).

You can enable raw output per provider like this:

curl --location 'http://localhost:8080/api/providers' \
--header 'Content-Type: application/json' \
--data '{
    "provider": "openai",
    "keys": [
        {
            "value": "env.OPENAI_API_KEY",
            "models": [],
            "weight": 1.0
        }
    ],
    "send_back_raw_response": true
}'

When enabled, the raw provider response appears in extra_fields.raw_response:

{
    "choices": [...],
    "usage": {...},
    "extra_fields": {
        "provider": "openai",
        "raw_response": {
            // Original OpenAI response here
        }
    }
}
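
For quick debugging, you can pull just the original provider payload out of a Bifrost reply, for example with jq (the model name here simply follows the naming convention used earlier):

curl --silent --location 'http://localhost:8080/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data '{
  "model": "openai/gpt-4o-mini",
  "messages": [{"role": "user", "content": "ping"}]
}' | jq '.extra_fields.raw_response'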

Putting It Together: Multi-Model AI Support Assistant

With this setup, your support assistant can:

  • Use Mistral for 80% of queries
  • Escalate tricky ones to Claude Sonnet
  • Handle screenshots via GPT-4o
  • Run sensitive workloads on Gemini if hosted on GCP

All through one gateway - consistent API, retries, observability, and proxy support out of the box.
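
As a rough sketch of what that routing can look like on the client side ($TASK and $USER_QUERY are placeholders, and the model names simply follow the provider/model convention used earlier in this post):

# Hypothetical task-based routing: pick a model string per task, one Bifrost endpoint for all of them
case "$TASK" in
  routine)    MODEL="mistral/mistral-tiny" ;;
  escalation) MODEL="anthropic/claude-3-5-sonnet" ;;
  multimodal) MODEL="openai/gpt-4o" ;;
  *)          MODEL="vertex/gemini-pro" ;;
esac

curl --location 'http://localhost:8080/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data "{\"model\": \"$MODEL\", \"messages\": [{\"role\": \"user\", \"content\": \"$USER_QUERY\"}]}"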


Bifrost makes it possible to plug GPT, Claude, Gemini, and Mistral into your app in minutes, without juggling multiple SDKs.