Multi-Provider Setup

Configure multiple providers so you can switch between them seamlessly. This example shows how to configure the OpenAI, Anthropic, and Mistral providers.
Provider Configuration Interface
  1. Go to http://localhost:8080
  2. Navigate to “Providers” in the sidebar
  3. Click “Add Provider”
  4. Select provider and configure keys
  5. Save configuration
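These UI steps are typically backed by a providers section in Bifrost's config.json. A minimal sketch, assuming env.-style key references; the field names (keys, value, weight) are illustrative and may differ across Bifrost versions:

```json
{
    "providers": {
        "openai": {
            "keys": [{ "value": "env.OPENAI_API_KEY", "weight": 1.0 }]
        },
        "anthropic": {
            "keys": [{ "value": "env.ANTHROPIC_API_KEY", "weight": 1.0 }]
        },
        "mistral": {
            "keys": [{ "value": "env.MISTRAL_API_KEY", "weight": 1.0 }]
        }
    }
}
```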

Making Requests

Once providers are configured, you can make requests to any specific provider. This example shows how to send a request directly to OpenAI’s GPT-4o Mini model. Bifrost handles the provider-specific API formatting automatically.
curl --location 'http://localhost:8080/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data '{
    "model": "openai/gpt-4o-mini",
    "messages": [
        {"role": "user", "content": "Hello!"}
    ]
}'

Environment Variables

Set up your API keys for the providers you want to use. Bifrost supports both direct key values and environment variable references with the env. prefix:
export OPENAI_API_KEY="your-openai-api-key"
export ANTHROPIC_API_KEY="your-anthropic-api-key"
export MISTRAL_API_KEY="your-mistral-api-key"
export GROQ_API_KEY="your-groq-api-key"
export COHERE_API_KEY="your-cohere-api-key"
Environment Variable Handling:
  • Use "value": "env.VARIABLE_NAME" to reference environment variables
  • Use "value": "sk-proj-xxxxxxxxx" to pass keys directly
  • All sensitive data is automatically redacted in GET requests and UI responses for security
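A single key entry can use either form; as a sketch (the weight field is illustrative):

```json
"keys": [
    { "value": "env.OPENAI_API_KEY", "weight": 1.0 },
    { "value": "sk-proj-xxxxxxxxx", "weight": 1.0 }
]
```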

Advanced Configuration

Weighted Load Balancing

Distribute requests across multiple API keys or providers based on custom weights. This example shows how to split traffic 70/30 between two OpenAI keys, useful for managing rate limits or costs across different accounts.
Weighted Load Balancing Interface
  1. Navigate to “Providers” → “OpenAI”
  2. Click “Add Key” to add multiple keys
  3. Set weight values (0.7 and 0.3)
  4. Save configuration
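In config.json, the equivalent is two key entries whose weights split the traffic. A hypothetical sketch, assuming env.-style references; the environment variable names here are placeholders:

```json
"keys": [
    { "value": "env.OPENAI_API_KEY_PRIMARY", "weight": 0.7 },
    { "value": "env.OPENAI_API_KEY_SECONDARY", "weight": 0.3 }
]
```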

Model-Specific Keys

Use different API keys for specific models, allowing you to manage access controls and billing separately. This example uses a premium key for advanced reasoning models (o1-preview, o1-mini) and a standard key for regular GPT models.
Model-Specific Keys Interface
  1. Navigate to “Providers” → “OpenAI”
  2. Add first key with models: ["gpt-4o", "gpt-4o-mini"]
  3. Add premium key with models: ["o1-preview", "o1-mini"]
  4. Save configuration
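As a config.json sketch, each key entry carries a models list restricting which models it serves; the env. variable names below are placeholders:

```json
"keys": [
    { "value": "env.OPENAI_STANDARD_KEY", "models": ["gpt-4o", "gpt-4o-mini"], "weight": 1.0 },
    { "value": "env.OPENAI_PREMIUM_KEY", "models": ["o1-preview", "o1-mini"], "weight": 1.0 }
]
```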

Custom Network Settings

Customize the network configuration for each provider, including custom base URLs, extra headers, and timeout settings. This example shows how to use a local OpenAI-compatible server with custom headers for user identification.
Network Configuration Interface
  1. Navigate to “Providers” → “OpenAI” → “Advanced”
  2. Set Base URL: http://localhost:8000/v1
  3. Set Timeout: 30 seconds
  4. Save configuration
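The equivalent network settings might look like the sketch below; the field names (network_config, base_url, extra_headers, and the timeout key) are assumptions based on the UI labels above, and the header is a hypothetical example:

```json
"network_config": {
    "base_url": "http://localhost:8000/v1",
    "extra_headers": { "X-User-ID": "user-123" },
    "default_request_timeout_in_seconds": 30
}
```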

Managing Retries

Configure retry behavior for handling temporary failures and rate limits. This example sets up exponential backoff with up to 5 retries, starting with a 1 ms delay and capping at 10 seconds, which is ideal for handling transient network issues.
Retry Configuration Interface
  1. Navigate to “Providers” → “OpenAI” → “Advanced”
  2. Set Max Retries: 5
  3. Set Initial Backoff: 1 ms
  4. Set Max Backoff: 10000 ms
  5. Save configuration
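A hypothetical retry sketch in config.json; the field names are assumptions derived from the UI labels above, so check them against your Bifrost version:

```json
"network_config": {
    "max_retries": 5,
    "retry_backoff_initial_ms": 1,
    "retry_backoff_max_ms": 10000
}
```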

Custom Concurrency and Buffer Size

Fine-tune performance by adjusting worker concurrency and queue sizes per provider. This example gives OpenAI higher limits (100 workers, 500 queue) for high throughput, while Anthropic gets conservative limits to respect their rate limits.
Concurrency Configuration Interface
  1. Navigate to “Providers” → Provider → “Performance”
  2. Set Concurrency: Worker count (100 for OpenAI, 25 for Anthropic)
  3. Set Buffer Size: Queue size (500 for OpenAI, 100 for Anthropic)
  4. Save configuration
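Per-provider performance settings might be expressed as below; the concurrency_and_buffer_size field name is an assumption, while the values match the example above:

```json
"openai": {
    "concurrency_and_buffer_size": { "concurrency": 100, "buffer_size": 500 }
},
"anthropic": {
    "concurrency_and_buffer_size": { "concurrency": 25, "buffer_size": 100 }
}
```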

Setting Up a Proxy

Route requests through proxies for compliance, security, or geographic requirements. This example shows both HTTP proxy for OpenAI and authenticated SOCKS5 proxy for Anthropic, useful for corporate environments or regional access.
Proxy Configuration Interface
  1. Navigate to “Providers” → Provider → “Proxy”
  2. Select Proxy Type: HTTP or SOCKS5
  3. Set Proxy URL: http://localhost:8000
  4. Add credentials if needed (username/password)
  5. Save configuration
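A hypothetical proxy sketch covering both cases from the example above; the proxy_config field name, the SOCKS5 host, and the credential placeholders are all assumptions:

```json
"openai": {
    "proxy_config": { "type": "http", "url": "http://localhost:8000" }
},
"anthropic": {
    "proxy_config": {
        "type": "socks5",
        "url": "socks5://proxy.example.com:1080",
        "username": "your-username",
        "password": "your-password"
    }
}
```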

Send Back Raw Response

Include the original provider response alongside Bifrost’s standardized response format. Useful for debugging and accessing provider-specific metadata.
Raw Response Configuration Interface
  1. Navigate to “Providers” → Provider → “Advanced”
  2. Toggle “Include Raw Response” to enabled
  3. Save configuration
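In config.json, this toggle likely corresponds to a per-provider boolean flag; the field name below is an assumption based on the section title:

```json
"openai": {
    "send_back_raw_response": true
}
```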
When enabled, the raw provider response appears in extra_fields.raw_response:
{
    "choices": [...],
    "usage": {...},
    "extra_fields": {
        "provider": "openai",
        "raw_response": {
            // Original OpenAI response here
        }
    }
}

Provider-Specific Authentication

Enterprise cloud providers require additional configuration beyond API keys. Configure Azure OpenAI, AWS Bedrock, and Google Vertex with platform-specific authentication details.

Azure OpenAI

Azure OpenAI requires endpoint URLs, deployment mappings, and API version configuration:
Azure OpenAI Configuration Interface
  1. Navigate to “Providers” → “Azure OpenAI”
  2. Set API Key: Your Azure API key
  3. Set Endpoint: Your Azure endpoint URL
  4. Configure Deployments: Map model names to deployment names
  5. Set API Version: e.g., 2024-08-01-preview
  6. Save configuration
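An illustrative Azure sketch; the meta_config field name, the endpoint URL, and the deployment name are placeholders, and only the API version comes from the steps above:

```json
"azure": {
    "keys": [{ "value": "env.AZURE_API_KEY", "weight": 1.0 }],
    "meta_config": {
        "endpoint": "https://your-resource.openai.azure.com",
        "deployments": { "gpt-4o": "your-gpt-4o-deployment" },
        "api_version": "2024-08-01-preview"
    }
}
```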

AWS Bedrock

AWS Bedrock supports both explicit credentials and IAM role authentication:
AWS Bedrock Configuration Interface
  1. Navigate to “Providers” → “AWS Bedrock”
  2. Set Access Key: AWS Access Key ID (or leave empty for IAM)
  3. Set Secret Key: AWS Secret Access Key (or leave empty for IAM)
  4. Set Region: e.g., us-east-1
  5. Configure Deployments: Map model names to inference profiles
  6. Set ARN: Required for deployments mapping
  7. Save configuration
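An illustrative Bedrock sketch with explicit credentials; the meta_config field names, the ARN placeholder, and the deployment mapping are assumptions, and leaving the credential fields empty would fall back to IAM as noted below:

```json
"bedrock": {
    "keys": [{ "value": "env.AWS_ACCESS_KEY_ID", "weight": 1.0 }],
    "meta_config": {
        "secret_access_key": "env.AWS_SECRET_ACCESS_KEY",
        "region": "us-east-1",
        "arn": "your-inference-profile-arn",
        "deployments": { "model-name": "your-inference-profile-id" }
    }
}
```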
Notes:
  • If both access_key and secret_key are empty, Bifrost uses IAM role authentication from the environment
  • arn is required for URL formation; the deployments mapping is ignored without it
  • When using arn + deployments, Bifrost uses model profiles; otherwise it forms the path directly from the incoming model name

Google Vertex

Google Vertex requires project configuration and authentication credentials:
Google Vertex Configuration Interface
  1. Navigate to “Providers” → “Google Vertex”
  2. Set API Key: Your Vertex API key
  3. Set Project ID: Your Google Cloud project ID
  4. Set Region: e.g., us-central1
  5. Set Auth Credentials: Service account credentials JSON
  6. Save configuration
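An illustrative Vertex sketch; the meta_config field names and the project ID placeholder are assumptions, and auth_credentials stands in for the service account credentials JSON referenced above:

```json
"vertex": {
    "keys": [{ "value": "env.VERTEX_API_KEY", "weight": 1.0 }],
    "meta_config": {
        "project_id": "your-gcp-project-id",
        "region": "us-central1",
        "auth_credentials": "env.VERTEX_CREDENTIALS"
    }
}
```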

Next Steps

Now that you understand provider configuration, explore these related topics:
