AI Governance

Role-Based LLM Access for Engineering, Sales & Exec Teams

Set up role-based LLM access for engineering, sales, and executive teams with Bifrost. Scope models, budgets, rate limits, and tool permissions by role.

Most organizations hand every employee the same API key to a shared LLM account, which means a sales rep drafting an email and an engineer running an agentic workflow draw from the same credential, the same budget, and the same unrestricted model list. Bifrost, the open-source AI gateway built in Go by Maxim AI, replaces that flat setup with role-based LLM access, so every team gets its own scoped models, spending limits, rate limits, and tool permissions through a single control plane. This guide covers how to configure role-based LLM access for engineering, sales, and executive teams, from virtual keys to identity-provider-driven role assignment.

What Is Role-Based LLM Access?

Role-based LLM access is an access-control model that grants users LLM permissions based on their job role rather than giving everyone the same API key. Each role defines which models and providers a user can call, how much they can spend, how many requests they can send, and which tools they can invoke. Permissions are attached to roles, and users inherit them through role assignment.

This is the LLM-specific application of role-based access control (RBAC), the model formalized by NIST in 1992 that has since become the predominant approach to access control because it removes the need to assign privileges to every individual user by hand. Applied to an AI gateway, RBAC governs not just what a user can read or edit, but which model endpoints they can reach and at what cost. Bifrost's governance system implements this directly at the request layer.

Why Per-Team LLM Access Control Matters

A shared key gives every user the maximum possible privilege by default, which is the opposite of what security frameworks recommend. The OWASP Top 10 for LLM Applications lists excessive permissions, granting an identity broader access than its task requires, as a primary root cause of the Excessive Agency risk category. Per-team LLM access control directly addresses this by scoping each role to the minimum it needs.

The practical reasons to adopt role-based LLM access:

Cost containment: A single unbounded key lets one team or one runaway script consume the entire monthly AI budget. Per-role budgets isolate spend.
Model scoping: Engineering may need frontier reasoning models, while sales needs only a cost-efficient model for drafting. Scoping prevents accidental use of expensive models.
Blast-radius reduction: If one credential leaks, the exposure is limited to that role's models, budget, and tools rather than the whole account.
Audit and compliance: Regulated teams need to show who accessed which models and how much was spent. Per-role attribution makes usage traceable.

Bifrost's LLM access governance is built around these requirements, so access policy and spend control live in the same place that routes the traffic.

How Bifrost Implements Role-Based LLM Access

The Bifrost platform implements role-based LLM access through three connected primitives: virtual keys, RBAC roles, and access profiles.

Virtual keys are the primary governance entity. A virtual key carries its own access control (model and provider filtering), its own budget, and its own token-and-request rate limits. Applications authenticate with a virtual key using a standard header such as x-bf-vk or an Authorization: Bearer header, so the gateway can apply the correct policy to every request.

Role-based access control governs what users can do inside the control plane. Bifrost ships three system roles, Admin (full access across all resources), Developer (create-read-update-delete access to technical resources plus view access to logs), and Viewer (read-only), and supports custom roles when the three defaults do not match your org.

Access profiles tie the two together. An access profile is a reusable policy template that describes a provider list, model whitelist, budgets, rate limits, and MCP tool access. When you assign a profile to a role, Bifrost issues a per-user virtual key automatically and write-protects it, so a user cannot weaken their own policy by editing the key directly.

Budgets and limits are hierarchical. Bifrost's budget and rate-limit system flows from customer to team to virtual key to provider config, where each level can hold an independent budget and reset on a calendar-aligned schedule. A team-level cap can sit above per-user caps so an individual overage never breaches the team's ceiling.

Setting Up Access for Engineering, Sales, and Exec Teams

The cleanest way to roll this out is one access profile per team. Define the profile once, mark it as the default for the matching role, and Bifrost auto-issues managed virtual keys to every user who lands in that role.

Engineering: Broad Model Access with Guardrails

Engineering profiles carry the widest scope: multiple providers, current frontier models, and MCP tool access enabled for agentic workflows. Pair that breadth with a generous but bounded monthly budget and a Developer dashboard role. Because virtual keys support per-key MCP tool filtering, you can allow engineering to use code-execution and repository tools while keeping higher-risk tools off by default.

A team member's call then looks like an ordinary OpenAI-style request, just pointed at the gateway with their virtual key:

curl -X POST <http://localhost:8080/v1/chat/completions> \
  -H "Content-Type: application/json" \
  -H "x-bf-vk: sk-bf-engineering-xxxx" \
  -d '{
    "model": "anthropic/claude-sonnet-4-5",
    "messages": [{"role": "user", "content": "Refactor this function for readability"}]
  }'

Sales: A Narrow, Cost-Capped Model Set

Sales rarely needs frontier reasoning. A sales profile typically whitelists one or two cost-efficient models for drafting outreach and summarizing calls, disables MCP tool execution, and sets a low monthly budget that resets on the first of the month. Token-and-request rate limits on the virtual key prevent a bulk-email script from draining the allocation in an afternoon. Assign these users a Viewer or a custom dashboard role so they can see their own usage without editing shared configuration.

Executives: Read-Only Analytics and a Limited Assistant

Executive access is usually two things: visibility into aggregate AI spend and a small personal assistant allowance. Build an exec profile with a single approved model, a modest budget, and no tool access, then grant a dashboard role scoped to usage analytics. Leadership sees where AI spend is going across teams without holding a credential that can run production workloads.

Connecting Roles to Your Identity Provider

Manually creating accounts and handing out keys does not scale, and it drifts out of sync the moment someone changes teams. Bifrost supports user provisioning over OIDC and SCIM, so role assignment is driven by your existing identity provider rather than a separate list.

With advanced governance configured, a single setup gives you:

Single sign-on via OAuth 2.0 / OIDC with JWKS-based token validation.
Automatic role assignment from custom claims, app roles, or group-to-role mappings, so an engineer in the IdP's engineering group inherits the engineering role and its access profile on first login.
Team synchronization from IdP groups into Bifrost teams, plus business-unit mapping from IdP attributes.
Background lifecycle reconciliation every 24 hours and OIDC session refresh checks every 15 minutes, so deactivated users lose access without manual cleanup.
Inbound SCIM 2.0, letting the IdP push user and group changes to Bifrost in real time.

Bifrost provides step-by-step setup guides for the major providers, including Okta, Microsoft Entra, Keycloak, Zitadel, and Google Workspace. For regulated environments, the Bifrost Enterprise tier adds immutable audit logs that record access and configuration changes for SOC 2, GDPR, HIPAA, and ISO 27001 evidence.

Common Questions About Role-Based LLM Access

Can I set different budgets for each team?

Yes. Budgets are independent at the virtual key, team, and customer level, and they can be combined hierarchically. A team budget acts as a ceiling above the per-user budgets attached to that team's virtual keys.

Does role-based access work with my existing OpenAI or Anthropic SDK?

Yes. Bifrost is a drop-in replacement for the major provider SDKs. Each team points its SDK at the gateway base URL and authenticates with its virtual key, so no application code changes beyond the base URL and key.

What happens when someone changes teams?

When the user's group changes in your identity provider, the next reconciliation cycle or SCIM push updates their role in Bifrost, which reassigns the matching access profile and virtual key. Access follows the person's current role automatically.

Is role-based LLM access available in the open-source version?

Virtual keys, model and provider filtering, budgets, and rate limits are part of the open-source governance feature set. RBAC roles, access profiles, OIDC and SCIM provisioning, and audit logs are Bifrost Enterprise capabilities.

Getting Started with Role-Based LLM Access

Role-based LLM access turns a single shared credential into per-team policy: scoped models, isolated budgets, enforced rate limits, and tool permissions that match each role. Engineering gets the breadth it needs, sales gets a cost-capped model set, executives get visibility without a production key, and every request is attributable to a role for audit. Bifrost enforces all of it at the gateway, in front of every provider, with policy and routing in one place.

To see how Bifrost can structure LLM access governance for your teams, book a demo with the Bifrost team, or explore the full set of Bifrost resources to plan your rollout.