Try Bifrost Enterprise free for 14 days.
Request access
P
E
R
F
O
R
M
A
N
C
E
F
E
A
T
U
R
E
S
E
N
T
E
R
P
R
I
S
E
P
R
I
C
I
N
G
D
O
C
S
B
L
O
G
Discord
Github
Book a Demo
Home
Google Vertex AI
Google Vertex AI Models
Browse all 146 AI models from Google Vertex AI
Total Models
146
Modes
7
Avg Input (1M Tokens)
$2.22
Avg Output (1M Tokens)
$11.17
All Google Vertex AI Models
Click on any model to view details
All Modes
Showing 46 of 146 models
Model Name↑
Provider
Mode
Pricing
(tokens, images, audio, or pages)
Max Input
Tokens
Max Output
Tokens
Capabilities
claude-sonnet-4-6@default
vertex_ai/claude-sonnet-4-6@default
Google Vertex AI
Chat
Input $3.00 / 1M tokens
Output $15.00 / 1M tokens
1000k
64,000
Functions
Vision
Reasoning
deepseek-ocr-maas
vertex_ai/deepseek-ai/deepseek-ocr-maas
Google Vertex AI
OCR
Input $0.300000 / 1M tokens
Output $1.20 / 1M tokens
,
,
gemini-flash-experimental
vertex_ai/gemini-flash-experimental
Google Vertex AI
Embedding
Input $0.0000000000 / 1M tokens
Output $0.0000000000 / 1M tokens
1000k
8,192
gemma-4-26b-a4b-it
vertex_ai/gemma-4-26b-a4b-it
Google Vertex AI
Chat
Input $0.070000 / 1M tokens
Output $0.340000 / 1M tokens
,
8,192
Functions
Vision
gemma-4-26b-a4b-it-maas
vertex_ai/google/gemma-4-26b-a4b-it-maas
Google Vertex AI
Chat
Input $0.070000 / 1M tokens
Output $0.340000 / 1M tokens
,
8,192
Functions
Vision
gemma-4-26b-a4b-it-maas
vertex_ai/gemma-4-26b-a4b-it-maas
Google Vertex AI
Chat
Input $0.070000 / 1M tokens
Output $0.340000 / 1M tokens
,
8,192
Functions
Vision
gemma-4-31b
vertex_ai/gemma-4-31b
Google Vertex AI
Chat
Input $0.150000 / 1M tokens
Output $0.600000 / 1M tokens
,
8,192
Functions
Vision
glm-4.7-maas
vertex_ai/zai-org/glm-4.7-maas
Google Vertex AI
Chat
Input $0.600000 / 1M tokens
Output $2.20 / 1M tokens
200k
128k
Functions
Reasoning
glm-5-maas
vertex_ai/zai-org/glm-5-maas
Google Vertex AI
Chat
Input $1.00 / 1M tokens
Output $3.20 / 1M tokens
200k
128k
Functions
Reasoning
gpt-oss-120b-maas
vertex_ai/openai/gpt-oss-120b-maas
Google Vertex AI
Chat
Input $0.150000 / 1M tokens
Output $0.600000 / 1M tokens
131k
32,768
Reasoning
gpt-oss-20b-maas
vertex_ai/openai/gpt-oss-20b-maas
Google Vertex AI
Chat
Input $0.075000 / 1M tokens
Output $0.300000 / 1M tokens
131k
32,768
Reasoning
grok-4.1-fast-non-reasoning
vertex_ai/xai/grok-4.1-fast-non-reasoning
Google Vertex AI
Chat
Input $0.200000 / 1M tokens
Output $0.500000 / 1M tokens
2000k
2000k
Functions
Vision
Reasoning
Web Search
Audio In
grok-4.1-fast-reasoning
vertex_ai/xai/grok-4.1-fast-reasoning
Google Vertex AI
Chat
Input $0.200000 / 1M tokens
Output $0.500000 / 1M tokens
2000k
2000k
Functions
Vision
Reasoning
Web Search
Audio In
grok-4.20-beta-0309-non-reasoning
vertex_ai/xai/grok-4.20-beta-0309-non-reasoning
Google Vertex AI
Chat
Input $2.00 / 1M tokens
Output $6.00 / 1M tokens
2000k
2000k
Functions
Vision
Web Search
grok-4.20-non-reasoning
vertex_ai/xai/grok-4.20-non-reasoning
Google Vertex AI
Chat
Input $2.00 / 1M tokens
Output $6.00 / 1M tokens
2000k
2000k
Functions
Vision
Web Search
grok-4.20-reasoning
vertex_ai/xai/grok-4.20-reasoning
Google Vertex AI
Chat
Input $2.00 / 1M tokens
Output $6.00 / 1M tokens
2000k
2000k
Functions
Vision
Reasoning
Web Search
kimi-k2-thinking-maas
vertex_ai/moonshotai/kimi-k2-thinking-maas
Google Vertex AI
Chat
Input $0.600000 / 1M tokens
Output $2.50 / 1M tokens
256k
256k
Functions
Web Search
llama-4-scout-17b-128e-instruct-maas
vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas
Google Vertex AI
Chat
Input $0.250000 / 1M tokens
Output $0.700000 / 1M tokens
10000k
10000k
Functions
llama-4-scout-17b-16e-instruct-maas
vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas
Google Vertex AI
Chat
Input $0.250000 / 1M tokens
Output $0.700000 / 1M tokens
10000k
10000k
Functions
llama3-405b-instruct-maas
vertex_ai/meta/llama3-405b-instruct-maas
Google Vertex AI
Chat
Input $0.0000000000 / 1M tokens
Output $0.0000000000 / 1M tokens
32,000
32,000
llama3-70b-instruct-maas
vertex_ai/meta/llama3-70b-instruct-maas
Google Vertex AI
Chat
Input $0.0000000000 / 1M tokens
Output $0.0000000000 / 1M tokens
32,000
32,000
llama3-8b-instruct-maas
vertex_ai/meta/llama3-8b-instruct-maas
Google Vertex AI
Chat
Input $0.0000000000 / 1M tokens
Output $0.0000000000 / 1M tokens
32,000
32,000
minimax-m2-maas
vertex_ai/minimaxai/minimax-m2-maas
Google Vertex AI
Chat
Input $0.300000 / 1M tokens
Output $1.20 / 1M tokens
196k
196k
Functions
mistral-large-2411
vertex_ai/mistral-large-2411
Google Vertex AI
Chat
Input $2.00 / 1M tokens
Output $6.00 / 1M tokens
128k
8,191
Functions
mistral-large@2407
vertex_ai/mistral-large@2407
Google Vertex AI
Chat
Input $2.00 / 1M tokens
Output $6.00 / 1M tokens
128k
8,191
Functions
mistral-large@2411-001
vertex_ai/mistral-large@2411-001
Google Vertex AI
Chat
Input $2.00 / 1M tokens
Output $6.00 / 1M tokens
128k
8,191
Functions
mistral-large@latest
vertex_ai/mistral-large@latest
Google Vertex AI
Chat
Input $2.00 / 1M tokens
Output $6.00 / 1M tokens
128k
8,191
Functions
mistral-medium-3
vertex_ai/mistral-medium-3
Google Vertex AI
Chat
Input $0.400000 / 1M tokens
Output $2.00 / 1M tokens
128k
8,191
Functions
mistral-medium-3
vertex_ai/mistralai/mistral-medium-3
Google Vertex AI
Chat
Input $0.400000 / 1M tokens
Output $2.00 / 1M tokens
128k
8,191
Functions
mistral-medium-3@001
vertex_ai/mistral-medium-3@001
Google Vertex AI
Chat
Input $0.400000 / 1M tokens
Output $2.00 / 1M tokens
128k
8,191
Functions
mistral-medium-3@001
vertex_ai/mistralai/mistral-medium-3@001
Google Vertex AI
Chat
Input $0.400000 / 1M tokens
Output $2.00 / 1M tokens
128k
8,191
Functions
mistral-nemo@2407
vertex_ai/mistral-nemo@2407
Google Vertex AI
Chat
Input $3.00 / 1M tokens
Output $3.00 / 1M tokens
128k
128k
Functions
mistral-nemo@latest
vertex_ai/mistral-nemo@latest
Google Vertex AI
Chat
Input $0.150000 / 1M tokens
Output $0.150000 / 1M tokens
128k
128k
Functions
mistral-small-2503
vertex_ai/mistral-small-2503
Google Vertex AI
Chat
Input $1.00 / 1M tokens
Output $3.00 / 1M tokens
128k
128k
Functions
Vision
mistral-small-2503@001
vertex_ai/mistral-small-2503@001
Google Vertex AI
Chat
Input $1.00 / 1M tokens
Output $3.00 / 1M tokens
32,000
8,191
Functions
qwen3-235b-a22b-instruct-2507-maas
vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas
Google Vertex AI
Chat
Input $0.250000 / 1M tokens
Output $1.00 / 1M tokens
262k
16,384
Functions
qwen3-coder-480b-a35b-instruct-maas
vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas
Google Vertex AI
Chat
Input $1.00 / 1M tokens
Output $4.00 / 1M tokens
262k
32,768
Functions
qwen3-next-80b-a3b-instruct-maas
vertex_ai/qwen/qwen3-next-80b-a3b-instruct-maas
Google Vertex AI
Chat
Input $0.150000 / 1M tokens
Output $1.20 / 1M tokens
262k
262k
Functions
qwen3-next-80b-a3b-thinking-maas
vertex_ai/qwen/qwen3-next-80b-a3b-thinking-maas
Google Vertex AI
Chat
Input $0.150000 / 1M tokens
Output $1.20 / 1M tokens
262k
262k
Functions
veo-2.0-generate-001
vertex_ai/veo-2.0-generate-001
Google Vertex AI
Video Generation
Output $0.350000 / second
1,024
,
veo-3.0-fast-generate-001
vertex_ai/veo-3.0-fast-generate-001
Google Vertex AI
Video Generation
Output $0.150000 / second
1,024
,
veo-3.0-generate-001
vertex_ai/veo-3.0-generate-001
Google Vertex AI
Video Generation
Output $0.400000 / second
1,024
,
veo-3.1-fast-generate-001
vertex_ai/veo-3.1-fast-generate-001
Google Vertex AI
Video Generation
Output $0.150000 / second
1,024
,
veo-3.1-fast-generate-preview
vertex_ai/veo-3.1-fast-generate-preview
Google Vertex AI
Video Generation
Output $0.150000 / second
1,024
,
veo-3.1-generate-001
vertex_ai/veo-3.1-generate-001
Google Vertex AI
Video Generation
Output $0.400000 / second
1,024
,
veo-3.1-generate-preview
vertex_ai/veo-3.1-generate-preview
Google Vertex AI
Video Generation
Output $0.400000 / second
1,024
,
Showing
101
–
146
of
146
Prev
Page
2
of
2
1
2
Next
Related Resources
Calculate Google Vertex AI costs →