Try Bifrost Enterprise free for 14 days.
Request access
P
E
R
F
O
R
M
A
N
C
E
F
E
A
T
U
R
E
S
E
N
T
E
R
P
R
I
S
E
D
O
C
S
B
L
O
G
Discord
Github
Book a Demo
Home
Nebius
Nebius Models
Browse all 30 AI models from Nebius
Total Models
30
Modes
2
Avg Input (1M Tokens)
$0.24
Avg Output (1M Tokens)
$0.80
All Nebius Models
Click on any model to view details
All Modes
Showing 30 of 30 models
Model Name↑
Provider
Mode
Pricing
(tokens, images, audio, or pages)
Max Input
Tokens
Max Output
Tokens
Capabilities
bge-en-icl
nebius/BAAI/bge-en-icl
Nebius
Embedding
Input $0.010000 / 1M tokens
Output $0.0000000000 / 1M tokens
32,768
—
bge-multilingual-gemma2
nebius/BAAI/bge-multilingual-gemma2
Nebius
Embedding
Input $0.010000 / 1M tokens
Output $0.0000000000 / 1M tokens
8,192
—
DeepSeek-R1
nebius/deepseek-ai/DeepSeek-R1
Nebius
Chat
Input $0.800000 / 1M tokens
Output $2.40 / 1M tokens
128k
128k
Functions
Reasoning
DeepSeek-R1-0528
nebius/deepseek-ai/DeepSeek-R1-0528
Nebius
Chat
Input $0.800000 / 1M tokens
Output $2.40 / 1M tokens
164k
164k
Functions
Reasoning
DeepSeek-R1-Distill-Llama-70B
nebius/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Nebius
Chat
Input $0.250000 / 1M tokens
Output $0.750000 / 1M tokens
128k
128k
Functions
DeepSeek-V3
nebius/deepseek-ai/DeepSeek-V3
Nebius
Chat
Input $0.500000 / 1M tokens
Output $1.50 / 1M tokens
128k
128k
Functions
DeepSeek-V3-0324
nebius/deepseek-ai/DeepSeek-V3-0324
Nebius
Chat
Input $0.500000 / 1M tokens
Output $1.50 / 1M tokens
128k
128k
Functions
e5-mistral-7b-instruct
nebius/intfloat/e5-mistral-7b-instruct
Nebius
Embedding
Input $0.010000 / 1M tokens
Output $0.0000000000 / 1M tokens
32,768
—
gemma-3-27b-it
nebius/google/gemma-3-27b-it
Nebius
Chat
Input $0.060000 / 1M tokens
Output $0.200000 / 1M tokens
128k
128k
Functions
Vision
Hermes-3-Llama-3.1-405B
nebius/NousResearch/Hermes-3-Llama-3.1-405B
Nebius
Chat
Input $1.00 / 1M tokens
Output $3.00 / 1M tokens
128k
128k
Functions
Llama-3.1-Nemotron-Ultra-253B-v1
nebius/nvidia/Llama-3.1-Nemotron-Ultra-253B-v1
Nebius
Chat
Input $0.600000 / 1M tokens
Output $1.80 / 1M tokens
128k
128k
Functions
Llama-3.3-70B-Instruct
nebius/meta-llama/Llama-3.3-70B-Instruct
Nebius
Chat
Input $0.130000 / 1M tokens
Output $0.400000 / 1M tokens
128k
128k
Functions
Llama-3.3-Nemotron-Super-49B-v1
nebius/nvidia/Llama-3.3-Nemotron-Super-49B-v1
Nebius
Chat
Input $0.100000 / 1M tokens
Output $0.400000 / 1M tokens
131k
131k
Functions
Llama-Guard-3-8B
nebius/meta-llama/Llama-Guard-3-8B
Nebius
Chat
Input $0.020000 / 1M tokens
Output $0.060000 / 1M tokens
128k
128k
Meta-Llama-3.1-405B-Instruct
nebius/meta-llama/Meta-Llama-3.1-405B-Instruct
Nebius
Chat
Input $1.00 / 1M tokens
Output $3.00 / 1M tokens
128k
128k
Functions
Meta-Llama-3.1-70B-Instruct
nebius/meta-llama/Meta-Llama-3.1-70B-Instruct
Nebius
Chat
Input $0.130000 / 1M tokens
Output $0.400000 / 1M tokens
128k
128k
Functions
Meta-Llama-3.1-8B-Instruct
nebius/meta-llama/Meta-Llama-3.1-8B-Instruct
Nebius
Chat
Input $0.020000 / 1M tokens
Output $0.060000 / 1M tokens
128k
128k
Functions
Mistral-Nemo-Instruct-2407
nebius/mistralai/Mistral-Nemo-Instruct-2407
Nebius
Chat
Input $0.040000 / 1M tokens
Output $0.120000 / 1M tokens
128k
128k
Functions
Qwen2-VL-72B-Instruct
nebius/Qwen/Qwen2-VL-72B-Instruct
Nebius
Chat
Input $0.130000 / 1M tokens
Output $0.400000 / 1M tokens
131k
131k
Functions
Vision
Qwen2-VL-7B-Instruct
nebius/Qwen/Qwen2-VL-7B-Instruct
Nebius
Chat
Input $0.020000 / 1M tokens
Output $0.060000 / 1M tokens
131k
131k
Vision
Qwen2.5-32B-Instruct
nebius/Qwen/Qwen2.5-32B-Instruct
Nebius
Chat
Input $0.060000 / 1M tokens
Output $0.200000 / 1M tokens
128k
128k
Functions
Qwen2.5-72B-Instruct
nebius/Qwen/Qwen2.5-72B-Instruct
Nebius
Chat
Input $0.130000 / 1M tokens
Output $0.400000 / 1M tokens
128k
128k
Functions
Qwen2.5-Coder-7B
nebius/Qwen/Qwen2.5-Coder-7B
Nebius
Chat
Input $0.010000 / 1M tokens
Output $0.030000 / 1M tokens
32,768
32,768
Functions
Qwen2.5-VL-72B-Instruct
nebius/Qwen/Qwen2.5-VL-72B-Instruct
Nebius
Chat
Input $0.130000 / 1M tokens
Output $0.400000 / 1M tokens
131k
131k
Functions
Vision
Qwen3-14B
nebius/Qwen/Qwen3-14B
Nebius
Chat
Input $0.080000 / 1M tokens
Output $0.240000 / 1M tokens
32,768
32,768
Functions
Qwen3-235B-A22B
nebius/Qwen/Qwen3-235B-A22B
Nebius
Chat
Input $0.200000 / 1M tokens
Output $0.600000 / 1M tokens
262k
262k
Functions
Qwen3-30B-A3B
nebius/Qwen/Qwen3-30B-A3B
Nebius
Chat
Input $0.100000 / 1M tokens
Output $0.300000 / 1M tokens
32,768
32,768
Functions
Qwen3-32B
nebius/Qwen/Qwen3-32B
Nebius
Chat
Input $0.100000 / 1M tokens
Output $0.300000 / 1M tokens
32,768
32,768
Functions
Qwen3-4B
nebius/Qwen/Qwen3-4B
Nebius
Chat
Input $0.080000 / 1M tokens
Output $0.240000 / 1M tokens
32,768
32,768
Functions
QwQ-32B
nebius/Qwen/QwQ-32B
Nebius
Chat
Input $0.150000 / 1M tokens
Output $0.450000 / 1M tokens
32,768
32,768
Functions
Reasoning