Try Bifrost Enterprise free for 14 days.
Request access
P
E
R
F
O
R
M
A
N
C
E
F
E
A
T
U
R
E
S
E
N
T
E
R
P
R
I
S
E
D
O
C
S
B
L
O
G
Discord
Github
Book a Demo
Home
Deepinfra
Deepinfra Models
Browse all 67 AI models from Deepinfra
Total Models
67
Modes
1
Avg Input (1M Tokens)
$0.59
Avg Output (1M Tokens)
$2.56
All Deepinfra Models
Click on any model to view details
All Modes
Showing 67 of 67 models
Model Name↑
Provider
Mode
Pricing
(tokens, images, audio, or pages)
Max Input
Tokens
Max Output
Tokens
Capabilities
claude-3-7-sonnet-latest
deepinfra/anthropic/claude-3-7-sonnet-latest
Deepinfra
Chat
Input $3.30 / 1M tokens
Output $16.50 / 1M tokens
200k
200k
Functions
claude-4-opus
deepinfra/anthropic/claude-4-opus
Deepinfra
Chat
Input $16.50 / 1M tokens
Output $82.50 / 1M tokens
200k
200k
Functions
claude-4-sonnet
deepinfra/anthropic/claude-4-sonnet
Deepinfra
Chat
Input $3.30 / 1M tokens
Output $16.50 / 1M tokens
200k
200k
Functions
DeepSeek-R1
deepinfra/deepseek-ai/DeepSeek-R1
Deepinfra
Chat
Input $0.700000 / 1M tokens
Output $2.40 / 1M tokens
163k
163k
Functions
DeepSeek-R1-0528
deepinfra/deepseek-ai/DeepSeek-R1-0528
Deepinfra
Chat
Input $0.500000 / 1M tokens
Output $2.15 / 1M tokens
163k
163k
Functions
DeepSeek-R1-0528-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo
Deepinfra
Chat
Input $1.00 / 1M tokens
Output $3.00 / 1M tokens
32,768
32,768
Functions
DeepSeek-R1-Distill-Llama-70B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Deepinfra
Chat
Input $0.200000 / 1M tokens
Output $0.600000 / 1M tokens
131k
131k
DeepSeek-R1-Distill-Qwen-32B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Deepinfra
Chat
Input $0.270000 / 1M tokens
Output $0.270000 / 1M tokens
131k
131k
Functions
DeepSeek-R1-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-Turbo
Deepinfra
Chat
Input $1.00 / 1M tokens
Output $3.00 / 1M tokens
40,960
40,960
Functions
DeepSeek-V3
deepinfra/deepseek-ai/DeepSeek-V3
Deepinfra
Chat
Input $0.380000 / 1M tokens
Output $0.890000 / 1M tokens
163k
163k
Functions
DeepSeek-V3-0324
deepinfra/deepseek-ai/DeepSeek-V3-0324
Deepinfra
Chat
Input $0.250000 / 1M tokens
Output $0.880000 / 1M tokens
163k
163k
Functions
DeepSeek-V3.1
deepinfra/deepseek-ai/DeepSeek-V3.1
Deepinfra
Chat
Input $0.270000 / 1M tokens
Output $1.00 / 1M tokens
163k
163k
Functions
Reasoning
DeepSeek-V3.1-Terminus
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus
Deepinfra
Chat
Input $0.270000 / 1M tokens
Output $1.00 / 1M tokens
163k
163k
Functions
gemini-2.0-flash-001
deepinfra/google/gemini-2.0-flash-001
Deepinfra
Chat
Input $0.100000 / 1M tokens
Output $0.400000 / 1M tokens
1000k
1000k
Functions
gemini-2.5-flash
deepinfra/google/gemini-2.5-flash
Deepinfra
Chat
Input $0.300000 / 1M tokens
Output $2.50 / 1M tokens
1000k
1000k
Functions
gemini-2.5-pro
deepinfra/google/gemini-2.5-pro
Deepinfra
Chat
Input $1.25 / 1M tokens
Output $10.00 / 1M tokens
1000k
1000k
Functions
gemma-3-12b-it
deepinfra/google/gemma-3-12b-it
Deepinfra
Chat
Input $0.050000 / 1M tokens
Output $0.100000 / 1M tokens
131k
131k
Functions
gemma-3-27b-it
deepinfra/google/gemma-3-27b-it
Deepinfra
Chat
Input $0.090000 / 1M tokens
Output $0.160000 / 1M tokens
131k
131k
Functions
gemma-3-4b-it
deepinfra/google/gemma-3-4b-it
Deepinfra
Chat
Input $0.040000 / 1M tokens
Output $0.080000 / 1M tokens
131k
131k
Functions
GLM-4.5
deepinfra/zai-org/GLM-4.5
Deepinfra
Chat
Input $0.400000 / 1M tokens
Output $1.60 / 1M tokens
131k
131k
Functions
gpt-oss-120b
deepinfra/openai/gpt-oss-120b
Deepinfra
Chat
Input $0.050000 / 1M tokens
Output $0.450000 / 1M tokens
131k
131k
Functions
gpt-oss-20b
deepinfra/openai/gpt-oss-20b
Deepinfra
Chat
Input $0.040000 / 1M tokens
Output $0.150000 / 1M tokens
131k
131k
Functions
Hermes-3-Llama-3.1-405B
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
Deepinfra
Chat
Input $1.00 / 1M tokens
Output $1.00 / 1M tokens
131k
131k
Functions
Hermes-3-Llama-3.1-70B
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
Deepinfra
Chat
Input $0.300000 / 1M tokens
Output $0.300000 / 1M tokens
131k
131k
Kimi-K2-Instruct
deepinfra/moonshotai/Kimi-K2-Instruct
Deepinfra
Chat
Input $0.500000 / 1M tokens
Output $2.00 / 1M tokens
131k
131k
Functions
Kimi-K2-Instruct-0905
deepinfra/moonshotai/Kimi-K2-Instruct-0905
Deepinfra
Chat
Input $0.500000 / 1M tokens
Output $2.00 / 1M tokens
262k
262k
Functions
L3-8B-Lunaris-v1-Turbo
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo
Deepinfra
Chat
Input $0.040000 / 1M tokens
Output $0.050000 / 1M tokens
8,192
8,192
L3.1-70B-Euryale-v2.2
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2
Deepinfra
Chat
Input $0.650000 / 1M tokens
Output $0.750000 / 1M tokens
131k
131k
L3.3-70B-Euryale-v2.3
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3
Deepinfra
Chat
Input $0.650000 / 1M tokens
Output $0.750000 / 1M tokens
131k
131k
Llama-3.1-Nemotron-70B-Instruct
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct
Deepinfra
Chat
Input $0.600000 / 1M tokens
Output $0.600000 / 1M tokens
131k
131k
Functions
Llama-3.2-11B-Vision-Instruct
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct
Deepinfra
Chat
Input $0.049000 / 1M tokens
Output $0.049000 / 1M tokens
131k
131k
Llama-3.2-3B-Instruct
deepinfra/meta-llama/Llama-3.2-3B-Instruct
Deepinfra
Chat
Input $0.020000 / 1M tokens
Output $0.020000 / 1M tokens
131k
131k
Functions
Llama-3.3-70B-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct
Deepinfra
Chat
Input $0.230000 / 1M tokens
Output $0.400000 / 1M tokens
131k
131k
Functions
Llama-3.3-70B-Instruct-Turbo
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
Deepinfra
Chat
Input $0.130000 / 1M tokens
Output $0.390000 / 1M tokens
131k
131k
Functions
Llama-3.3-Nemotron-Super-49B-v1.5
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
Deepinfra
Chat
Input $0.100000 / 1M tokens
Output $0.400000 / 1M tokens
131k
131k
Functions
Llama-4-Maverick-17B-128E-Instruct-FP8
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Deepinfra
Chat
Input $0.150000 / 1M tokens
Output $0.600000 / 1M tokens
1048k
1048k
Functions
Llama-4-Scout-17B-16E-Instruct
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
Deepinfra
Chat
Input $0.080000 / 1M tokens
Output $0.300000 / 1M tokens
327k
327k
Functions
Llama-Guard-3-8B
deepinfra/meta-llama/Llama-Guard-3-8B
Deepinfra
Chat
Input $0.055000 / 1M tokens
Output $0.055000 / 1M tokens
131k
131k
Llama-Guard-4-12B
deepinfra/meta-llama/Llama-Guard-4-12B
Deepinfra
Chat
Input $0.180000 / 1M tokens
Output $0.180000 / 1M tokens
163k
163k
Meta-Llama-3-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
Deepinfra
Chat
Input $0.030000 / 1M tokens
Output $0.060000 / 1M tokens
8,192
8,192
Functions
Meta-Llama-3.1-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
Deepinfra
Chat
Input $0.400000 / 1M tokens
Output $0.400000 / 1M tokens
131k
131k
Functions
Meta-Llama-3.1-70B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
Deepinfra
Chat
Input $0.100000 / 1M tokens
Output $0.280000 / 1M tokens
131k
131k
Functions
Meta-Llama-3.1-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
Deepinfra
Chat
Input $0.030000 / 1M tokens
Output $0.050000 / 1M tokens
131k
131k
Functions
Meta-Llama-3.1-8B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
Deepinfra
Chat
Input $0.020000 / 1M tokens
Output $0.030000 / 1M tokens
131k
131k
Functions
Mistral-Nemo-Instruct-2407
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
Deepinfra
Chat
Input $0.020000 / 1M tokens
Output $0.040000 / 1M tokens
131k
131k
Functions
Mistral-Small-24B-Instruct-2501
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501
Deepinfra
Chat
Input $0.050000 / 1M tokens
Output $0.080000 / 1M tokens
32,768
32,768
Functions
Mistral-Small-3.2-24B-Instruct-2506
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506
Deepinfra
Chat
Input $0.075000 / 1M tokens
Output $0.200000 / 1M tokens
128k
128k
Functions
Mixtral-8x7B-Instruct-v0.1
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1
Deepinfra
Chat
Input $0.400000 / 1M tokens
Output $0.400000 / 1M tokens
32,768
32,768
Functions
MythoMax-L2-13b
deepinfra/Gryphe/MythoMax-L2-13b
Deepinfra
Chat
Input $0.080000 / 1M tokens
Output $0.090000 / 1M tokens
4,096
4,096
Functions
NVIDIA-Nemotron-Nano-9B-v2
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2
Deepinfra
Chat
Input $0.040000 / 1M tokens
Output $0.160000 / 1M tokens
131k
131k
Functions
olmOCR-7B-0725-FP8
deepinfra/allenai/olmOCR-7B-0725-FP8
Deepinfra
Chat
Input $0.270000 / 1M tokens
Output $1.50 / 1M tokens
16,384
16,384
phi-4
deepinfra/microsoft/phi-4
Deepinfra
Chat
Input $0.070000 / 1M tokens
Output $0.140000 / 1M tokens
16,384
16,384
Functions
Qwen2.5-72B-Instruct
deepinfra/Qwen/Qwen2.5-72B-Instruct
Deepinfra
Chat
Input $0.120000 / 1M tokens
Output $0.390000 / 1M tokens
32,768
32,768
Functions
Qwen2.5-7B-Instruct
deepinfra/Qwen/Qwen2.5-7B-Instruct
Deepinfra
Chat
Input $0.040000 / 1M tokens
Output $0.100000 / 1M tokens
32,768
32,768
Qwen2.5-VL-32B-Instruct
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct
Deepinfra
Chat
Input $0.200000 / 1M tokens
Output $0.600000 / 1M tokens
128k
128k
Functions
Vision
Qwen3-14B
deepinfra/Qwen/Qwen3-14B
Deepinfra
Chat
Input $0.060000 / 1M tokens
Output $0.240000 / 1M tokens
40,960
40,960
Functions
Qwen3-235B-A22B
deepinfra/Qwen/Qwen3-235B-A22B
Deepinfra
Chat
Input $0.180000 / 1M tokens
Output $0.540000 / 1M tokens
40,960
40,960
Functions
Qwen3-235B-A22B-Instruct-2507
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507
Deepinfra
Chat
Input $0.090000 / 1M tokens
Output $0.600000 / 1M tokens
262k
262k
Functions
Qwen3-235B-A22B-Thinking-2507
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
Deepinfra
Chat
Input $0.300000 / 1M tokens
Output $2.90 / 1M tokens
262k
262k
Functions
Qwen3-30B-A3B
deepinfra/Qwen/Qwen3-30B-A3B
Deepinfra
Chat
Input $0.080000 / 1M tokens
Output $0.290000 / 1M tokens
40,960
40,960
Functions
Qwen3-32B
deepinfra/Qwen/Qwen3-32B
Deepinfra
Chat
Input $0.100000 / 1M tokens
Output $0.280000 / 1M tokens
40,960
40,960
Functions
Qwen3-Coder-480B-A35B-Instruct
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct
Deepinfra
Chat
Input $0.400000 / 1M tokens
Output $1.60 / 1M tokens
262k
262k
Functions
Qwen3-Coder-480B-A35B-Instruct-Turbo
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
Deepinfra
Chat
Input $0.290000 / 1M tokens
Output $1.20 / 1M tokens
262k
262k
Functions
Qwen3-Next-80B-A3B-Instruct
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct
Deepinfra
Chat
Input $0.140000 / 1M tokens
Output $1.40 / 1M tokens
262k
262k
Functions
Qwen3-Next-80B-A3B-Thinking
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking
Deepinfra
Chat
Input $0.140000 / 1M tokens
Output $1.40 / 1M tokens
262k
262k
Functions
QwQ-32B
deepinfra/Qwen/QwQ-32B
Deepinfra
Chat
Input $0.150000 / 1M tokens
Output $0.400000 / 1M tokens
131k
131k
Functions
WizardLM-2-8x22B
deepinfra/microsoft/WizardLM-2-8x22B
Deepinfra
Chat
Input $0.480000 / 1M tokens
Output $0.480000 / 1M tokens
65,536
65,536