UPDATES
Try Bifrost Enterprise free for 14 days.
Explore now
Performance
Features
Enterprise
Docs
Blog
Discord
Github
Book a Demo
Home
>
DeepInfra
DeepInfra Models
Browse all 67 AI models from DeepInfra
Total Models
67
Modes
1
Avg Input (1M Tokens)
$0.62
Avg Output (1M Tokens)
$2.58
All DeepInfra Models
Click on any model to calculate costs
All Modes
Showing 67 of 67 models
Model Name↑
Provider
Mode
Input
Cost
(per 1M tokens)
Output
Cost
(per 1M tokens)
Max Input
Tokens
Max Output
Tokens
Capabilities
claude-3-7-sonnet-latest
deepinfra/anthropic/claude-3-7-sonnet-latest
DeepInfra
Chat
$3.30
$16.50
200k
200k
claude-4-opus
deepinfra/anthropic/claude-4-opus
DeepInfra
Chat
$16.50
$82.50
200k
200k
claude-4-sonnet
deepinfra/anthropic/claude-4-sonnet
DeepInfra
Chat
$3.30
$16.50
200k
200k
DeepSeek-R1
deepinfra/deepseek-ai/DeepSeek-R1
DeepInfra
Chat
$0.70
$2.40
164k
164k
DeepSeek-R1-0528
deepinfra/deepseek-ai/DeepSeek-R1-0528
DeepInfra
Chat
$0.50
$2.15
164k
164k
DeepSeek-R1-0528-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo
DeepInfra
Chat
$1.00
$3.00
32,768
32,768
DeepSeek-R1-Distill-Llama-70B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
DeepInfra
Chat
$0.60
$1.20
131k
131k
DeepSeek-R1-Distill-Qwen-32B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
DeepInfra
Chat
$0.27
$0.27
131k
131k
DeepSeek-R1-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-Turbo
DeepInfra
Chat
$1.00
$3.00
40,960
40,960
DeepSeek-V3
deepinfra/deepseek-ai/DeepSeek-V3
DeepInfra
Chat
$0.38
$0.89
164k
164k
DeepSeek-V3-0324
deepinfra/deepseek-ai/DeepSeek-V3-0324
DeepInfra
Chat
$0.20
$0.88
164k
164k
DeepSeek-V3.1
deepinfra/deepseek-ai/DeepSeek-V3.1
DeepInfra
Chat
$0.21
$0.79
164k
164k
Reasoning
DeepSeek-V3.1-Terminus
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus
DeepInfra
Chat
$0.21
$0.79
164k
164k
gemini-2.0-flash-001
deepinfra/google/gemini-2.0-flash-001
DeepInfra
Chat
$0.10
$0.40
1000k
1000k
gemini-2.5-flash
deepinfra/google/gemini-2.5-flash
DeepInfra
Chat
$0.30
$2.50
1000k
1000k
gemini-2.5-pro
deepinfra/google/gemini-2.5-pro
DeepInfra
Chat
$1.25
$10.00
1000k
1000k
gemma-3-12b-it
deepinfra/google/gemma-3-12b-it
DeepInfra
Chat
$0.04
$0.13
131k
131k
gemma-3-27b-it
deepinfra/google/gemma-3-27b-it
DeepInfra
Chat
$0.09
$0.16
131k
131k
gemma-3-4b-it
deepinfra/google/gemma-3-4b-it
DeepInfra
Chat
$0.04
$0.08
131k
131k
GLM-4.5
deepinfra/zai-org/GLM-4.5
DeepInfra
Chat
$0.60
$2.20
131k
131k
gpt-oss-120b
deepinfra/openai/gpt-oss-120b
DeepInfra
Chat
$0.04
$0.19
131k
131k
gpt-oss-20b
deepinfra/openai/gpt-oss-20b
DeepInfra
Chat
$0.03
$0.14
131k
131k
Hermes-3-Llama-3.1-405B
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
DeepInfra
Chat
$1.00
$1.00
131k
131k
Hermes-3-Llama-3.1-70B
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
DeepInfra
Chat
$0.30
$0.30
131k
131k
Kimi-K2-Instruct
deepinfra/moonshotai/Kimi-K2-Instruct
DeepInfra
Chat
$0.50
$2.00
131k
131k
Kimi-K2-Instruct-0905
deepinfra/moonshotai/Kimi-K2-Instruct-0905
DeepInfra
Chat
$0.50
$2.00
262k
262k
L3-8B-Lunaris-v1-Turbo
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo
DeepInfra
Chat
$0.04
$0.05
8,192
8,192
L3.1-70B-Euryale-v2.2
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2
DeepInfra
Chat
$0.85
$0.85
131k
131k
L3.3-70B-Euryale-v2.3
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3
DeepInfra
Chat
$0.85
$0.85
131k
131k
Llama-3.1-Nemotron-70B-Instruct
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct
DeepInfra
Chat
$1.20
$1.20
131k
131k
Llama-3.2-11B-Vision-Instruct
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct
DeepInfra
Chat
$0.05
$0.05
131k
131k
Llama-3.2-3B-Instruct
deepinfra/meta-llama/Llama-3.2-3B-Instruct
DeepInfra
Chat
$0.02
$0.02
131k
131k
Llama-3.3-70B-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct
DeepInfra
Chat
$0.23
$0.40
131k
131k
Llama-3.3-70B-Instruct-Turbo
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
DeepInfra
Chat
$0.13
$0.38
131k
131k
Llama-3.3-Nemotron-Super-49B-v1.5
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
DeepInfra
Chat
$0.10
$0.40
131k
131k
Llama-4-Maverick-17B-128E-Instruct-FP8
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
DeepInfra
Chat
$0.15
$0.60
1049k
1049k
Llama-4-Scout-17B-16E-Instruct
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
DeepInfra
Chat
$0.08
$0.30
328k
328k
Llama-Guard-3-8B
deepinfra/meta-llama/Llama-Guard-3-8B
DeepInfra
Chat
$0.06
$0.06
131k
131k
Llama-Guard-4-12B
deepinfra/meta-llama/Llama-Guard-4-12B
DeepInfra
Chat
$0.18
$0.18
164k
164k
Meta-Llama-3-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
DeepInfra
Chat
$0.03
$0.06
8,192
8,192
Meta-Llama-3.1-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
DeepInfra
Chat
$0.40
$0.40
131k
131k
Meta-Llama-3.1-70B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
DeepInfra
Chat
$0.40
$0.40
131k
131k
Meta-Llama-3.1-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
DeepInfra
Chat
$0.03
$0.05
131k
131k
Meta-Llama-3.1-8B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
DeepInfra
Chat
$0.02
$0.03
131k
131k
Mistral-Nemo-Instruct-2407
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
DeepInfra
Chat
$0.02
$0.04
131k
131k
Mistral-Small-24B-Instruct-2501
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501
DeepInfra
Chat
$0.05
$0.08
32,768
32,768
Mistral-Small-3.2-24B-Instruct-2506
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506
DeepInfra
Chat
$0.08
$0.20
128k
128k
Mixtral-8x7B-Instruct-v0.1
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1
DeepInfra
Chat
$0.54
$0.54
32,768
32,768
MythoMax-L2-13b
deepinfra/Gryphe/MythoMax-L2-13b
DeepInfra
Chat
$0.08
$0.08
4,096
4,096
NVIDIA-Nemotron-Nano-9B-v2
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2
DeepInfra
Chat
$0.04
$0.16
131k
131k
olmOCR-7B-0725-FP8
deepinfra/allenai/olmOCR-7B-0725-FP8
DeepInfra
Chat
$0.27
$1.50
16,384
16,384
phi-4
deepinfra/microsoft/phi-4
DeepInfra
Chat
$0.07
$0.14
16,384
16,384
Qwen2.5-72B-Instruct
deepinfra/Qwen/Qwen2.5-72B-Instruct
DeepInfra
Chat
$0.12
$0.39
32,768
32,768
Qwen2.5-7B-Instruct
deepinfra/Qwen/Qwen2.5-7B-Instruct
DeepInfra
Chat
$0.04
$0.10
32,768
32,768
Qwen2.5-VL-32B-Instruct
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct
DeepInfra
Chat
$0.20
$0.60
128k
128k
Vision
Qwen3-14B
deepinfra/Qwen/Qwen3-14B
DeepInfra
Chat
$0.08
$0.24
40,960
40,960
Qwen3-235B-A22B
deepinfra/Qwen/Qwen3-235B-A22B
DeepInfra
Chat
$0.18
$0.54
40,960
40,960
Qwen3-235B-A22B-Instruct-2507
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507
DeepInfra
Chat
$0.07
$0.46
262k
262k
Qwen3-235B-A22B-Thinking-2507
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
DeepInfra
Chat
$0.30
$2.90
262k
262k
Qwen3-30B-A3B
deepinfra/Qwen/Qwen3-30B-A3B
DeepInfra
Chat
$0.08
$0.29
40,960
40,960
Qwen3-32B
deepinfra/Qwen/Qwen3-32B
DeepInfra
Chat
$0.10
$0.28
40,960
40,960
Qwen3-Coder-480B-A35B-Instruct
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct
DeepInfra
Chat
$0.40
$1.60
262k
262k
Qwen3-Coder-480B-A35B-Instruct-Turbo
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
DeepInfra
Chat
$0.29
$1.20
262k
262k
Qwen3-Next-80B-A3B-Instruct
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct
DeepInfra
Chat
$0.14
$1.10
262k
262k
Qwen3-Next-80B-A3B-Thinking
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking
DeepInfra
Chat
$0.14
$1.40
262k
262k
QwQ-32B
deepinfra/Qwen/QwQ-32B
DeepInfra
Chat
$0.15
$0.40
131k
131k
WizardLM-2-8x22B
deepinfra/microsoft/WizardLM-2-8x22B
DeepInfra
Chat
$0.48
$0.48
65,536
65,536