Try Bifrost Enterprise free for 14 days.
Request access
[ PROVIDER OVERVIEW ]

DeepInfra Models

Browse all 67 AI models from DeepInfra. Compare pricing, context windows, and capabilities.

Total Models
67
Modes
1
Avg Input (1M Tokens)
$0.67
Avg Output (1M Tokens)
$2.80

All DeepInfra Models

Click on any model to calculate costs

Showing 67 of 67 models
Model Name
Provider
Mode
Input
Output
Capabilities
claude-3-7-sonnet-latest
deepinfra/anthropic/claude-3-7-sonnet-latest
DeepInfraChat$3.30$16.50
Functions
claude-4-opus
deepinfra/anthropic/claude-4-opus
DeepInfraChat$16.50$82.50
Functions
claude-4-sonnet
deepinfra/anthropic/claude-4-sonnet
DeepInfraChat$3.30$16.50
Functions
DeepSeek-R1
deepinfra/deepseek-ai/DeepSeek-R1
DeepInfraChat$0.50$2.15
Functions
DeepSeek-R1-0528
deepinfra/deepseek-ai/DeepSeek-R1-0528
DeepInfraChat$0.50$2.15
Functions
DeepSeek-R1-0528-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo
DeepInfraChat$1.00$3.00
Functions
DeepSeek-R1-Distill-Llama-70B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
DeepInfraChat$0.70$0.80
DeepSeek-R1-Distill-Qwen-32B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
DeepInfraChat$0.70$0.80
Functions
DeepSeek-R1-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-Turbo
DeepInfraChat$0.50$2.15
Functions
DeepSeek-V3
deepinfra/deepseek-ai/DeepSeek-V3
DeepInfraChat$0.32$0.89
Functions
DeepSeek-V3-0324
deepinfra/deepseek-ai/DeepSeek-V3-0324
DeepInfraChat$0.20$0.77
Functions
DeepSeek-V3.1
deepinfra/deepseek-ai/DeepSeek-V3.1
DeepInfraChat$0.21$0.79
FunctionsReasoning
DeepSeek-V3.1-Terminus
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus
DeepInfraChat$0.27$0.95
Functions
gemini-2.0-flash-001
deepinfra/google/gemini-2.0-flash-001
DeepInfraChat$0.10$0.40
Functions
gemini-2.5-flash
deepinfra/google/gemini-2.5-flash
DeepInfraChat$0.30$2.50
Functions
gemini-2.5-pro
deepinfra/google/gemini-2.5-pro
DeepInfraChat$1.25$10.00
Functions
gemma-3-12b-it
deepinfra/google/gemma-3-12b-it
DeepInfraChat$0.04$0.13
Functions
gemma-3-27b-it
deepinfra/google/gemma-3-27b-it
DeepInfraChat$0.08$0.16
Functions
gemma-3-4b-it
deepinfra/google/gemma-3-4b-it
DeepInfraChat$0.04$0.08
Functions
GLM-4.5
deepinfra/zai-org/GLM-4.5
DeepInfraChat$0.43$1.74
Functions
gpt-oss-120b
deepinfra/openai/gpt-oss-120b
DeepInfraChat$0.04$0.19
Functions
gpt-oss-20b
deepinfra/openai/gpt-oss-20b
DeepInfraChat$0.03$0.14
Functions
Hermes-3-Llama-3.1-405B
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
DeepInfraChat$1.00$1.00
Functions
Hermes-3-Llama-3.1-70B
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
DeepInfraChat$0.30$0.30
Kimi-K2-Instruct
deepinfra/moonshotai/Kimi-K2-Instruct
DeepInfraChat$4.00$20.00
Functions
Kimi-K2-Instruct-0905
deepinfra/moonshotai/Kimi-K2-Instruct-0905
DeepInfraChat$0.40$2.00
Functions
L3-8B-Lunaris-v1-Turbo
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo
DeepInfraChat$0.04$0.05
L3.1-70B-Euryale-v2.2
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2
DeepInfraChat$0.85$0.85
L3.3-70B-Euryale-v2.3
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3
DeepInfraChat$0.85$0.85
Llama-3.1-Nemotron-70B-Instruct
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct
DeepInfraChat$1.20$1.20
Functions
Llama-3.2-11B-Vision-Instruct
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct
DeepInfraChat$0.24$0.24
Llama-3.2-3B-Instruct
deepinfra/meta-llama/Llama-3.2-3B-Instruct
DeepInfraChat$0.02$0.02
Functions
Llama-3.3-70B-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct
DeepInfraChat$0.10$0.32
Functions
Llama-3.3-70B-Instruct-Turbo
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
DeepInfraChat$0.10$0.32
Functions
Llama-3.3-Nemotron-Super-49B-v1.5
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
DeepInfraChat$0.10$0.40
Functions
Llama-4-Maverick-17B-128E-Instruct-FP8
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
DeepInfraChat$0.15$0.60
Functions
Llama-4-Scout-17B-16E-Instruct
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
DeepInfraChat$0.08$0.30
Functions
Llama-Guard-3-8B
deepinfra/meta-llama/Llama-Guard-3-8B
DeepInfraChat$0.18$0.18
Llama-Guard-4-12B
deepinfra/meta-llama/Llama-Guard-4-12B
DeepInfraChat$0.18$0.18
Meta-Llama-3-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
DeepInfraChat$0.03$0.04
Functions
Meta-Llama-3.1-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
DeepInfraChat$0.40$0.40
Functions
Meta-Llama-3.1-70B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
DeepInfraChat$0.40$0.40
Functions
Meta-Llama-3.1-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
DeepInfraChat$0.02$0.05
Functions
Meta-Llama-3.1-8B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
DeepInfraChat$0.02$0.03
Functions
Mistral-Nemo-Instruct-2407
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
DeepInfraChat$0.02$0.04
Functions
Mistral-Small-24B-Instruct-2501
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501
DeepInfraChat$0.05$0.08
Functions
Mistral-Small-3.2-24B-Instruct-2506
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506
DeepInfraChat$0.08$0.20
Functions
Mixtral-8x7B-Instruct-v0.1
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1
DeepInfraChat$0.54$0.54
Functions
MythoMax-L2-13b
deepinfra/Gryphe/MythoMax-L2-13b
DeepInfraChat$0.40$0.40
Functions
NVIDIA-Nemotron-Nano-9B-v2
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2
DeepInfraChat$0.04$0.16
Functions
olmOCR-7B-0725-FP8
deepinfra/allenai/olmOCR-7B-0725-FP8
DeepInfraChat$0.09$0.19
phi-4
deepinfra/microsoft/phi-4
DeepInfraChat$0.07$0.14
Functions
Qwen2.5-72B-Instruct
deepinfra/Qwen/Qwen2.5-72B-Instruct
DeepInfraChat$0.36$0.40
Functions
Qwen2.5-7B-Instruct
deepinfra/Qwen/Qwen2.5-7B-Instruct
DeepInfraChat$0.12$0.24
Qwen2.5-VL-32B-Instruct
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct
DeepInfraChat$0.20$0.60
FunctionsVision
Qwen3-14B
deepinfra/Qwen/Qwen3-14B
DeepInfraChat$0.12$0.24
Functions
Qwen3-235B-A22B
deepinfra/Qwen/Qwen3-235B-A22B
DeepInfraChat$0.07$0.10
Functions
Qwen3-235B-A22B-Instruct-2507
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507
DeepInfraChat$0.07$0.10
Functions
Qwen3-235B-A22B-Thinking-2507
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
DeepInfraChat$0.23$2.30
Functions
Qwen3-30B-A3B
deepinfra/Qwen/Qwen3-30B-A3B
DeepInfraChat$0.09$0.45
Functions
Qwen3-32B
deepinfra/Qwen/Qwen3-32B
DeepInfraChat$0.08$0.28
Functions
Qwen3-Coder-480B-A35B-Instruct
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct
DeepInfraChat$0.40$1.60
Functions
Qwen3-Coder-480B-A35B-Instruct-Turbo
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
DeepInfraChat$0.30$1.00
Functions
Qwen3-Next-80B-A3B-Instruct
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct
DeepInfraChat$0.09$1.10
Functions
Qwen3-Next-80B-A3B-Thinking
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking
DeepInfraChat$0.14$1.40
Functions
QwQ-32B
deepinfra/Qwen/QwQ-32B
DeepInfraChat$0.08$0.28
Functions
WizardLM-2-8x22B
deepinfra/microsoft/WizardLM-2-8x22B
DeepInfraChat$0.48$0.48