Try Bifrost Enterprise free for 14 days.
Request access

Deepinfra logoDeepinfra Models

Browse all 67 AI models from Deepinfra

Total Models
67
Modes
1
Avg Input (1M Tokens)
$0.59
Avg Output (1M Tokens)
$2.56

All Deepinfra Models

Click on any model to view details

Showing 67 of 67 models
Model Name
Provider
Mode
Pricing
Capabilities
claude-3-7-sonnet-latest
deepinfra/anthropic/claude-3-7-sonnet-latest
Deepinfra logoDeepinfraChat
Input $3.30 / 1M tokensOutput $16.50 / 1M tokens
Functions
claude-4-opus
deepinfra/anthropic/claude-4-opus
Deepinfra logoDeepinfraChat
Input $16.50 / 1M tokensOutput $82.50 / 1M tokens
Functions
claude-4-sonnet
deepinfra/anthropic/claude-4-sonnet
Deepinfra logoDeepinfraChat
Input $3.30 / 1M tokensOutput $16.50 / 1M tokens
Functions
DeepSeek-R1
deepinfra/deepseek-ai/DeepSeek-R1
Deepinfra logoDeepinfraChat
Input $0.700000 / 1M tokensOutput $2.40 / 1M tokens
Functions
DeepSeek-R1-0528
deepinfra/deepseek-ai/DeepSeek-R1-0528
Deepinfra logoDeepinfraChat
Input $0.500000 / 1M tokensOutput $2.15 / 1M tokens
Functions
DeepSeek-R1-0528-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo
Deepinfra logoDeepinfraChat
Input $1.00 / 1M tokensOutput $3.00 / 1M tokens
Functions
DeepSeek-R1-Distill-Llama-70B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Deepinfra logoDeepinfraChat
Input $0.200000 / 1M tokensOutput $0.600000 / 1M tokens
DeepSeek-R1-Distill-Qwen-32B
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Deepinfra logoDeepinfraChat
Input $0.270000 / 1M tokensOutput $0.270000 / 1M tokens
Functions
DeepSeek-R1-Turbo
deepinfra/deepseek-ai/DeepSeek-R1-Turbo
Deepinfra logoDeepinfraChat
Input $1.00 / 1M tokensOutput $3.00 / 1M tokens
Functions
DeepSeek-V3
deepinfra/deepseek-ai/DeepSeek-V3
Deepinfra logoDeepinfraChat
Input $0.380000 / 1M tokensOutput $0.890000 / 1M tokens
Functions
DeepSeek-V3-0324
deepinfra/deepseek-ai/DeepSeek-V3-0324
Deepinfra logoDeepinfraChat
Input $0.250000 / 1M tokensOutput $0.880000 / 1M tokens
Functions
DeepSeek-V3.1
deepinfra/deepseek-ai/DeepSeek-V3.1
Deepinfra logoDeepinfraChat
Input $0.270000 / 1M tokensOutput $1.00 / 1M tokens
FunctionsReasoning
DeepSeek-V3.1-Terminus
deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus
Deepinfra logoDeepinfraChat
Input $0.270000 / 1M tokensOutput $1.00 / 1M tokens
Functions
gemini-2.0-flash-001
deepinfra/google/gemini-2.0-flash-001
Deepinfra logoDeepinfraChat
Input $0.100000 / 1M tokensOutput $0.400000 / 1M tokens
Functions
gemini-2.5-flash
deepinfra/google/gemini-2.5-flash
Deepinfra logoDeepinfraChat
Input $0.300000 / 1M tokensOutput $2.50 / 1M tokens
Functions
gemini-2.5-pro
deepinfra/google/gemini-2.5-pro
Deepinfra logoDeepinfraChat
Input $1.25 / 1M tokensOutput $10.00 / 1M tokens
Functions
gemma-3-12b-it
deepinfra/google/gemma-3-12b-it
Deepinfra logoDeepinfraChat
Input $0.050000 / 1M tokensOutput $0.100000 / 1M tokens
Functions
gemma-3-27b-it
deepinfra/google/gemma-3-27b-it
Deepinfra logoDeepinfraChat
Input $0.090000 / 1M tokensOutput $0.160000 / 1M tokens
Functions
gemma-3-4b-it
deepinfra/google/gemma-3-4b-it
Deepinfra logoDeepinfraChat
Input $0.040000 / 1M tokensOutput $0.080000 / 1M tokens
Functions
GLM-4.5
deepinfra/zai-org/GLM-4.5
Deepinfra logoDeepinfraChat
Input $0.400000 / 1M tokensOutput $1.60 / 1M tokens
Functions
gpt-oss-120b
deepinfra/openai/gpt-oss-120b
Deepinfra logoDeepinfraChat
Input $0.050000 / 1M tokensOutput $0.450000 / 1M tokens
Functions
gpt-oss-20b
deepinfra/openai/gpt-oss-20b
Deepinfra logoDeepinfraChat
Input $0.040000 / 1M tokensOutput $0.150000 / 1M tokens
Functions
Hermes-3-Llama-3.1-405B
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B
Deepinfra logoDeepinfraChat
Input $1.00 / 1M tokensOutput $1.00 / 1M tokens
Functions
Hermes-3-Llama-3.1-70B
deepinfra/NousResearch/Hermes-3-Llama-3.1-70B
Deepinfra logoDeepinfraChat
Input $0.300000 / 1M tokensOutput $0.300000 / 1M tokens
Kimi-K2-Instruct
deepinfra/moonshotai/Kimi-K2-Instruct
Deepinfra logoDeepinfraChat
Input $0.500000 / 1M tokensOutput $2.00 / 1M tokens
Functions
Kimi-K2-Instruct-0905
deepinfra/moonshotai/Kimi-K2-Instruct-0905
Deepinfra logoDeepinfraChat
Input $0.500000 / 1M tokensOutput $2.00 / 1M tokens
Functions
L3-8B-Lunaris-v1-Turbo
deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo
Deepinfra logoDeepinfraChat
Input $0.040000 / 1M tokensOutput $0.050000 / 1M tokens
L3.1-70B-Euryale-v2.2
deepinfra/Sao10K/L3.1-70B-Euryale-v2.2
Deepinfra logoDeepinfraChat
Input $0.650000 / 1M tokensOutput $0.750000 / 1M tokens
L3.3-70B-Euryale-v2.3
deepinfra/Sao10K/L3.3-70B-Euryale-v2.3
Deepinfra logoDeepinfraChat
Input $0.650000 / 1M tokensOutput $0.750000 / 1M tokens
Llama-3.1-Nemotron-70B-Instruct
deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct
Deepinfra logoDeepinfraChat
Input $0.600000 / 1M tokensOutput $0.600000 / 1M tokens
Functions
Llama-3.2-11B-Vision-Instruct
deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct
Deepinfra logoDeepinfraChat
Input $0.049000 / 1M tokensOutput $0.049000 / 1M tokens
Llama-3.2-3B-Instruct
deepinfra/meta-llama/Llama-3.2-3B-Instruct
Deepinfra logoDeepinfraChat
Input $0.020000 / 1M tokensOutput $0.020000 / 1M tokens
Functions
Llama-3.3-70B-Instruct
deepinfra/meta-llama/Llama-3.3-70B-Instruct
Deepinfra logoDeepinfraChat
Input $0.230000 / 1M tokensOutput $0.400000 / 1M tokens
Functions
Llama-3.3-70B-Instruct-Turbo
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
Deepinfra logoDeepinfraChat
Input $0.130000 / 1M tokensOutput $0.390000 / 1M tokens
Functions
Llama-3.3-Nemotron-Super-49B-v1.5
deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
Deepinfra logoDeepinfraChat
Input $0.100000 / 1M tokensOutput $0.400000 / 1M tokens
Functions
Llama-4-Maverick-17B-128E-Instruct-FP8
deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Deepinfra logoDeepinfraChat
Input $0.150000 / 1M tokensOutput $0.600000 / 1M tokens
Functions
Llama-4-Scout-17B-16E-Instruct
deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct
Deepinfra logoDeepinfraChat
Input $0.080000 / 1M tokensOutput $0.300000 / 1M tokens
Functions
Llama-Guard-3-8B
deepinfra/meta-llama/Llama-Guard-3-8B
Deepinfra logoDeepinfraChat
Input $0.055000 / 1M tokensOutput $0.055000 / 1M tokens
Llama-Guard-4-12B
deepinfra/meta-llama/Llama-Guard-4-12B
Deepinfra logoDeepinfraChat
Input $0.180000 / 1M tokensOutput $0.180000 / 1M tokens
Meta-Llama-3-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3-8B-Instruct
Deepinfra logoDeepinfraChat
Input $0.030000 / 1M tokensOutput $0.060000 / 1M tokens
Functions
Meta-Llama-3.1-70B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct
Deepinfra logoDeepinfraChat
Input $0.400000 / 1M tokensOutput $0.400000 / 1M tokens
Functions
Meta-Llama-3.1-70B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
Deepinfra logoDeepinfraChat
Input $0.100000 / 1M tokensOutput $0.280000 / 1M tokens
Functions
Meta-Llama-3.1-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
Deepinfra logoDeepinfraChat
Input $0.030000 / 1M tokensOutput $0.050000 / 1M tokens
Functions
Meta-Llama-3.1-8B-Instruct-Turbo
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
Deepinfra logoDeepinfraChat
Input $0.020000 / 1M tokensOutput $0.030000 / 1M tokens
Functions
Mistral-Nemo-Instruct-2407
deepinfra/mistralai/Mistral-Nemo-Instruct-2407
Deepinfra logoDeepinfraChat
Input $0.020000 / 1M tokensOutput $0.040000 / 1M tokens
Functions
Mistral-Small-24B-Instruct-2501
deepinfra/mistralai/Mistral-Small-24B-Instruct-2501
Deepinfra logoDeepinfraChat
Input $0.050000 / 1M tokensOutput $0.080000 / 1M tokens
Functions
Mistral-Small-3.2-24B-Instruct-2506
deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506
Deepinfra logoDeepinfraChat
Input $0.075000 / 1M tokensOutput $0.200000 / 1M tokens
Functions
Mixtral-8x7B-Instruct-v0.1
deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1
Deepinfra logoDeepinfraChat
Input $0.400000 / 1M tokensOutput $0.400000 / 1M tokens
Functions
MythoMax-L2-13b
deepinfra/Gryphe/MythoMax-L2-13b
Deepinfra logoDeepinfraChat
Input $0.080000 / 1M tokensOutput $0.090000 / 1M tokens
Functions
NVIDIA-Nemotron-Nano-9B-v2
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2
Deepinfra logoDeepinfraChat
Input $0.040000 / 1M tokensOutput $0.160000 / 1M tokens
Functions
olmOCR-7B-0725-FP8
deepinfra/allenai/olmOCR-7B-0725-FP8
Deepinfra logoDeepinfraChat
Input $0.270000 / 1M tokensOutput $1.50 / 1M tokens
phi-4
deepinfra/microsoft/phi-4
Deepinfra logoDeepinfraChat
Input $0.070000 / 1M tokensOutput $0.140000 / 1M tokens
Functions
Qwen2.5-72B-Instruct
deepinfra/Qwen/Qwen2.5-72B-Instruct
Deepinfra logoDeepinfraChat
Input $0.120000 / 1M tokensOutput $0.390000 / 1M tokens
Functions
Qwen2.5-7B-Instruct
deepinfra/Qwen/Qwen2.5-7B-Instruct
Deepinfra logoDeepinfraChat
Input $0.040000 / 1M tokensOutput $0.100000 / 1M tokens
Qwen2.5-VL-32B-Instruct
deepinfra/Qwen/Qwen2.5-VL-32B-Instruct
Deepinfra logoDeepinfraChat
Input $0.200000 / 1M tokensOutput $0.600000 / 1M tokens
FunctionsVision
Qwen3-14B
deepinfra/Qwen/Qwen3-14B
Deepinfra logoDeepinfraChat
Input $0.060000 / 1M tokensOutput $0.240000 / 1M tokens
Functions
Qwen3-235B-A22B
deepinfra/Qwen/Qwen3-235B-A22B
Deepinfra logoDeepinfraChat
Input $0.180000 / 1M tokensOutput $0.540000 / 1M tokens
Functions
Qwen3-235B-A22B-Instruct-2507
deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507
Deepinfra logoDeepinfraChat
Input $0.090000 / 1M tokensOutput $0.600000 / 1M tokens
Functions
Qwen3-235B-A22B-Thinking-2507
deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507
Deepinfra logoDeepinfraChat
Input $0.300000 / 1M tokensOutput $2.90 / 1M tokens
Functions
Qwen3-30B-A3B
deepinfra/Qwen/Qwen3-30B-A3B
Deepinfra logoDeepinfraChat
Input $0.080000 / 1M tokensOutput $0.290000 / 1M tokens
Functions
Qwen3-32B
deepinfra/Qwen/Qwen3-32B
Deepinfra logoDeepinfraChat
Input $0.100000 / 1M tokensOutput $0.280000 / 1M tokens
Functions
Qwen3-Coder-480B-A35B-Instruct
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct
Deepinfra logoDeepinfraChat
Input $0.400000 / 1M tokensOutput $1.60 / 1M tokens
Functions
Qwen3-Coder-480B-A35B-Instruct-Turbo
deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo
Deepinfra logoDeepinfraChat
Input $0.290000 / 1M tokensOutput $1.20 / 1M tokens
Functions
Qwen3-Next-80B-A3B-Instruct
deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct
Deepinfra logoDeepinfraChat
Input $0.140000 / 1M tokensOutput $1.40 / 1M tokens
Functions
Qwen3-Next-80B-A3B-Thinking
deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking
Deepinfra logoDeepinfraChat
Input $0.140000 / 1M tokensOutput $1.40 / 1M tokens
Functions
QwQ-32B
deepinfra/Qwen/QwQ-32B
Deepinfra logoDeepinfraChat
Input $0.150000 / 1M tokensOutput $0.400000 / 1M tokens
Functions
WizardLM-2-8x22B
deepinfra/microsoft/WizardLM-2-8x22B
Deepinfra logoDeepinfraChat
Input $0.480000 / 1M tokensOutput $0.480000 / 1M tokens