Try Bifrost Enterprise free for 14 days.
Request access
[ LLM COST CALCULATOR ]

Calculate LLM API Costs

Compare pricing across hundreds of AI models. Calculate costs for chat, image generation, audio transcription, and more.

[ OUR NUMBERS AT A GLANCE ]

Models
2,084
Providers
54
Modes
10

Browse by Provider

View all models from a specific provider

Loading providers...

All Models

Click on any model to calculate costs for your specific use case

Showing 100 of 2084 models
Model Name
Provider
Mode
Input
Output
Capabilities
claude-3.5-haiku
replicate/anthropic/claude-3.5-haiku
ReplicateChat$1.00$5.00
FunctionsVision
claude-3.5-sonnet
replicate/anthropic/claude-3.5-sonnet
ReplicateChat$3.75$18.75
FunctionsVision
claude-3.7-sonnet
replicate/anthropic/claude-3.7-sonnet
ReplicateChat$3.00$15.00
FunctionsVision
claude-4-sonnet
replicate/anthropic/claude-4-sonnet
ReplicateChat$3.00$15.00
FunctionsVision
claude-4.5-haiku
replicate/anthropic/claude-4.5-haiku
ReplicateChat$1.00$5.00
FunctionsVision
claude-4.5-sonnet
replicate/anthropic/claude-4.5-sonnet
ReplicateChat$3.00$15.00
FunctionsVision
deepseek-r1
replicate/deepseek-ai/deepseek-r1
ReplicateChat$3.75$10.00
Reasoning
DeepSeek-R1
sambanova/DeepSeek-R1
SambaNovaChat$5.00$7.00
DeepSeek-R1
together_ai/deepseek-ai/DeepSeek-R1
Together AIChat$3.00$7.00
Functions
DeepSeek-R1-0528-tput
together_ai/deepseek-ai/DeepSeek-R1-0528-tput
Together AIChat$0.55$2.19
Functions
DeepSeek-R1-Distill-Llama-70B
sambanova/DeepSeek-R1-Distill-Llama-70B
SambaNovaChat$0.70$1.40
deepseek-v3
replicate/deepseek-ai/deepseek-v3
ReplicateChat$1.45$1.45
Functions
DeepSeek-V3
together_ai/deepseek-ai/DeepSeek-V3
Together AIChat$1.25$1.25
Functions
DeepSeek-V3-0324
sambanova/DeepSeek-V3-0324
SambaNovaChat$3.00$4.50
FunctionsReasoning
deepseek-v3.1
replicate/deepseek-ai/deepseek-v3.1
ReplicateChat$0.67$2.02
FunctionsReasoning
DeepSeek-V3.1
sambanova/DeepSeek-V3.1
SambaNovaChat$3.00$4.50
FunctionsReasoning
DeepSeek-V3.1
together_ai/deepseek-ai/DeepSeek-V3.1
Together AIChat$0.60$1.70
FunctionsReasoning
gemini-2.5-flash
replicate/google/gemini-2.5-flash
ReplicateChat$2.50$2.50
FunctionsVision
gemini-3-pro
replicate/google/gemini-3-pro
ReplicateChat$2.00$12.00
FunctionsVision
gpt-4.1
replicate/openai/gpt-4.1
ReplicateChat$2.00$8.00
FunctionsVision
gpt-4.1-mini
replicate/openai/gpt-4.1-mini
ReplicateChat$0.40$1.60
FunctionsVision
gpt-4.1-nano
replicate/openai/gpt-4.1-nano
ReplicateChat$0.10$0.40
Functions
gpt-4o
replicate/openai/gpt-4o
ReplicateChat$2.50$10.00
FunctionsVisionAudio InAudio Out
gpt-4o-mini
replicate/openai/gpt-4o-mini
ReplicateChat$0.15$0.60
FunctionsVision
gpt-5
replicate/openai/gpt-5
ReplicateChat$1.25$10.00
FunctionsVision
gpt-5-mini
replicate/openai/gpt-5-mini
ReplicateChat$0.25$2.00
FunctionsVision
gpt-5-nano
replicate/openai/gpt-5-nano
ReplicateChat$0.05$0.40
Functions
gpt-oss-120b
replicate/openai/gpt-oss-120b
ReplicateChat$0.18$0.72
Functions
gpt-oss-120b
sambanova/gpt-oss-120b
SambaNovaChat$3.00$4.50
FunctionsReasoning
gpt-oss-120b
together_ai/openai/gpt-oss-120b
Together AIChat$0.15$0.60
FunctionsReasoning
gpt-oss-20b
replicateopenai/gpt-oss-20b
ReplicateChat$0.09$0.36
Functions
gpt-oss-20b
together_ai/openai/gpt-oss-20b
Together AIChat$0.05$0.20
Functions
granite-3.3-8b-instruct
replicate/ibm-granite/granite-3.3-8b-instruct
ReplicateChat$0.03$0.25
Functions
grok-4
replicate/xai/grok-4
ReplicateChat$7.20$36.00
Functions
Kimi-K2-Instruct
together_ai/moonshotai/Kimi-K2-Instruct
Together AIChat$1.00$3.00
Functions
llama-2-13b
replicate/meta/llama-2-13b
ReplicateChat$0.10$0.50
llama-2-13b-chat
replicate/meta/llama-2-13b-chat
ReplicateChat$0.10$0.50
llama-2-70b
replicate/meta/llama-2-70b
ReplicateChat$0.65$2.75
llama-2-70b-chat
replicate/meta/llama-2-70b-chat
ReplicateChat$0.65$2.75
llama-2-7b
replicate/meta/llama-2-7b
ReplicateChat$0.05$0.25
llama-2-7b-chat
replicate/meta/llama-2-7b-chat
ReplicateChat$0.05$0.25
llama-3-70b
replicate/meta/llama-3-70b
ReplicateChat$0.65$2.75
llama-3-70b-instruct
replicate/meta/llama-3-70b-instruct
ReplicateChat$0.65$2.75
llama-3-8b
replicate/meta/llama-3-8b
ReplicateChat$0.05$0.25
llama-3-8b-instruct
replicate/meta/llama-3-8b-instruct
ReplicateChat$0.05$0.25
Llama-3.3-70B-Instruct-Turbo
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo
Together AIChat$0.88$0.88
Functions
Llama-4-Maverick-17B-128E-Instruct
sambanova/Llama-4-Maverick-17B-128E-Instruct
SambaNovaChat$0.63$1.80
FunctionsVision
Llama-4-Maverick-17B-128E-Instruct-FP8
together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Together AIChat$0.27$0.85
Functions
Llama-4-Scout-17B-16E-Instruct
sambanova/Llama-4-Scout-17B-16E-Instruct
SambaNovaChat$0.40$0.70
Functions
Llama-4-Scout-17B-16E-Instruct
together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct
Together AIChat$0.18$0.59
Functions
Meta-Llama-3.1-405B-Instruct
sambanova/Meta-Llama-3.1-405B-Instruct
SambaNovaChat$5.00$10.00
Functions
Meta-Llama-3.1-405B-Instruct-Turbo
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Together AIChat$3.50$3.50
Functions
Meta-Llama-3.1-70B-Instruct-Turbo
together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
Together AIChat$0.88$0.88
Functions
Meta-Llama-3.1-8B-Instruct
sambanova/Meta-Llama-3.1-8B-Instruct
SambaNovaChat$0.10$0.20
Functions
Meta-Llama-3.1-8B-Instruct-Turbo
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
Together AIChat$0.18$0.18
Functions
Meta-Llama-3.2-1B-Instruct
sambanova/Meta-Llama-3.2-1B-Instruct
SambaNovaChat$0.04$0.08
Meta-Llama-3.2-3B-Instruct
sambanova/Meta-Llama-3.2-3B-Instruct
SambaNovaChat$0.08$0.16
Meta-Llama-3.3-70B-Instruct
sambanova/Meta-Llama-3.3-70B-Instruct
SambaNovaChat$0.60$1.20
Functions
Meta-Llama-Guard-3-8B
sambanova/Meta-Llama-Guard-3-8B
SambaNovaChat$0.30$0.30
MiniMax-M2.7
sambanova/MiniMax-M2.7
SambaNovaChat$0.30$1.20
FunctionsReasoning
mistral-7b-instruct-v0.2
replicate/mistralai/mistral-7b-instruct-v0.2
ReplicateChat$0.05$0.25
mistral-7b-v0.1
replicate/mistralai/mistral-7b-v0.1
ReplicateChat$0.05$0.25
mixtral-8x7b-instruct-v0.1
replicate/mistralai/mixtral-8x7b-instruct-v0.1
ReplicateChat$0.30$1.00
Mixtral-8x7B-Instruct-v0.1
together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1
Together AIChat$0.60$0.60
Functions
o1
replicate/openai/o1
ReplicateChat$15.00$60.00
Reasoning
o1-mini
replicate/openai/o1-mini
ReplicateChat$1.10$4.40
Reasoning
o4-mini
replicate/openai/o4-mini
ReplicateChat$1.00$4.00
Reasoning
pplx-embed-v1-4b
perplexity/pplx-embed-v1-4b
PerplexityEmbedding$0.03
qwen.qwen3-235b-a22b-2507-v1:0
qwen.qwen3-235b-a22b-2507-v1:0
AWS BedrockChat$0.22$0.88
FunctionsReasoning
qwen.qwen3-32b-v1:0
qwen.qwen3-32b-v1:0
AWS BedrockChat$0.15$0.60
FunctionsReasoning
qwen.qwen3-coder-30b-a3b-v1:0
qwen.qwen3-coder-30b-a3b-v1:0
AWS BedrockChat$0.15$0.60
FunctionsReasoning
qwen.qwen3-coder-480b-a35b-v1:0
qwen.qwen3-coder-480b-a35b-v1:0
AWS BedrockChat$0.22$1.80
FunctionsReasoning
qwen.qwen3-coder-next
qwen.qwen3-coder-next
AWS BedrockChat$0.50$1.20
Functions
qwen.qwen3-next-80b-a3b
qwen.qwen3-next-80b-a3b
AWS BedrockChat$0.15$1.20
Functions
qwen.qwen3-vl-235b-a22b
qwen.qwen3-vl-235b-a22b
AWS BedrockChat$0.53$2.66
FunctionsVision
Qwen2-Audio-7B-Instruct
sambanova/Qwen2-Audio-7B-Instruct
SambaNovaChat$0.50$100.00
Audio In
Qwen3-235B-A22B-fp8-tput
together_ai/Qwen/Qwen3-235B-A22B-fp8-tput
Together AIChat$0.20$0.60
qwen3-235b-a22b-instruct-2507
replicate/qwen/qwen3-235b-a22b-instruct-2507
ReplicateChat$0.26$1.06
Functions
Qwen3-235B-A22B-Instruct-2507-tput
together_ai/Qwen/Qwen3-235B-A22B-Instruct-2507-tput
Together AIChat$0.20$6.00
Functions
Qwen3-235B-A22B-Thinking-2507
together_ai/Qwen/Qwen3-235B-A22B-Thinking-2507
Together AIChat$0.65$3.00
Functions
Qwen3-32B
sambanova/Qwen3-32B
SambaNovaChat$0.40$0.80
FunctionsReasoning
Qwen3-Coder-480B-A35B-Instruct-FP8
together_ai/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Together AIChat$2.00$2.00
Functions
QwQ-32B
sambanova/QwQ-32B
SambaNovaChat$0.50$1.00
text-embedding-004
text-embedding-004
Vertex AIEmbedding$0.10
text-embedding-005
text-embedding-005
Vertex AIEmbedding$0.10
text-embedding-3-large
text-embedding-3-large
OpenAIEmbedding$0.13
text-embedding-3-small
text-embedding-3-small
OpenAIEmbedding$0.02
text-embedding-ada-002
text-embedding-ada-002
OpenAIEmbedding$0.10
text-embedding-ada-002-v2
text-embedding-ada-002-v2
OpenAIEmbedding$0.10
text-embedding-large-exp-03-07
text-embedding-large-exp-03-07
Vertex AIEmbedding$0.10
text-multilingual-embedding-002
text-multilingual-embedding-002
Vertex AIEmbedding$0.10
text-unicorn
text-unicorn
Vertex AICompletion$10.00$28.00
text-unicorn@001
text-unicorn@001
Vertex AICompletion$10.00$28.00
together-ai-21.1b-41b
together-ai-21.1b-41b
Together AIChat$0.80$0.80
together-ai-4.1b-8b
together-ai-4.1b-8b
Together AIChat$0.20$0.20
together-ai-41.1b-80b
together-ai-41.1b-80b
Together AIChat$0.90$0.90
together-ai-8.1b-21b
together-ai-8.1b-21b
Together AIChat$0.30$0.30
together-ai-81.1b-110b
together-ai-81.1b-110b
Together AIChat$1.80$1.80
together-ai-embedding-151m-to-350m
together-ai-embedding-151m-to-350m
Together AIEmbedding$0.02
together-ai-up-to-4b
together-ai-up-to-4b
Together AIChat$0.10$0.10
Showing 1,3011,400 of 2,084