Replicate Models
Browse all 40 AI models from Replicate
Total Models: 40
Modes: 1
Avg Input (1M Tokens): $1.50
Avg Output (1M Tokens): $6.40
All Replicate Models
| Model Name | Identifier | Provider | Mode | Input (per 1M tokens) | Output (per 1M tokens) | Max Input Tokens | Max Output Tokens | Capabilities |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| claude-3.5-haiku | replicate/anthropic/claude-3.5-haiku | Replicate | Chat | $1.00 | $5.00 | - | - | Functions, Vision |
| claude-3.5-sonnet | replicate/anthropic/claude-3.5-sonnet | Replicate | Chat | $3.75 | $18.75 | - | - | Functions, Vision |
| claude-3.7-sonnet | replicate/anthropic/claude-3.7-sonnet | Replicate | Chat | $3.00 | $15.00 | - | - | Functions, Vision |
| claude-4-sonnet | replicate/anthropic/claude-4-sonnet | Replicate | Chat | $3.00 | $15.00 | - | - | Functions, Vision |
| claude-4.5-haiku | replicate/anthropic/claude-4.5-haiku | Replicate | Chat | $1.00 | $5.00 | - | - | Functions, Vision |
| claude-4.5-sonnet | replicate/anthropic/claude-4.5-sonnet | Replicate | Chat | $3.00 | $15.00 | - | - | Functions, Vision |
| deepseek-r1 | replicate/deepseek-ai/deepseek-r1 | Replicate | Chat | $3.75 | $10.00 | 65,536 | 8,192 | Reasoning |
| deepseek-v3 | replicate/deepseek-ai/deepseek-v3 | Replicate | Chat | $1.45 | $1.45 | 65,536 | 8,192 | Functions |
| deepseek-v3.1 | replicate/deepseek-ai/deepseek-v3.1 | Replicate | Chat | $0.672 | $2.02 | 163k | 163k | Functions, Reasoning |
| gemini-2.5-flash | replicate/google/gemini-2.5-flash | Replicate | Chat | $2.50 | $2.50 | - | - | Functions, Vision |
| gemini-3-pro | replicate/google/gemini-3-pro | Replicate | Chat | $2.00 | $12.00 | - | - | Functions, Vision |
| gpt-4.1 | replicate/openai/gpt-4.1 | Replicate | Chat | $2.00 | $8.00 | - | - | Functions, Vision |
| gpt-4.1-mini | replicate/openai/gpt-4.1-mini | Replicate | Chat | $0.40 | $1.60 | - | - | Functions, Vision |
| gpt-4.1-nano | replicate/openai/gpt-4.1-nano | Replicate | Chat | $0.10 | $0.40 | - | - | Functions |
| gpt-4o | replicate/openai/gpt-4o | Replicate | Chat | $2.50 | $10.00 | - | - | Functions, Vision, Audio In, Audio Out |
| gpt-4o-mini | replicate/openai/gpt-4o-mini | Replicate | Chat | $0.15 | $0.60 | - | - | Functions, Vision |
| gpt-5 | replicate/openai/gpt-5 | Replicate | Chat | $1.25 | $10.00 | - | - | Functions, Vision |
| gpt-5-mini | replicate/openai/gpt-5-mini | Replicate | Chat | $0.25 | $2.00 | - | - | Functions, Vision |
| gpt-5-nano | replicate/openai/gpt-5-nano | Replicate | Chat | $0.05 | $0.40 | - | - | Functions |
| gpt-oss-120b | replicate/openai/gpt-oss-120b | Replicate | Chat | $0.18 | $0.72 | - | - | Functions |
| gpt-oss-20b | replicate/openai/gpt-oss-20b | Replicate | Chat | $0.09 | $0.36 | - | - | Functions |
| granite-3.3-8b-instruct | replicate/ibm-granite/granite-3.3-8b-instruct | Replicate | Chat | $0.03 | $0.25 | - | - | Functions |
| grok-4 | replicate/xai/grok-4 | Replicate | Chat | $7.20 | $36.00 | - | - | Functions |
| llama-2-13b | replicate/meta/llama-2-13b | Replicate | Chat | $0.10 | $0.50 | 4,096 | 4,096 | - |
| llama-2-13b-chat | replicate/meta/llama-2-13b-chat | Replicate | Chat | $0.10 | $0.50 | 4,096 | 4,096 | - |
| llama-2-70b | replicate/meta/llama-2-70b | Replicate | Chat | $0.65 | $2.75 | 4,096 | 4,096 | - |
| llama-2-70b-chat | replicate/meta/llama-2-70b-chat | Replicate | Chat | $0.65 | $2.75 | 4,096 | 4,096 | - |
| llama-2-7b | replicate/meta/llama-2-7b | Replicate | Chat | $0.05 | $0.25 | 4,096 | 4,096 | - |
| llama-2-7b-chat | replicate/meta/llama-2-7b-chat | Replicate | Chat | $0.05 | $0.25 | 4,096 | 4,096 | - |
| llama-3-70b | replicate/meta/llama-3-70b | Replicate | Chat | $0.65 | $2.75 | 8,192 | 8,192 | - |
| llama-3-70b-instruct | replicate/meta/llama-3-70b-instruct | Replicate | Chat | $0.65 | $2.75 | 8,192 | 8,192 | - |
| llama-3-8b | replicate/meta/llama-3-8b | Replicate | Chat | $0.05 | $0.25 | 8,086 | 8,086 | - |
| llama-3-8b-instruct | replicate/meta/llama-3-8b-instruct | Replicate | Chat | $0.05 | $0.25 | 8,086 | 8,086 | - |
| mistral-7b-instruct-v0.2 | replicate/mistralai/mistral-7b-instruct-v0.2 | Replicate | Chat | $0.05 | $0.25 | 4,096 | 4,096 | - |
| mistral-7b-v0.1 | replicate/mistralai/mistral-7b-v0.1 | Replicate | Chat | $0.05 | $0.25 | 4,096 | 4,096 | - |
| mixtral-8x7b-instruct-v0.1 | replicate/mistralai/mixtral-8x7b-instruct-v0.1 | Replicate | Chat | $0.30 | $1.00 | 4,096 | 4,096 | - |
| o1 | replicate/openai/o1 | Replicate | Chat | $15.00 | $60.00 | - | - | Reasoning |
| o1-mini | replicate/openai/o1-mini | Replicate | Chat | $1.10 | $4.40 | - | - | Reasoning |
| o4-mini | replicate/openai/o4-mini | Replicate | Chat | $1.00 | $4.00 | - | - | Reasoning |
| qwen3-235b-a22b-instruct-2507 | replicate/qwen/qwen3-235b-a22b-instruct-2507 | Replicate | Chat | $0.264 | $1.06 | - | - | Functions |
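
Because every price in the listing above is quoted per 1M tokens, the cost of a single request is a simple proportion of its input and output token counts. The following is a minimal sketch of that arithmetic, not part of any Bifrost or Replicate API: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration, and the three entries are copied by hand from the listing.

```python
from decimal import Decimal

# Hypothetical lookup table: (input price, output price) in USD per 1M tokens,
# copied by hand from the listing above. Not fetched from any API.
PRICES = {
    "replicate/openai/gpt-4.1-mini": (Decimal("0.40"), Decimal("1.60")),
    "replicate/anthropic/claude-4.5-sonnet": (Decimal("3.00"), Decimal("15.00")),
    "replicate/meta/llama-3-8b-instruct": (Decimal("0.05"), Decimal("0.25")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request for the given model."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # 200,000 input tokens and 50,000 output tokens on gpt-4.1-mini.
    print(estimate_cost("replicate/openai/gpt-4.1-mini", 200_000, 50_000))
```

`Decimal` is used instead of `float` so that small per-token prices do not pick up binary rounding error when summed over many requests. Actual billing may differ from this estimate, since the listing does not say how tokens are counted or rounded.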