Try Bifrost Enterprise free for 14 days.
Request access
P
E
R
F
O
R
M
A
N
C
E
F
E
A
T
U
R
E
S
E
N
T
E
R
P
R
I
S
E
D
O
C
S
B
L
O
G
Discord
Github
Book a Demo
Home
Llamagate
Llamagate Models
Browse all 16 AI models from Llamagate
Total Models
16
Modes
2
Avg Input (1M Tokens)
$0.07
Avg Output (1M Tokens)
$0.16
All Llamagate Models
Click on any model to view details
All Modes
Showing 16 of 16 models
Model Name↑
Provider
Mode
Pricing
(tokens, images, audio, or pages)
Max Input
Tokens
Max Output
Tokens
Capabilities
codellama-7b
llamagate/codellama-7b
Llamagate
Chat
Input $0.060000 / 1M tokens
Output $0.120000 / 1M tokens
16,384
4,096
Functions
deepseek-coder-6.7b
llamagate/deepseek-coder-6.7b
Llamagate
Chat
Input $0.060000 / 1M tokens
Output $0.120000 / 1M tokens
16,384
4,096
Functions
deepseek-r1-7b-qwen
llamagate/deepseek-r1-7b-qwen
Llamagate
Chat
Input $0.080000 / 1M tokens
Output $0.150000 / 1M tokens
131k
16,384
Functions
Reasoning
deepseek-r1-8b
llamagate/deepseek-r1-8b
Llamagate
Chat
Input $0.100000 / 1M tokens
Output $0.200000 / 1M tokens
65,536
16,384
Functions
Reasoning
dolphin3-8b
llamagate/dolphin3-8b
Llamagate
Chat
Input $0.080000 / 1M tokens
Output $0.150000 / 1M tokens
128k
8,192
Functions
gemma3-4b
llamagate/gemma3-4b
Llamagate
Chat
Input $0.030000 / 1M tokens
Output $0.080000 / 1M tokens
128k
8,192
Functions
Vision
llama-3.1-8b
llamagate/llama-3.1-8b
Llamagate
Chat
Input $0.030000 / 1M tokens
Output $0.050000 / 1M tokens
131k
8,192
Functions
llama-3.2-3b
llamagate/llama-3.2-3b
Llamagate
Chat
Input $0.040000 / 1M tokens
Output $0.080000 / 1M tokens
131k
8,192
Functions
llava-7b
llamagate/llava-7b
Llamagate
Chat
Input $0.100000 / 1M tokens
Output $0.200000 / 1M tokens
4,096
2,048
Vision
mistral-7b-v0.3
llamagate/mistral-7b-v0.3
Llamagate
Chat
Input $0.100000 / 1M tokens
Output $0.150000 / 1M tokens
32,768
8,192
Functions
nomic-embed-text
llamagate/nomic-embed-text
Llamagate
Embedding
Input $0.020000 / 1M tokens
Output $0.0000000000 / 1M tokens
8,192
—
openthinker-7b
llamagate/openthinker-7b
Llamagate
Chat
Input $0.080000 / 1M tokens
Output $0.150000 / 1M tokens
32,768
8,192
Functions
Reasoning
qwen2.5-coder-7b
llamagate/qwen2.5-coder-7b
Llamagate
Chat
Input $0.060000 / 1M tokens
Output $0.120000 / 1M tokens
32,768
8,192
Functions
qwen3-8b
llamagate/qwen3-8b
Llamagate
Chat
Input $0.040000 / 1M tokens
Output $0.140000 / 1M tokens
32,768
8,192
Functions
qwen3-embedding-8b
llamagate/qwen3-embedding-8b
Llamagate
Embedding
Input $0.020000 / 1M tokens
Output $0.0000000000 / 1M tokens
40,960
—
qwen3-vl-8b
llamagate/qwen3-vl-8b
Llamagate
Chat
Input $0.150000 / 1M tokens
Output $0.550000 / 1M tokens
32,768
8,192
Functions
Vision
Llamagate Models - Bifrost AI Model Library