UPDATES
Try Bifrost Enterprise free for 14 days.
Explore now
Performance
Features
Enterprise
Docs
Blog
Discord
Github
Book a Demo
Home
>
Llamagate
Llamagate Models
Browse all 16 AI models from Llamagate
Total Models
16
Modes
2
Avg Input (1M Tokens)
$0.07
Avg Output (1M Tokens)
$0.16
All Llamagate Models
Click on any model to calculate costs
All Modes
Showing 16 of 16 models
Model Name↑
Provider
Mode
Input
Cost
(per 1M tokens)
Output
Cost
(per 1M tokens)
Max Input
Tokens
Max Output
Tokens
Capabilities
codellama-7b
llamagate/codellama-7b
Llamagate
Chat
$0.06
$0.12
16,384
4,096
Functions
deepseek-coder-6.7b
llamagate/deepseek-coder-6.7b
Llamagate
Chat
$0.06
$0.12
16,384
4,096
Functions
deepseek-r1-7b-qwen
llamagate/deepseek-r1-7b-qwen
Llamagate
Chat
$0.08
$0.15
131k
16,384
Functions
Reasoning
deepseek-r1-8b
llamagate/deepseek-r1-8b
Llamagate
Chat
$0.10
$0.20
65,536
16,384
Functions
Reasoning
dolphin3-8b
llamagate/dolphin3-8b
Llamagate
Chat
$0.08
$0.15
128k
8,192
Functions
gemma3-4b
llamagate/gemma3-4b
Llamagate
Chat
$0.03
$0.08
128k
8,192
Functions
Vision
llama-3.1-8b
llamagate/llama-3.1-8b
Llamagate
Chat
$0.03
$0.05
131k
8,192
Functions
llama-3.2-3b
llamagate/llama-3.2-3b
Llamagate
Chat
$0.04
$0.08
131k
8,192
Functions
llava-7b
llamagate/llava-7b
Llamagate
Chat
$0.10
$0.20
4,096
2,048
Vision
mistral-7b-v0.3
llamagate/mistral-7b-v0.3
Llamagate
Chat
$0.10
$0.15
32,768
8,192
Functions
nomic-embed-text
llamagate/nomic-embed-text
Llamagate
Embedding
$0.02
—
8,192
—
openthinker-7b
llamagate/openthinker-7b
Llamagate
Chat
$0.08
$0.15
32,768
8,192
Functions
Reasoning
qwen2.5-coder-7b
llamagate/qwen2.5-coder-7b
Llamagate
Chat
$0.06
$0.12
32,768
8,192
Functions
qwen3-8b
llamagate/qwen3-8b
Llamagate
Chat
$0.04
$0.14
32,768
8,192
Functions
qwen3-embedding-8b
llamagate/qwen3-embedding-8b
Llamagate
Embedding
$0.02
—
40,960
—
qwen3-vl-8b
llamagate/qwen3-vl-8b
Llamagate
Chat
$0.15
$0.55
32,768
8,192
Functions
Vision
Llamagate Models - LLM Cost Calculator | Bifrost