Try Bifrost Enterprise free for 14 days.
Request access
P
E
R
F
O
R
M
A
N
C
E
F
E
A
T
U
R
E
S
E
N
T
E
R
P
R
I
S
E
D
O
C
S
B
L
O
G
Discord
Github
Book a Demo
Home
Novita
Novita Models
Browse all 85 AI models from Novita
Total Models
85
Modes
3
Avg Input (1M Tokens)
$0.31
Avg Output (1M Tokens)
$0.96
All Novita Models
Click on any model to view details
All Modes
Showing 85 of 85 models
Model Name↑
Provider
Mode
Pricing
(tokens, images, audio, or pages)
Max Input
Tokens
Max Output
Tokens
Capabilities
autoglm-phone-9b-multilingual
novita/zai-org/autoglm-phone-9b-multilingual
Novita
Chat
Input $0.035000 / 1M tokens
Output $0.138000 / 1M tokens
65,536
65,536
Vision
baichuan-m2-32b
novita/baichuan/baichuan-m2-32b
Novita
Chat
Input $0.070000 / 1M tokens
Output $0.070000 / 1M tokens
131k
131k
bge-m3
novita/baai/bge-m3
Novita
Embedding
Input $0.010000 / 1M tokens
Output $0.010000 / 1M tokens
8,192
96,000
bge-reranker-v2-m3
novita/baai/bge-reranker-v2-m3
Novita
Rerank
Input $0.010000 / 1M tokens
Output $0.010000 / 1M tokens
8,000
8,000
deepseek-ocr
novita/deepseek/deepseek-ocr
Novita
Chat
Input $0.030000 / 1M tokens
Output $0.030000 / 1M tokens
8,192
8,192
Vision
deepseek-prover-v2-671b
novita/deepseek/deepseek-prover-v2-671b
Novita
Chat
Input $0.700000 / 1M tokens
Output $2.50 / 1M tokens
160k
160k
deepseek-r1-0528
novita/deepseek/deepseek-r1-0528
Novita
Chat
Input $0.700000 / 1M tokens
Output $2.50 / 1M tokens
163k
32,768
Functions
Reasoning
deepseek-r1-0528-qwen3-8b
novita/deepseek/deepseek-r1-0528-qwen3-8b
Novita
Chat
Input $0.060000 / 1M tokens
Output $0.090000 / 1M tokens
128k
32,000
Reasoning
deepseek-r1-distill-llama-70b
novita/deepseek/deepseek-r1-distill-llama-70b
Novita
Chat
Input $0.800000 / 1M tokens
Output $0.800000 / 1M tokens
8,192
8,192
Reasoning
deepseek-r1-distill-qwen-14b
novita/deepseek/deepseek-r1-distill-qwen-14b
Novita
Chat
Input $0.150000 / 1M tokens
Output $0.150000 / 1M tokens
32,768
16,384
Reasoning
deepseek-r1-distill-qwen-32b
novita/deepseek/deepseek-r1-distill-qwen-32b
Novita
Chat
Input $0.300000 / 1M tokens
Output $0.300000 / 1M tokens
64,000
32,000
Reasoning
deepseek-r1-turbo
novita/deepseek/deepseek-r1-turbo
Novita
Chat
Input $0.700000 / 1M tokens
Output $2.50 / 1M tokens
64,000
16,000
Functions
Reasoning
deepseek-v3-0324
novita/deepseek/deepseek-v3-0324
Novita
Chat
Input $0.270000 / 1M tokens
Output $1.12 / 1M tokens
163k
163k
Functions
deepseek-v3-turbo
novita/deepseek/deepseek-v3-turbo
Novita
Chat
Input $0.400000 / 1M tokens
Output $1.30 / 1M tokens
64,000
16,000
Functions
deepseek-v3.1
novita/deepseek/deepseek-v3.1
Novita
Chat
Input $0.270000 / 1M tokens
Output $1.00 / 1M tokens
131k
32,768
Functions
Reasoning
deepseek-v3.1-terminus
novita/deepseek/deepseek-v3.1-terminus
Novita
Chat
Input $0.270000 / 1M tokens
Output $1.00 / 1M tokens
131k
32,768
Functions
Reasoning
deepseek-v3.2
novita/deepseek/deepseek-v3.2
Novita
Chat
Input $0.269000 / 1M tokens
Output $0.400000 / 1M tokens
163k
65,536
Functions
Reasoning
deepseek-v3.2-exp
novita/deepseek/deepseek-v3.2-exp
Novita
Chat
Input $0.270000 / 1M tokens
Output $0.410000 / 1M tokens
163k
65,536
Functions
Reasoning
ernie-4.5-21B-a3b
novita/baidu/ernie-4.5-21B-a3b
Novita
Chat
Input $0.070000 / 1M tokens
Output $0.280000 / 1M tokens
120k
8,000
Functions
ernie-4.5-21B-a3b-thinking
novita/baidu/ernie-4.5-21B-a3b-thinking
Novita
Chat
Input $0.070000 / 1M tokens
Output $0.280000 / 1M tokens
131k
65,536
Reasoning
ernie-4.5-300b-a47b-paddle
novita/baidu/ernie-4.5-300b-a47b-paddle
Novita
Chat
Input $0.280000 / 1M tokens
Output $1.10 / 1M tokens
123k
12,000
ernie-4.5-vl-28b-a3b
novita/baidu/ernie-4.5-vl-28b-a3b
Novita
Chat
Input $0.140000 / 1M tokens
Output $0.560000 / 1M tokens
30,000
8,000
Functions
Vision
Reasoning
ernie-4.5-vl-28b-a3b-thinking
novita/baidu/ernie-4.5-vl-28b-a3b-thinking
Novita
Chat
Input $0.390000 / 1M tokens
Output $0.390000 / 1M tokens
131k
65,536
Functions
Vision
Reasoning
ernie-4.5-vl-424b-a47b
novita/baidu/ernie-4.5-vl-424b-a47b
Novita
Chat
Input $0.420000 / 1M tokens
Output $1.25 / 1M tokens
123k
16,000
Vision
Reasoning
gemma-3-12b-it
novita/google/gemma-3-12b-it
Novita
Chat
Input $0.050000 / 1M tokens
Output $0.100000 / 1M tokens
131k
8,192
Vision
gemma-3-27b-it
novita/google/gemma-3-27b-it
Novita
Chat
Input $0.119000 / 1M tokens
Output $0.200000 / 1M tokens
98,304
16,384
Vision
glm-4.5
novita/zai-org/glm-4.5
Novita
Chat
Input $0.600000 / 1M tokens
Output $2.20 / 1M tokens
131k
98,304
Functions
Reasoning
glm-4.5-air
novita/zai-org/glm-4.5-air
Novita
Chat
Input $0.130000 / 1M tokens
Output $0.850000 / 1M tokens
131k
98,304
Functions
Reasoning
glm-4.5v
novita/zai-org/glm-4.5v
Novita
Chat
Input $0.600000 / 1M tokens
Output $1.80 / 1M tokens
65,536
16,384
Functions
Vision
Reasoning
glm-4.6
novita/zai-org/glm-4.6
Novita
Chat
Input $0.550000 / 1M tokens
Output $2.20 / 1M tokens
204k
131k
Functions
Reasoning
glm-4.6v
novita/zai-org/glm-4.6v
Novita
Chat
Input $0.300000 / 1M tokens
Output $0.900000 / 1M tokens
131k
32,768
Functions
Vision
Reasoning
glm-4.7
novita/zai-org/glm-4.7
Novita
Chat
Input $0.600000 / 1M tokens
Output $2.20 / 1M tokens
204k
131k
Functions
Reasoning
gpt-oss-120b
novita/openai/gpt-oss-120b
Novita
Chat
Input $0.050000 / 1M tokens
Output $0.250000 / 1M tokens
131k
32,768
Functions
Vision
Reasoning
gpt-oss-20b
novita/openai/gpt-oss-20b
Novita
Chat
Input $0.040000 / 1M tokens
Output $0.150000 / 1M tokens
131k
32,768
Vision
Reasoning
hermes-2-pro-llama-3-8b
novita/nousresearch/hermes-2-pro-llama-3-8b
Novita
Chat
Input $0.140000 / 1M tokens
Output $0.140000 / 1M tokens
8,192
8,192
kat-coder-pro
novita/kwaipilot/kat-coder-pro
Novita
Chat
Input $0.300000 / 1M tokens
Output $1.20 / 1M tokens
256k
128k
Functions
kimi-k2-0905
novita/moonshotai/kimi-k2-0905
Novita
Chat
Input $0.600000 / 1M tokens
Output $2.50 / 1M tokens
262k
262k
Functions
kimi-k2-instruct
novita/moonshotai/kimi-k2-instruct
Novita
Chat
Input $0.570000 / 1M tokens
Output $2.30 / 1M tokens
131k
131k
Functions
kimi-k2-thinking
novita/moonshotai/kimi-k2-thinking
Novita
Chat
Input $0.600000 / 1M tokens
Output $2.50 / 1M tokens
262k
262k
Functions
Reasoning
l3-70b-euryale-v2.1
novita/sao10k/l3-70b-euryale-v2.1
Novita
Chat
Input $1.48 / 1M tokens
Output $1.48 / 1M tokens
8,192
8,192
Functions
l3-8b-lunaris
novita/sao10k/l3-8b-lunaris
Novita
Chat
Input $0.050000 / 1M tokens
Output $0.050000 / 1M tokens
8,192
8,192
L3-8B-Stheno-v3.2
novita/Sao10K/L3-8B-Stheno-v3.2
Novita
Chat
Input $0.050000 / 1M tokens
Output $0.050000 / 1M tokens
8,192
32,000
Functions
l31-70b-euryale-v2.2
novita/sao10k/l31-70b-euryale-v2.2
Novita
Chat
Input $1.48 / 1M tokens
Output $1.48 / 1M tokens
8,192
8,192
Functions
llama-3-70b-instruct
novita/meta-llama/llama-3-70b-instruct
Novita
Chat
Input $0.510000 / 1M tokens
Output $0.740000 / 1M tokens
8,192
8,000
llama-3-8b-instruct
novita/meta-llama/llama-3-8b-instruct
Novita
Chat
Input $0.040000 / 1M tokens
Output $0.040000 / 1M tokens
8,192
8,192
llama-3.1-8b-instruct
novita/meta-llama/llama-3.1-8b-instruct
Novita
Chat
Input $0.020000 / 1M tokens
Output $0.050000 / 1M tokens
16,384
16,384
llama-3.2-3b-instruct
novita/meta-llama/llama-3.2-3b-instruct
Novita
Chat
Input $0.030000 / 1M tokens
Output $0.050000 / 1M tokens
32,768
32,000
Functions
llama-3.3-70b-instruct
novita/meta-llama/llama-3.3-70b-instruct
Novita
Chat
Input $0.135000 / 1M tokens
Output $0.400000 / 1M tokens
131k
120k
Functions
llama-4-maverick-17b-128e-instruct-fp8
novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8
Novita
Chat
Input $0.270000 / 1M tokens
Output $0.850000 / 1M tokens
1048k
8,192
Vision
llama-4-scout-17b-16e-instruct
novita/meta-llama/llama-4-scout-17b-16e-instruct
Novita
Chat
Input $0.180000 / 1M tokens
Output $0.590000 / 1M tokens
131k
131k
Vision
mimo-v2-flash
novita/xiaomimimo/mimo-v2-flash
Novita
Chat
Input $0.100000 / 1M tokens
Output $0.300000 / 1M tokens
262k
32,000
Functions
Reasoning
minimax-m1-80k
novita/minimaxai/minimax-m1-80k
Novita
Chat
Input $0.550000 / 1M tokens
Output $2.20 / 1M tokens
1000k
40,000
Functions
Reasoning
minimax-m2
novita/minimax/minimax-m2
Novita
Chat
Input $0.300000 / 1M tokens
Output $1.20 / 1M tokens
204k
131k
Functions
Reasoning
minimax-m2.1
novita/minimax/minimax-m2.1
Novita
Chat
Input $0.300000 / 1M tokens
Output $1.20 / 1M tokens
204k
131k
Functions
mistral-nemo
novita/mistralai/mistral-nemo
Novita
Chat
Input $0.040000 / 1M tokens
Output $0.170000 / 1M tokens
60,288
16,000
mythomax-l2-13b
novita/gryphe/mythomax-l2-13b
Novita
Chat
Input $0.090000 / 1M tokens
Output $0.090000 / 1M tokens
4,096
3,200
paddleocr-vl
novita/paddlepaddle/paddleocr-vl
Novita
Chat
Input $0.020000 / 1M tokens
Output $0.020000 / 1M tokens
16,384
16,384
Vision
qwen-2.5-72b-instruct
novita/qwen/qwen-2.5-72b-instruct
Novita
Chat
Input $0.380000 / 1M tokens
Output $0.400000 / 1M tokens
32,000
8,192
Functions
qwen-mt-plus
novita/qwen/qwen-mt-plus
Novita
Chat
Input $0.250000 / 1M tokens
Output $0.750000 / 1M tokens
16,384
8,192
qwen2.5-7b-instruct
novita/qwen/qwen2.5-7b-instruct
Novita
Chat
Input $0.070000 / 1M tokens
Output $0.070000 / 1M tokens
32,000
32,000
Functions
qwen2.5-vl-72b-instruct
novita/qwen/qwen2.5-vl-72b-instruct
Novita
Chat
Input $0.800000 / 1M tokens
Output $0.800000 / 1M tokens
32,768
32,768
Vision
qwen3-235b-a22b-fp8
novita/qwen/qwen3-235b-a22b-fp8
Novita
Chat
Input $0.200000 / 1M tokens
Output $0.800000 / 1M tokens
40,960
20,000
Reasoning
qwen3-235b-a22b-instruct-2507
novita/qwen/qwen3-235b-a22b-instruct-2507
Novita
Chat
Input $0.090000 / 1M tokens
Output $0.580000 / 1M tokens
131k
16,384
Functions
qwen3-235b-a22b-thinking-2507
novita/qwen/qwen3-235b-a22b-thinking-2507
Novita
Chat
Input $0.300000 / 1M tokens
Output $3.00 / 1M tokens
131k
32,768
Functions
Reasoning
qwen3-30b-a3b-fp8
novita/qwen/qwen3-30b-a3b-fp8
Novita
Chat
Input $0.090000 / 1M tokens
Output $0.450000 / 1M tokens
40,960
20,000
Reasoning
qwen3-32b-fp8
novita/qwen/qwen3-32b-fp8
Novita
Chat
Input $0.100000 / 1M tokens
Output $0.450000 / 1M tokens
40,960
20,000
Reasoning
qwen3-4b-fp8
novita/qwen/qwen3-4b-fp8
Novita
Chat
Input $0.030000 / 1M tokens
Output $0.030000 / 1M tokens
128k
20,000
Reasoning
qwen3-8b-fp8
novita/qwen/qwen3-8b-fp8
Novita
Chat
Input $0.035000 / 1M tokens
Output $0.138000 / 1M tokens
128k
20,000
Reasoning
qwen3-coder-30b-a3b-instruct
novita/qwen/qwen3-coder-30b-a3b-instruct
Novita
Chat
Input $0.070000 / 1M tokens
Output $0.270000 / 1M tokens
160k
32,768
Functions
qwen3-coder-480b-a35b-instruct
novita/qwen/qwen3-coder-480b-a35b-instruct
Novita
Chat
Input $0.300000 / 1M tokens
Output $1.30 / 1M tokens
262k
65,536
Functions
qwen3-embedding-0.6b
novita/qwen/qwen3-embedding-0.6b
Novita
Embedding
Input $0.070000 / 1M tokens
Output $0.0000000000 / 1M tokens
32,768
32,768
qwen3-embedding-8b
novita/qwen/qwen3-embedding-8b
Novita
Embedding
Input $0.070000 / 1M tokens
Output $0.0000000000 / 1M tokens
32,768
4,096
qwen3-max
novita/qwen/qwen3-max
Novita
Chat
Input $2.11 / 1M tokens
Output $8.45 / 1M tokens
262k
65,536
Functions
qwen3-next-80b-a3b-instruct
novita/qwen/qwen3-next-80b-a3b-instruct
Novita
Chat
Input $0.150000 / 1M tokens
Output $1.50 / 1M tokens
131k
32,768
Functions
qwen3-next-80b-a3b-thinking
novita/qwen/qwen3-next-80b-a3b-thinking
Novita
Chat
Input $0.150000 / 1M tokens
Output $1.50 / 1M tokens
131k
32,768
Functions
Reasoning
qwen3-omni-30b-a3b-instruct
novita/qwen/qwen3-omni-30b-a3b-instruct
Novita
Chat
Input $0.250000 / 1M tokens
Output $0.970000 / 1M tokens
65,536
16,384
Functions
Vision
Audio In
Audio Out
qwen3-omni-30b-a3b-thinking
novita/qwen/qwen3-omni-30b-a3b-thinking
Novita
Chat
Input $0.250000 / 1M tokens
Output $0.970000 / 1M tokens
65,536
16,384
Functions
Vision
Reasoning
Audio In
qwen3-reranker-8b
novita/qwen/qwen3-reranker-8b
Novita
Rerank
Input $0.050000 / 1M tokens
Output $0.050000 / 1M tokens
32,768
4,096
qwen3-vl-235b-a22b-instruct
novita/qwen/qwen3-vl-235b-a22b-instruct
Novita
Chat
Input $0.300000 / 1M tokens
Output $1.50 / 1M tokens
131k
32,768
Functions
Vision
qwen3-vl-235b-a22b-thinking
novita/qwen/qwen3-vl-235b-a22b-thinking
Novita
Chat
Input $0.980000 / 1M tokens
Output $3.95 / 1M tokens
131k
32,768
Vision
Reasoning
qwen3-vl-30b-a3b-instruct
novita/qwen/qwen3-vl-30b-a3b-instruct
Novita
Chat
Input $0.200000 / 1M tokens
Output $0.700000 / 1M tokens
131k
32,768
Functions
Vision
qwen3-vl-30b-a3b-thinking
novita/qwen/qwen3-vl-30b-a3b-thinking
Novita
Chat
Input $0.200000 / 1M tokens
Output $1.00 / 1M tokens
131k
32,768
Functions
Vision
qwen3-vl-8b-instruct
novita/qwen/qwen3-vl-8b-instruct
Novita
Chat
Input $0.080000 / 1M tokens
Output $0.500000 / 1M tokens
131k
32,768
Functions
Vision
r1v4-lite
novita/skywork/r1v4-lite
Novita
Chat
Input $0.200000 / 1M tokens
Output $0.600000 / 1M tokens
262k
65,536
Vision
wizardlm-2-8x22b
novita/microsoft/wizardlm-2-8x22b
Novita
Chat
Input $0.620000 / 1M tokens
Output $0.620000 / 1M tokens
65,535
8,000