Local AI Hub
LLM ConfigsLLM Runner AIORankingsCoding AgentsLLM RunnersLLM Web UIMultimodal☕Support
Buy Me a Coffee

© 2026 Local AI Hub. All rights reserved.

Model Rankings

See the best-performing models based on Artificial Analysis benchmarks

RankModel NameModel CreatorIntelligenceParametersContext WindowPriceOutput Speed
🥇
GLM-5.2 (max)Top 1
Z AI51753B1.00M$0.9140 tokens/s
🥈
MiniMax-M3
MiniMax44428B1.00M$0.299 tokens/s
🥉
DeepSeek V4 Pro (Reasoning, Max Effort)
DeepSeek441.6KB1.00M$0.292 tokens/s
4
Kimi K2.6
Kimi431.0KB256k$0.780 tokens/s
5
MiMo-V2.5-Pro
Xiaomi421.0KB1.00M$0.252 tokens/s
6
Kimi K2.7 Code
Kimi421.0KB256k$0.762 tokens/s
7
Nex-N2-Pro
Nex AGI41397B262k$0.580 tokens/s
8
DeepSeek V4 Pro (Reasoning, High Effort)
DeepSeek411.6KB1.00M$0.285 tokens/s
9
DeepSeek V4 Flash (Reasoning, Max Effort)
DeepSeek40284B1.00M$0.1110 tokens/s
10
GLM-5.1 (Reasoning)
Z AI40744B200k$0.969 tokens/s
11
MiMo-V2.5
Xiaomi40310B1.00M$0.187 tokens/s
12
GLM-5 (Reasoning)
Z AI40744B200k$0.770 tokens/s
13
MiniMax-M2.7
MiniMax38230B205k$0.246 tokens/s
14
Kimi K2.5 (Reasoning)
Kimi381.0KB256k$0.645 tokens/s
15
Nemotron 3 Ultra 550B A55B (Reasoning)
NVIDIA38550B262k$0.6172 tokens/s
16
DeepSeek V4 Flash (Reasoning, High Effort)
DeepSeek37284B1.00M$0.1-
17
Qwen3.6 27B (Reasoning)
Alibaba3727.8B262k$0.959 tokens/s
18
GLM-5.1 (Non-reasoning)
Z AI35744B200k$0.957 tokens/s
19
Kimi K2.6 (Non-reasoning)
Kimi351.0KB256k$0.767 tokens/s
20
GLM-4.7 (Reasoning)
Z AI34357B200k$0.7125 tokens/s
21
Qwen3.5 27B (Reasoning)
Alibaba3427.8B262k$0.582 tokens/s
22
Qwen3.5 397B A17B (Reasoning)
Alibaba34397B262k$0.950 tokens/s
23
MiniMax-M2.5
MiniMax34230B205k$0.3241 tokens/s
24
Hy3-preview (Reasoning)
Tencent34295B256k$0.1158 tokens/s
25
DeepSeek V3.2 (Reasoning)
DeepSeek33685B128k$0.2-
26
MiMo-V2-Flash (Feb 2026)
Xiaomi33309B256k$0.197 tokens/s
27
Kimi K2 Thinking
Kimi331.0KB256k$0.8124 tokens/s
28
GLM-5 (Non-reasoning)
Z AI32744B200k$0.758 tokens/s
29
Qwen3.5 122B A10B (Reasoning)
Alibaba32125B262k$0.7145 tokens/s
30
Qwen3.5 397B A17B (Non-reasoning)
Alibaba32397B262k$0.952 tokens/s
31
Qwen3.6 35B A3B (Reasoning)
Alibaba3236B262k$0.4173 tokens/s
32
MiniMax-M2.1
MiniMax31230B205k$0.4206 tokens/s
33
DeepSeek V4 Pro (Non-reasoning)
DeepSeek311.6KB1.00M$0.291 tokens/s
34
MiMo-V2-Flash (Reasoning)
Xiaomi31309B256k$0.195 tokens/s
35
Ring-2.6-1T
InclusionAI311.0KB262k$0.5130 tokens/s
36
Mistral Medium 3.5
Mistral30128B256k$1.2123 tokens/s
37
Step 3.7 Flash
StepFun30198B262k$0.2392 tokens/s
38
Kimi K2.5 (Non-reasoning)
Kimi291.0KB256k$0.843 tokens/s
39
Gemma 4 31B (Reasoning)
Google2930.7B256k-35 tokens/s
40
Qwen3.5 27B (Non-reasoning)
Alibaba2927.8B262k$0.589 tokens/s
41
Command A+
Cohere29218B192k-159 tokens/s
42
Qwen3.6 27B (Non-reasoning)
Alibaba2927.8B262k$0.960 tokens/s
43
Qwen3.5 35B A3B (Reasoning)
Alibaba2936B262k$0.4163 tokens/s
44
DeepSeek V4 Flash (Non-reasoning)
DeepSeek29284B1.00M$0.1112 tokens/s
45
MiniMax-M2
MiniMax28230B205k$0.4108 tokens/s
46
Qwen3.5 122B A10B (Non-reasoning)
Alibaba28125B262k$0.7160 tokens/s
47
MiMo-V2.5-Pro (Non-reasoning)
Xiaomi281.0KB1.00M$0.658 tokens/s
48
GLM-4.7 (Non-reasoning)
Z AI27357B200k$0.7118 tokens/s
49
DeepSeek V3.1 Terminus (Reasoning)
DeepSeek26685B128k$1.7-
50
Hy3-preview (Non-reasoning)
Tencent26295B256k$0.1127 tokens/s
51
Ling-2.6-1T
InclusionAI261.0KB262k$0.5-
52
Gemma 4 26B A4B (Reasoning)
Google2625.2B256k$0.1-
53
Step 3.5 Flash
StepFun26196B256k$0.1195 tokens/s
54
DeepSeek V3.2 Exp (Reasoning)
DeepSeek25685B128k$0.2-
55
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
NVIDIA25120.6B1.00M$0.3253 tokens/s
56
GLM-4.6 (Reasoning)
Z AI25357B200k$0.754 tokens/s
57
Qwen3.5 9B (Reasoning)
Alibaba259.65B262k$0.151 tokens/s
58
Gemma 4 31B (Non-reasoning)
Google2530.7B256k$0.252 tokens/s
59
K-EXAONE (Reasoning)
LG AI Research25236B256k--
60
MiMo-V2-Flash (Non-reasoning)
Xiaomi25309B256k$0.197 tokens/s
61
DeepSeek V3.2 (Non-reasoning)
DeepSeek25685B128k$0.5-
62
Trinity Large Thinking
Arcee AI24399B512k$0.2202 tokens/s
63
Qwen3.6 35B A3B (Non-reasoning)
Alibaba2436B262k$0.6180 tokens/s
64
gpt-oss-120b (high)
OpenAI24117B131k$0.2309 tokens/s
65
Kimi K2 0905
Kimi241.0KB256k$0.830 tokens/s
66
Qwen3.5 35B A3B (Non-reasoning)
Alibaba2336B262k$0.4188 tokens/s
67
GLM-4.6 (Non-reasoning)
Z AI23357B200k$0.852 tokens/s
68
EXAONE 4.5 33B
LG AI Research2334.4B262k--
69
GLM-4.7-Flash (Reasoning)
Z AI2331.2B200k$0.1103 tokens/s
70
Qwen3 235B A22B 2507 (Reasoning)
Alibaba22235B256k$0.667 tokens/s
71
DeepSeek V3.2 Speciale
DeepSeek22685B128k--
72
HyperNova 60B 2605
Multiverse Computing2258.7B131k$0.1393 tokens/s
73
Gemma 4 12B (Reasoning)
Google2212B256k$0.1125 tokens/s
74
DeepSeek V3.1 Terminus (Non-reasoning)
DeepSeek21685B128k$0.3-
75
DeepSeek V3.2 Exp (Non-reasoning)
DeepSeek21685B128k$0.2-
76
Nemotron Cascade 2 30B A3B
NVIDIA2131.6B1.00M--
77
Apriel-v1.5-15B-Thinker
ServiceNow2115B128k--
78
Qwen3 Coder Next
Alibaba2179.7B256k$0.4134 tokens/s
79
DeepSeek V3.1 (Non-reasoning)
DeepSeek21685B128k$0.7-
80
Mistral Small 4 (Reasoning)
Mistral21119B256k$0.2187 tokens/s
81
DeepSeek V3.1 (Reasoning)
DeepSeek21685B128k$0.7-
82
Qwen3 VL 235B A22B (Reasoning)
Alibaba21235B262k$1.456 tokens/s
83
North Mini Code
Cohere2130B256k-66 tokens/s
84
Apriel-v1.6-15B-Thinker
ServiceNow2115B128k--
85
Qwen3.5 9B (Non-reasoning)
Alibaba209.65B262k--
86
Gemma 4 26B A4B (Non-reasoning)
Google2025.2B256k$0.249 tokens/s
87
Qwen3.5 4B (Reasoning)
Alibaba204.66B262k$0.031 tokens/s