OpenClaw Model Rankings
Top 10 of 229 qualifying models
KwaiKAT
76.4
OpenClaw
Intelligence · Coding
Int 43.8·Code 45.6
Terminal-Bench Hard
49.2%
Latency (TTFT)
2.05 s
Run Cost
$0.0084
in $0.300 / out $1.200 per 1M
OpenAI
75.2
OpenClaw
Intelligence · Coding
Int 48.1·Code 51.5
Terminal-Bench Hard
52.3%
Latency (TTFT)
5.07 s
Run Cost
$0.027
in $0.750 / out $4.500 per 1M
OpenAI
74.5
OpenClaw
Intelligence · Coding
Int 44.4·Code 43.9
Terminal-Bench Hard
42.4%
Latency (TTFT)
2.52 s
Run Cost
$0.0074
in $0.200 / out $1.250 per 1M
Z AI
74.5
OpenClaw
Intelligence · Coding
Int 49.8·Code 44.2
Terminal-Bench Hard
43.2%
Latency (TTFT)
938 ms
Run Cost
$0.025
in $1.000 / out $3.200 per 1M
MiniMax
73.1
OpenClaw
Intelligence · Coding
Int 49.6·Code 41.9
Terminal-Bench Hard
39.4%
Latency (TTFT)
2.18 s
Run Cost
$0.0084
in $0.300 / out $1.200 per 1M
71.5
OpenClaw
Intelligence · Coding
Int 57.2·Code 55.5
Terminal-Bench Hard
53.8%
Latency (TTFT)
25.12 s
Run Cost
$0.072
in $2.000 / out $12.000 per 1M
Alibaba
70.8
OpenClaw
Intelligence · Coding
Int 45.0·Code 41.3
Terminal-Bench Hard
40.9%
Latency (TTFT)
1.37 s
Run Cost
$0.022
in $0.600 / out $3.600 per 1M
OpenAI
70.3
OpenClaw
Intelligence · Coding
Int 47.7·Code 44.7
Terminal-Bench Hard
45.5%
Latency (TTFT)
21.79 s
Run Cost
$0.055
in $1.250 / out $10.000 per 1M
69.6
OpenClaw
Intelligence · Coding
Int 46.4·Code 42.6
Terminal-Bench Hard
38.6%
Latency (TTFT)
5.79 s
Run Cost
$0.018
in $0.500 / out $3.000 per 1M
OpenAI
69.5
OpenClaw
Intelligence · Coding
Int 54.0·Code 53.1
Terminal-Bench Hard
53.0%
Latency (TTFT)
62.96 s
Run Cost
$0.077
in $1.750 / out $14.000 per 1M
| # | Model | OpenClaw Score | Intelligence · Coding | Terminal-Bench Hard | Latency (TTFT) | Run Cost | Value |
|---|---|---|---|---|---|---|---|
| #1 | KAT Coder Pro V2 KwaiKAT | 76.4 | Int 43.8·Code 45.6 Capability 44.0 | 49.2% Score 100.0 | 2.05 s Speed score 71.3 | $0.0084 in $0.300 / out $1.200 per 1M | 50.8 |
| #2 | GPT-5.4 mini (xhigh) OpenAI | 75.2 | Int 48.1·Code 51.5 Capability 48.4 | 52.3% Score 100.0 | 5.07 s Speed score 48.2 | $0.027 in $0.750 / out $4.500 per 1M | 24.6 |
| #3 | GPT-5.4 nano (xhigh) OpenAI | 74.5 | Int 44.4·Code 43.9 Capability 44.4 | 42.4% Score 95.1 | 2.52 s Speed score 66.5 | $0.0074 in $0.200 / out $1.250 per 1M | 56.0 |
| #4 | GLM-5 (Reasoning) Z AI | 74.5 | Int 49.8·Code 44.2 Capability 49.2 | 43.2% Score 96.9 | 938 ms Speed score 86.5 | $0.025 in $1.000 / out $3.200 per 1M | 26.7 |
| #5 | MiniMax-M2.7 MiniMax | 73.1 | Int 49.6·Code 41.9 Capability 48.8 | 39.4% Score 88.4 | 2.18 s Speed score 69.9 | $0.0084 in $0.300 / out $1.200 per 1M | 56.0 |
| #6 | Gemini 3.1 Pro Preview | 71.5 | Int 57.2·Code 55.5 Capability 57.0 | 53.8% Score 100.0 | 25.12 s Speed score 1.0 | $0.072 in $2.000 / out $12.000 per 1M | 16.0 |
| #7 | Qwen3.5 397B A17B (Reasoning) Alibaba | 70.8 | Int 45.0·Code 41.3 Capability 44.6 | 40.9% Score 91.7 | 1.37 s Speed score 79.7 | $0.022 in $0.600 / out $3.600 per 1M | 26.1 |
| #8 | GPT-5.1 (high) OpenAI | 70.3 | Int 47.7·Code 44.7 Capability 47.4 | 45.5% Score 100.0 | 21.79 s Speed score 3.8 | $0.055 in $1.250 / out $10.000 per 1M | 14.4 |
| #9 | Gemini 3 Flash Preview (Reasoning) | 69.6 | Int 46.4·Code 42.6 Capability 46.0 | 38.6% Score 86.6 | 5.79 s Speed score 44.4 | $0.018 in $0.500 / out $3.000 per 1M | 30.9 |
| #10 | GPT-5.3 Codex (xhigh) OpenAI | 69.5 | Int 54.0·Code 53.1 Capability 53.9 | 53.0% Score 100.0 | 62.96 s Speed score 1.0 | $0.077 in $1.750 / out $14.000 per 1M | 14.0 |
Scores (0–100) are percentile-normalized across all qualifying models — not raw benchmark percentages. Standard run = 12,000 input + 4,000 output tokens. Hover column headers for metric definitions. Data via Artificial Analysis.