cyberclaw.directory

OpenClaw Rankings

Best models for OpenClaw usage

OpenClaw Model Rankings

Top 10 of 256 qualifying models

Score weights:45% Terminal-Bench40% Capability15% Cost

77.3

OpenClaw

Intelligence · Coding

Int 48.9·Code 51.5

Terminal-Bench Hard

52.3%

Latency (TTFT)

12.02 s

Run Cost

$0.027

in $0.750 / out $4.500 per 1M

76.9

OpenClaw

Intelligence · Coding

Int 43.8·Code 45.6

Terminal-Bench Hard

49.2%

Latency (TTFT)

1.45 s

Run Cost

$0.0084

in $0.300 / out $1.200 per 1M

76.4

OpenClaw

Intelligence · Coding

Int 57.2·Code 55.5

Terminal-Bench Hard

53.8%

Latency (TTFT)

24.34 s

Run Cost

$0.072

in $2.000 / out $12.000 per 1M

Intelligence · Coding

Int 51.5·Code 47.5

Terminal-Bench Hard

46.2%

Latency (TTFT)

1.13 s

Run Cost

$0.035

in $1.740 / out $3.480 per 1M

74.8

OpenClaw

Intelligence · Coding

Int 56.8·Code 57.2

Terminal-Bench Hard

57.6%

Latency (TTFT)

209.59 s

Run Cost

$0.090

in $2.500 / out $15.000 per 1M

74.6

OpenClaw

Intelligence · Coding

Int 53.6·Code 53.1

Terminal-Bench Hard

53.0%

Latency (TTFT)

76.09 s

Run Cost

$0.077

in $1.750 / out $14.000 per 1M

Kimi

74.5

OpenClaw

Intelligence · Coding

Int 53.9·Code 47.1

Terminal-Bench Hard

43.9%

Latency (TTFT)

1.22 s

Run Cost

$0.027

in $0.950 / out $4.000 per 1M

74.0

OpenClaw

Intelligence · Coding

Int 53.8·Code 45.5

Terminal-Bench Hard

43.2%

Latency (TTFT)

2.05 s

Run Cost

$0.024

in $1.000 / out $3.000 per 1M

Alibaba

73.8

OpenClaw

Intelligence · Coding

Int 50.0·Code 42.9

Terminal-Bench Hard

43.9%

Latency (TTFT)

1.72 s

Run Cost

$0.018

in $0.500 / out $3.000 per 1M

72.5

OpenClaw

Intelligence · Coding

Int 49.8·Code 44.2

Terminal-Bench Hard

43.2%

Latency (TTFT)

761 ms

Run Cost

$0.025

in $1.000 / out $3.200 per 1M

Scores (0–100) are percentile-normalized across all qualifying models — not raw benchmark percentages. Standard run = 12,000 input + 4,000 output tokens. Hover column headers for metric definitions. Data via Artificial Analysis.

Value Map

Best models cluster top-left: high Intelligence Index, low run cost. Bubble size scales with Terminal-Bench Hard score. Hover a bubble or legend entry to inspect.

Scroll horizontally to explore →

← cheaper · smarter ↑ (ideal zone)$0.0028$0.0079$0.022$0.064$0.1803240485765Run Cost per call, log scale →Intelligence Index →GPT-5.4 mini (xhigh) Provider: OpenAI OpenClaw Score: 77.3 Intelligence Index: 48.9 Agent Fitness: 49.2 Terminal-Bench Hard: 52.3% Latency (TTFT): 12.02 s Run cost: $0.027KAT Coder Pro V2 Provider: KwaiKAT OpenClaw Score: 76.9 Intelligence Index: 43.8 Agent Fitness: 44.0 Terminal-Bench Hard: 49.2% Latency (TTFT): 1.45 s Run cost: $0.0084Gemini 3.1 Pro Preview Provider: Google OpenClaw Score: 76.4 Intelligence Index: 57.2 Agent Fitness: 57.0 Terminal-Bench Hard: 53.8% Latency (TTFT): 24.34 s Run cost: $0.072DeepSeek V4 Pro (Reasoning, Max Effort) Provider: DeepSeek OpenClaw Score: 75.1 Intelligence Index: 51.5 Agent Fitness: 51.1 Terminal-Bench Hard: 46.2% Latency (TTFT): 1.13 s Run cost: $0.035GPT-5.4 (xhigh) Provider: OpenAI OpenClaw Score: 74.8 Intelligence Index: 56.8 Agent Fitness: 56.8 Terminal-Bench Hard: 57.6% Latency (TTFT): 209.59 s Run cost: $0.090GPT-5.3 Codex (xhigh) Provider: OpenAI OpenClaw Score: 74.6 Intelligence Index: 53.6 Agent Fitness: 53.6 Terminal-Bench Hard: 53.0% Latency (TTFT): 76.09 s Run cost: $0.077Kimi K2.6 Provider: Kimi OpenClaw Score: 74.5 Intelligence Index: 53.9 Agent Fitness: 53.2 Terminal-Bench Hard: 43.9% Latency (TTFT): 1.22 s Run cost: $0.027MiMo-V2.5-Pro Provider: Xiaomi OpenClaw Score: 74.0 Intelligence Index: 53.8 Agent Fitness: 53.0 Terminal-Bench Hard: 43.2% Latency (TTFT): 2.05 s Run cost: $0.024Qwen3.6 Plus Provider: Alibaba OpenClaw Score: 73.8 Intelligence Index: 50.0 Agent Fitness: 49.3 Terminal-Bench Hard: 43.9% Latency (TTFT): 1.72 s Run cost: $0.018GLM-5 (Reasoning) Provider: Z AI OpenClaw Score: 72.5 Intelligence Index: 49.8 Agent Fitness: 49.2 Terminal-Bench Hard: 43.2% Latency (TTFT): 761 ms Run cost: $0.025GLM-5.1 (Reasoning) Provider: Z AI OpenClaw Score: 72.1 Intelligence Index: 51.4 Agent Fitness: 50.6 Terminal-Bench Hard: 43.2% Latency (TTFT): 781 ms Run cost: $0.034GPT-5.2 (xhigh) Provider: OpenAI OpenClaw Score: 72.0 Intelligence Index: 51.3 Agent Fitness: 51.0 Terminal-Bench Hard: 47.0% Latency (TTFT): 103.18 s Run cost: $0.077Qwen3.6 Max Preview Provider: Alibaba OpenClaw Score: 71.9 Intelligence Index: 51.8 Agent Fitness: 51.1 Terminal-Bench Hard: 43.9% Latency (TTFT): 2.74 s Run cost: $0.047MiMo-V2.5 Provider: Xiaomi OpenClaw Score: 71.9 Intelligence Index: 49.0 Agent Fitness: 48.3 Terminal-Bench Hard: 41.7% Latency (TTFT): 2.70 s Run cost: $0.012Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) Provider: Anthropic OpenClaw Score: 71.4 Intelligence Index: 51.7 Agent Fitness: 51.6 Terminal-Bench Hard: 53.0% Latency (TTFT): 57.13 s Run cost: $0.105GPT-5.4 nano (xhigh) Provider: OpenAI OpenClaw Score: 71.3 Intelligence Index: 44.0 Agent Fitness: 44.0 Terminal-Bench Hard: 42.4% Latency (TTFT): 3.12 s Run cost: $0.0074GPT-5.1 (high) Provider: OpenAI OpenClaw Score: 71.1 Intelligence Index: 47.7 Agent Fitness: 47.4 Terminal-Bench Hard: 45.5% Latency (TTFT): 30.26 s Run cost: $0.055Gemini 3.5 Flash (minimal) Provider: Google OpenClaw Score: 70.4 Intelligence Index: 43.3 Agent Fitness: 43.7 Terminal-Bench Hard: 46.2% Latency (TTFT): 886 ms Run cost: $0.054MiniMax-M2.7 Provider: MiniMax OpenClaw Score: 70.3 Intelligence Index: 49.6 Agent Fitness: 48.8 Terminal-Bench Hard: 39.4% Latency (TTFT): 1.33 s Run cost: $0.0084DeepSeek V4 Pro (Reasoning, High Effort) Provider: DeepSeek OpenClaw Score: 70.1 Intelligence Index: 49.8 Agent Fitness: 49.1 Terminal-Bench Hard: 41.7% Latency (TTFT): 1.18 s Run cost: $0.035MiMo-V2-Pro Provider: Xiaomi OpenClaw Score: 70.1 Intelligence Index: 49.2 Agent Fitness: 48.4 Terminal-Bench Hard: 40.9% Latency (TTFT): 2.01 s Run cost: $0.024Gemini 3.5 Flash (high) Provider: Google OpenClaw Score: 69.7 Intelligence Index: 55.3 Agent Fitness: 54.3 Terminal-Bench Hard: 40.9% Latency (TTFT): 12.31 s Run cost: $0.054GPT-5.5 (xhigh) Provider: OpenAI OpenClaw Score: 69.2 Intelligence Index: 60.2 Agent Fitness: 60.1 Terminal-Bench Hard: 60.6% Latency (TTFT): 72.43 s Run cost: $0.180Qwen3.5 397B A17B (Reasoning) Provider: Alibaba OpenClaw Score: 68.8 Intelligence Index: 45.0 Agent Fitness: 44.6 Terminal-Bench Hard: 40.9% Latency (TTFT): 1.83 s Run cost: $0.022GPT-5.5 (high) Provider: OpenAI OpenClaw Score: 68.7 Intelligence Index: 58.9 Agent Fitness: 58.9 Terminal-Bench Hard: 59.8% Latency (TTFT): 25.22 s Run cost: $0.180Grok 4.3 (high) Provider: xAI OpenClaw Score: 68.6 Intelligence Index: 53.2 Agent Fitness: 52.0 Terminal-Bench Hard: 37.9% Latency (TTFT): 23.82 s Run cost: $0.025Claude Opus 4.7 (Adaptive Reasoning, Max Effort) Provider: Anthropic OpenClaw Score: 67.9 Intelligence Index: 57.3 Agent Fitness: 56.8 Terminal-Bench Hard: 51.5% Latency (TTFT): 14.87 s Run cost: $0.175GPT-5.5 (medium) Provider: OpenAI OpenClaw Score: 67.8 Intelligence Index: 56.7 Agent Fitness: 56.7 Terminal-Bench Hard: 57.6% Latency (TTFT): 4.83 s Run cost: $0.180Grok 4.20 0309 (Reasoning) Provider: xAI OpenClaw Score: 67.7 Intelligence Index: 48.5 Agent Fitness: 47.9 Terminal-Bench Hard: 40.9% Latency (TTFT): 37.60 s Run cost: $0.048Gemini 3 Flash Preview (Reasoning) Provider: Google OpenClaw Score: 67.6 Intelligence Index: 46.4 Agent Fitness: 46.0 Terminal-Bench Hard: 38.6% Latency (TTFT): 6.09 s Run cost: $0.018Gemini 3 Pro Preview (high) Provider: Google OpenClaw Score: 66.4 Intelligence Index: 48.4 Agent Fitness: 48.2 Terminal-Bench Hard: 41.7% Latency (TTFT): 26.98 s Run cost: $0.072Claude Sonnet 4.6 (Non-reasoning, High Effort) Provider: Anthropic OpenClaw Score: 66.3 Intelligence Index: 44.4 Agent Fitness: 44.6 Terminal-Bench Hard: 46.2% Latency (TTFT): 1.08 s Run cost: $0.105DeepSeek V4 Flash (Reasoning, Max Effort) Provider: DeepSeek OpenClaw Score: 66.1 Intelligence Index: 46.5 Agent Fitness: 45.7 Terminal-Bench Hard: 35.6% Latency (TTFT): 869 ms Run cost: $0.0028GPT-5.4 (low) Provider: OpenAI OpenClaw Score: 66.0 Intelligence Index: 47.9 Agent Fitness: 47.7 Terminal-Bench Hard: 43.2% Latency (TTFT): 1.71 s Run cost: $0.090Claude Opus 4.7 (Non-reasoning, High Effort) Provider: Anthropic OpenClaw Score: 65.9 Intelligence Index: 51.8 Agent Fitness: 51.9 Terminal-Bench Hard: 54.5% Latency (TTFT): 1.44 s Run cost: $0.175GPT-5.5 (low) Provider: OpenAI OpenClaw Score: 65.5 Intelligence Index: 50.8 Agent Fitness: 50.9 Terminal-Bench Hard: 52.3% Latency (TTFT): 1.90 s Run cost: $0.180GLM-5 (Non-reasoning) Provider: Z AI OpenClaw Score: 65.4 Intelligence Index: 40.6 Agent Fitness: 40.4 Terminal-Bench Hard: 39.4% Latency (TTFT): 1.21 s Run cost: $0.025Grok 4.20 0309 v2 (Reasoning) Provider: xAI OpenClaw Score: 65.1 Intelligence Index: 49.3 Agent Fitness: 48.4 Terminal-Bench Hard: 37.9% Latency (TTFT): 32.87 s Run cost: $0.048Kimi K2.6 (Non-reasoning) Provider: Kimi OpenClaw Score: 64.6 Intelligence Index: 42.9 Agent Fitness: 42.4 Terminal-Bench Hard: 37.9% Latency (TTFT): 1.23 s Run cost: $0.027MiMo-V2-Omni-0327 Provider: Xiaomi OpenClaw Score: 64.5 Intelligence Index: 44.9 Agent Fitness: 44.1 Terminal-Bench Hard: 35.6% Latency (TTFT): 1.44 s Run cost: $0.013Kimi K2.5 (Reasoning) Provider: Kimi OpenClaw Score: 64.0 Intelligence Index: 46.8 Agent Fitness: 46.1 Terminal-Bench Hard: 34.8% Latency (TTFT): 1.36 s Run cost: $0.019Claude Opus 4.6 (Adaptive Reasoning, Max Effort) Provider: Anthropic OpenClaw Score: 63.8 Intelligence Index: 52.9 Agent Fitness: 52.4 Terminal-Bench Hard: 46.2% Latency (TTFT): 14.59 s Run cost: $0.175Claude Opus 4.6 (Non-reasoning, High Effort) Provider: Anthropic OpenClaw Score: 63.6 Intelligence Index: 46.5 Agent Fitness: 46.6 Terminal-Bench Hard: 48.5% Latency (TTFT): 1.58 s Run cost: $0.175Qwen3.6 35B A3B (Reasoning) Provider: Alibaba OpenClaw Score: 63.6 Intelligence Index: 43.5 Agent Fitness: 42.7 Terminal-Bench Hard: 34.8% Latency (TTFT): 1.51 s Run cost: $0.0089Claude Opus 4.5 (Reasoning) Provider: Anthropic OpenClaw Score: 63.4 Intelligence Index: 49.7 Agent Fitness: 49.5 Terminal-Bench Hard: 47.0% Latency (TTFT): 13.04 s Run cost: $0.175Qwen3.6 27B (Reasoning) Provider: Alibaba OpenClaw Score: 63.3 Intelligence Index: 45.8 Agent Fitness: 44.9 Terminal-Bench Hard: 34.8% Latency (TTFT): 1.45 s Run cost: $0.022MiniMax-M2.5 Provider: MiniMax OpenClaw Score: 63.1 Intelligence Index: 41.9 Agent Fitness: 41.4 Terminal-Bench Hard: 34.8% Latency (TTFT): 1.15 s Run cost: $0.0084Hy3-preview (Reasoning) Provider: Tencent OpenClaw Score: 62.9 Intelligence Index: 41.9 Agent Fitness: 41.4 Terminal-Bench Hard: 34.1% Latency (TTFT): 2.54 s Run cost: $0.0032GPT-5 Codex (high) Provider: OpenAI OpenClaw Score: 62.7 Intelligence Index: 44.6 Agent Fitness: 44.0 Terminal-Bench Hard: 37.9% Latency (TTFT): 12.59 s Run cost: $0.055GLM-5.1 (Non-reasoning) Provider: Z AI OpenClaw Score: 62.1 Intelligence Index: 43.8 Agent Fitness: 43.0 Terminal-Bench Hard: 35.6% Latency (TTFT): 859 ms Run cost: $0.034Claude Sonnet 4.6 (Non-reasoning, Low Effort) Provider: Anthropic OpenClaw Score: 62.0 Intelligence Index: 42.6 Agent Fitness: 42.6 Terminal-Bench Hard: 42.4% Latency (TTFT): 1.16 s Run cost: $0.105Qwen3.5 397B A17B (Non-reasoning) Provider: Alibaba OpenClaw Score: 62.0 Intelligence Index: 40.1 Agent Fitness: 39.8 Terminal-Bench Hard: 35.6% Latency (TTFT): 1.82 s Run cost: $0.022GPT-5.2 Codex (xhigh) Provider: OpenAI OpenClaw Score: 61.9 Intelligence Index: 49.0 Agent Fitness: 48.4 Terminal-Bench Hard: 37.1% Latency (TTFT): 2.82 s Run cost: $0.077GPT-5.5 (Non-reasoning) Provider: OpenAI OpenClaw Score: 61.8 Intelligence Index: 40.9 Agent Fitness: 41.7 Terminal-Bench Hard: 49.2% Latency (TTFT): 1.01 s Run cost: $0.180GPT-5 (medium) Provider: OpenAI OpenClaw Score: 61.8 Intelligence Index: 42.0 Agent Fitness: 41.7 Terminal-Bench Hard: 37.9% Latency (TTFT): 55.86 s Run cost: $0.055DeepSeek V4 Pro (Non-reasoning) Provider: DeepSeek OpenClaw Score: 61.2 Intelligence Index: 39.3 Agent Fitness: 39.2 Terminal-Bench Hard: 36.4% Latency (TTFT): 1.15 s Run cost: $0.035GPT-5 mini (high) Provider: OpenAI OpenClaw Score: 61.2 Intelligence Index: 41.2 Agent Fitness: 40.6 Terminal-Bench Hard: 33.3% Latency (TTFT): 109.86 s Run cost: $0.011DeepSeek V4 Flash (Non-reasoning) Provider: DeepSeek OpenClaw Score: 60.9 Intelligence Index: 36.5 Agent Fitness: 36.4 Terminal-Bench Hard: 34.1% Latency (TTFT): 801 ms Run cost: $0.0028Qwen3.5 27B (Reasoning) Provider: Alibaba OpenClaw Score: 60.6 Intelligence Index: 42.1 Agent Fitness: 41.4 Terminal-Bench Hard: 32.6% Latency (TTFT): 1.46 s Run cost: $0.013GPT-5.4 nano (medium) Provider: OpenAI OpenClaw Score: 60.4 Intelligence Index: 38.1 Agent Fitness: 37.8 Terminal-Bench Hard: 33.3% Latency (TTFT): 4.51 s Run cost: $0.0074

Data provided by Artificial Analysis