Thoughts on AI, technology, and the future we're building.

New posts every week

HomeAll PostsAI NewsAI Basics
Timelines
ChatGPTOpenAI release historyAnthropic ClaudeClaude release historyGoogle GeminiGemini release history
Benchmarks
OverviewFull model trackerValue RankingsPerformance for the moneyCoding RankingsSWE-bench and code signalsAgent RankingsTool and workflow signalsReasoning RankingsKnowledge and reasoningLong ContextDocument and retrieval signalsLab ComparisonsProvider-level rankings
CategoriesAboutContact

Subscribe to Newsletter

Practical AI news, tips, tricks, tool analysis, sent straight to your inbox.

No spam. Unsubscribe anytime.

Practical explainers, tool notes, and systems thinking for people turning new AI capability into useful work.

Explore

  • All Posts
  • Categories
  • About
  • Contact

Categories

  • AI News
  • AI Basics
  • ChatGPT
  • Anthropic
  • AI Tools
  • AI Video
  • AI Images
  • Courses

Connect

LinkedInTwitterRSS

© 2026. All rights reserved.

Benchmark suiteUpdated May 27, 2026

xAI benchmarks.

A lab-level profile for xAI, covering tracked models, category leaders, open/proprietary mix, speed, value, source coverage, and portfolio depth.

All labs

Lab rank

#11

By available Arena average

Models tracked

15

0 open-weight models

Portfolio coverage

23%

Average matched benchmark signals

Fastest model

0.58s

Grok 4.3

OverviewFull public trackerValuePerformance for the moneyCodingCode and SWE-bench signalsAgentsTool and workflow readinessReasoningKnowledge and reasoning signalsContextDocument and retrieval signalsLabsProvider comparisons

Lab profile

Portfolio shape, leaders, and model coverage.

The lab page now starts with the portfolio read, then drills into category leaders, value rankings, and model-level coverage rather than repeating source panels.

Portfolio read

xAI has 15 tracked benchmark profiles.

Lab #11

The portfolio combines 15 proprietary models and 0 open-weight models. Its average available Arena rank is 40.6, with a best category rank of #5. Average profile coverage is 23%, so model-level gaps remain visible in the cards below.

Best Arena model

Grok 4.20 Multi Agent Beta 0309

Best category rank #5

Highest coverage

Grok 4.3

78% matched signals

Fastest response

Grok 4.3

0.58s first token

Access mix

Open versus proprietary

Proprietary

15 / 100%

Open weights

0 / 0%

Value leader

Grok 4.3

Category leaders

Best model by benchmark lens.

Each card shows the highest-ranked model from this lab in a composite benchmark suite, with the global rank and index left intact.

Value

Quality, price, latency, and speed

Grok 4.3

#1771

value index

Coding

Code Arena, SWE-bench, and output speed

Grok 4.20 Beta 0309 Reasoning

#3364

coding index

Agents

Search, document, code, and latency

Grok 4.20 Multi Agent Beta 0309

#3493

agent proxy

Reasoning

Text, document, vision, and mode signals

Grok 4.20 Beta 0309 Reasoning

#1672

reasoning proxy

Context

Document, search, and text proxy

Grok 4.1 Fast Search

#1383

context proxy

xAI value ranking table

Models from this lab with both quality and commercial price matches.

Top 3 of 3

RankModelIndexPriceLatencySpeedQualitySources
#17
Grok 4.3

grok-4.3

xAIProprietary
71value index
$1.56blended / 1M tokens
0.58stime to first token
111 tok/soutput speed
64quality signal
ArenaArtificial Analysis
#78
Grok 4 Fast Reasoning

grok-4-fast-reasoning

xAIProprietary
43value index
$0.275blended / 1M tokens
Not listedtime to first token
Not listedoutput speed
12quality signal
ArenaArtificial Analysis
#79
Grok 4 Fast Chat

grok-4-fast-chat

xAIProprietary
42value index
$0.275blended / 1M tokens
Not listedtime to first token
Not listedoutput speed
11quality signal
ArenaArtificial Analysis
#17
Grok 4.3

grok-4.3

xAIProprietary
71value index
Price
$1.56blended / 1M tokens
Latency
0.58stime to first token
Speed
111 tok/soutput speed
Quality
64quality signal
ArenaArtificial Analysis
#78
Grok 4 Fast Reasoning

grok-4-fast-reasoning

xAIProprietary
43value index
Price
$0.275blended / 1M tokens
Latency
Not listedtime to first token
Speed
Not listedoutput speed
Quality
12quality signal
ArenaArtificial Analysis
#79
Grok 4 Fast Chat

grok-4-fast-chat

xAIProprietary
42value index
Price
$0.275blended / 1M tokens
Latency
Not listedtime to first token
Speed
Not listedoutput speed
Quality
11quality signal
ArenaArtificial Analysis

Model portfolio

xAI model coverage map.

Every tracked model keeps its best Arena rank, coverage score, commercial metrics, and suite placement visible in one scannable grid.

Grok 4.20 Multi Agent Beta 0309

grok-4.20-multi-agent-beta-0309

Proprietary

Coverage

33%

Best Arena

#5

Search

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed

Grok 4.3

grok-4.3

Proprietary

Coverage

78%

Best Arena

#6

Search

Value

#17

value index

Coding

#51

coding index

Latency

0.58s

111 tok/s

$1.56

Grok 4.20 Beta 1

grok-4.20-beta1

Proprietary

Coverage

22%

Best Arena

#8

Search

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed

Grok 4.1 Fast Search

grok-4-1-fast-search

Proprietary

Coverage

11%

Best Arena

#11

Search

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed

Grok 4.20 Beta 0309 Reasoning

grok-4.20-beta-0309-reasoning

Proprietary

Coverage

33%

Best Arena

#14

Text

Value

Not listed

No price match

Coding

#33

coding index

Latency

Not listed

Not listed

Not listed

Grok 4 Fast Search

grok-4-fast-search

Proprietary

Coverage

11%

Best Arena

#21

Search

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed

Grok 4.1 Thinking

grok-4.1-thinking

Proprietary

Coverage

22%

Best Arena

#25

Text

Value

Not listed

No price match

Coding

#76

coding index

Latency

Not listed

Not listed

Not listed

Grok 4 Search

grok-4-search

Proprietary

Coverage

11%

Best Arena

#27

Search

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed

Grok 4.1

grok-4.1

Proprietary

Coverage

11%

Best Arena

#30

Text

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed

Grok 4.1 Fast Reasoning

grok-4-1-fast-reasoning

Proprietary

Coverage

33%

Best Arena

#57

Vision

Value

Not listed

No price match

Coding

#74

coding index

Latency

Not listed

Not listed

Not listed

Grok 4 0709

grok-4-0709

Proprietary

Coverage

11%

Best Arena

#64

Vision

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed

Grok 4 Fast Reasoning

grok-4-fast-reasoning

Proprietary

Coverage

22%

Best Arena

#79

Code

Value

#78

value index

Coding

#84

coding index

Latency

Not listed

Not listed

$0.275

Grok Code Fast 1

grok-code-fast-1

Proprietary

Coverage

11%

Best Arena

#80

Code

Value

Not listed

No price match

Coding

#86

coding index

Latency

Not listed

Not listed

Not listed

Grok 4 Fast Chat

grok-4-fast-chat

Proprietary

Coverage

22%

Best Arena

#81

Text

Value

#79

value index

Coding

Not listed

No coding index

Latency

Not listed

Not listed

$0.275

Grok 3 Preview 02 24

grok-3-preview-02-24

Proprietary

Coverage

11%

Best Arena

#98

Text

Value

Not listed

No price match

Coding

Not listed

No coding index

Latency

Not listed

Not listed

Not listed