Thoughts on AI, technology, and the future we're building.

New posts every week

HomeAll PostsAI NewsAI Basics
Timelines
ChatGPTOpenAI release historyAnthropic ClaudeClaude release historyGoogle GeminiGemini release history
Benchmarks
OverviewFull model trackerValue RankingsPerformance for the moneyCoding RankingsSWE-bench and code signalsAgent RankingsTool and workflow signalsReasoning RankingsKnowledge and reasoningLong ContextDocument and retrieval signalsLab ComparisonsProvider-level rankings
CategoriesAboutContact

Subscribe to Newsletter

Practical AI news, tips, tricks, tool analysis, sent straight to your inbox.

No spam. Unsubscribe anytime.

Practical explainers, tool notes, and systems thinking for people turning new AI capability into useful work.

Explore

  • All Posts
  • Categories
  • About
  • Contact

Categories

  • AI News
  • AI Basics
  • ChatGPT
  • Anthropic
  • AI Tools
  • AI Video
  • AI Images
  • Courses

Connect

LinkedInTwitterRSS

© 2026. All rights reserved.

Benchmark suiteUpdated May 27, 2026

IBM benchmarks.

A lab-level profile for IBM, covering tracked models, category leaders, open/proprietary mix, speed, value, source coverage, and portfolio depth.

All labs

Lab rank

#14

By available Arena average

Models tracked

1

1 open-weight models

Portfolio coverage

44%

Average matched benchmark signals

Fastest model

0.39s

Granite 4.1 8b

OverviewFull public trackerValuePerformance for the moneyCodingCode and SWE-bench signalsAgentsTool and workflow readinessReasoningKnowledge and reasoning signalsContextDocument and retrieval signalsLabsProvider comparisons

Lab profile

Portfolio shape, leaders, and model coverage.

The lab page now starts with the portfolio read, then drills into category leaders, value rankings, and model-level coverage rather than repeating source panels.

Portfolio read

IBM has 1 tracked benchmark profiles.

Lab #14

The portfolio combines 0 proprietary models and 1 open-weight models. Its average available Arena rank is 76, with a best category rank of #76. Average profile coverage is 44%, so model-level gaps remain visible in the cards below.

Best Arena model

Granite 4.1 8b

Best category rank #76

Highest coverage

Granite 4.1 8b

44% matched signals

Fastest response

Granite 4.1 8b

0.39s first token

Access mix

Open versus proprietary

Proprietary

0 / 0%

Open weights

1 / 100%

Value leader

Granite 4.1 8b

Category leaders

Best model by benchmark lens.

Each card shows the highest-ranked model from this lab in a composite benchmark suite, with the global rank and index left intact.

Value

Quality, price, latency, and speed

Granite 4.1 8b

#6951

value index

Coding

Code Arena, SWE-bench, and output speed

Granite 4.1 8b

#7821

coding index

Agents

Search, document, code, and latency

Granite 4.1 8b

#10656

agent proxy

Reasoning

Text, document, vision, and mode signals

Not listed

Not enough matched rows for this lab in this suite.

Context

Document, search, and text proxy

Not listed

Not enough matched rows for this lab in this suite.

IBM value ranking table

Models from this lab with both quality and commercial price matches.

Top 1 of 1

RankModelIndexPriceLatencySpeedQualitySources
#69
Granite 4.1 8b

granite-4.1-8b

IBMOpen weights
51value index
$0.063blended / 1M tokens
0.39stime to first token
131 tok/soutput speed
23quality signal
ArenaArtificial Analysis
#69
Granite 4.1 8b

granite-4.1-8b

IBMOpen weights
51value index
Price
$0.063blended / 1M tokens
Latency
0.39stime to first token
Speed
131 tok/soutput speed
Quality
23quality signal
ArenaArtificial Analysis

Model portfolio

IBM model coverage map.

Every tracked model keeps its best Arena rank, coverage score, commercial metrics, and suite placement visible in one scannable grid.

Granite 4.1 8b

granite-4.1-8b

Open weights

Coverage

44%

Best Arena

#76

Code

Value

#69

value index

Coding

#78

coding index

Latency

0.39s

131 tok/s

$0.063