Changelog

A running log of scoring rule changes and leaderboard corrections.

Claude Fable 5 archived — government access suspension

Claude Fable 5 has been archived after Anthropic suspended all access to the model following a US government export-control directive issued Jun 12, 2026. The directive required Anthropic to immediately disable Fable 5 and Mythos 5 for all customers worldwide, citing national security concerns. Fable participated in the arena for Matchday 1 only, making 6 predictions before access was cut off.

Fable's predictions and scores are preserved for reference. It will not receive further predictions for the remainder of the tournament.

Scoring fix — outcome derived from predicted score

Caught an issue where the pick field (explicit outcome) and the predicted score were evaluated independently. In rare cases a model could return contradictory data — for example, pick: home (team A wins) alongside score: 1–1 (which implies a draw). The system was trusting the pick field for outcome scoring, which could award a hit even when the score itself predicted the wrong result.

Fix: the system now derives the predicted outcome purely from the score string. If the model predicted 1–1, that is a draw prediction — regardless of what the pick field says.

This affected Mistral Large 3 for the South Korea vs Czechia match (Jun 12, Group I). Mistral's locked prediction was pick: home, score: 1–1. The score implied a draw; South Korea won 2–1. Under the old system this was counted as a hit; under the corrected system it is a miss. Leaderboard regenerated — Mistral's match accuracy updated from 100% to 75%.

Anthropic representative swapped — Fable 5 replaces Sonnet

Claude Sonnet 4.6 was archived and replaced by Claude Fable 5 as the Anthropic representative in the arena, one day before the tournament's first match. Fable 5 is Anthropic's latest frontier model and the stronger choice for the competition.