Xodexa AGI Benchmark — Frontier Observatory

Two coordinates per model. Capability (x) is the Xodexa Score, 0 → an AGI horizon that nothing reaches. Honesty (y) is 100 − calibration error, so a confidently-wrong model sinks regardless of how clever it is. The dashed arc is the Pareto frontier — the current edge of the possible. The observatory reads live JSON from ./data/.

Xodexa AGI Benchmark · Frontier Observatory

Where models break

The field, as a ledger