Epoch AI and AI safety organization METR published the full results of MirrorCode on June 26, 2026 — a benchmark that answers a question the field has been unable to measure cleanly: how much ...
The accessibility tree decides whether an AI agent can read and act on your page. The 2026 data says the web is getting ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results