Skip to content

Alternatives to Humanity's Last Exam (HLE)

Humanity's Last Exam (HLE) and 2 alternative tools evaluated on the Tekai technology radar.

Humanity's Last Exam (HLE)

Subject

A 2,500-question expert-level benchmark curated by ~1,000 specialists to measure AI capabilities where frontier models still score 40-50%.

open-source CC-BY-4.0
assess
View full details →

Alternatives

Comparison Summary

Tool Radar Type License
Humanity's Last Exam (HLE) assess open-source CC-BY-4.0
MMLU (Massive Multitask Language Understanding) hold open-source MIT
HCAST (Human-Calibrated Autonomy Software Tasks) assess open-source MIT