Skip to content

Alternatives to MMLU (Massive Multitask Language Understanding)

MMLU (Massive Multitask Language Understanding) and 2 alternative tools evaluated on the Tekai technology radar.

MMLU (Massive Multitask Language Understanding)

Subject

A benchmark of 15,908 multiple-choice questions across 57 academic subjects for evaluating LLM knowledge, now effectively saturated by frontier models.

open-source MIT
hold
View full details →

Alternatives

Comparison Summary

Tool Radar Type License
MMLU (Massive Multitask Language Understanding) hold open-source MIT
Humanity's Last Exam (HLE) assess open-source CC-BY-4.0
HCAST (Human-Calibrated Autonomy Software Tasks) assess open-source MIT