Ciro Santilli
OurBigBook.com
$£
Sponsor
中国
独裁统治 China Dictatorship 新疆改造中心、六四事件、法轮功、郝海东、709大抓捕、2015巴拿马文件 邓家贵、低端人口、西藏骚乱
Humanity's Last Exam
(2025)
...
Generative AI by modality
AI text generation
Text-to-text model
Large language model
LLM benchmark
List of LLM benchmarks
OurBigBook.com
Tags:
AI Math benchmark
Words: 24
Contains highly specialized questions in various academic fields, including
mathematics
. The problems are answered either with a number, or multiple choice, or free text.
arxiv.org/abs/2501.1424
huggingface.co/datasets/cais/hle
agi.safe.ai/
Ancestors
(15)
List of LLM benchmarks
LLM benchmark
Large language model
Text-to-text model
AI text generation
Generative AI by modality
Generative AI
AI by capability
Artificial intelligence
Machine learning
Computer
Information technology
Area of technology
Technology
Home