matharena.ai/
This project tests various models against various competitions.
How they "ensure" that models are not contaminated:
"By evaluating models as soon as new problems are released, we effectively eliminate the risk of contamination."
Most of their problems come from olympiads that require only high school knowledge, and they are therefore completely irrelevant for 2025 LLMs.
