Which organization's model will be ranked first as of 18 December 2026 on Humanity's Last Exam (HLE), according to the Center for AI Safety (CAIS)?
Closing Dec 18, 2026 08:01AM UTC
HLE is "a multi-modal benchmark at the frontier of human knowledge, designed to be the final closed-ended academic benchmark of its kind with broad subject coverage" with 2,500 questions spanning over 100 subjects (CAIS - HLE). The question will be suspended on 17 December 2026 and the outcome determined using the ranks as reported by CAIS at approximately 5:00PM ET on 18 December 2026 (CAIS AI Dashboard, see "Text Capabilities Index" and click the header for the "Humanity's Last Exam" column). As of 2 February 2026, Google was ranked first with its "Gemini 3 Pro" model with a score of 38.3%. In the event of a tie for first place by models of different organizations, the model to have made the high score first will be considered the winner (CAIS - HLE, see "AI Progress on Humanity's Last Exam" chart). If a listed organization ceases to exist due to a merger or acquisition, the successor in interest organization will count as that named organization.
Confused? Check our FAQ or ask us for help. To learn more about Good Judgment and Superforecasting, click here.
To learn more about how you can become a Superforecaster, see here. For other posts from our Insights blog, click here.