Roche asks:

Which organization's large language model (LLM) will be ranked first as of 29 May 2026, according to Arena's "Text Arena" leaderboard?

Started Feb 11, 2026 02:30PM UTC
Closing May 29, 2026 07:01AM UTC

Arena is an open-source platform for crowdsourced AI benchmarking, created by researchers from UC Berkeley SkyLab (SkyLab). For more information on how the leaderboard is constructed, see Arena - Blog. The question will be suspended on 28 May 2026 and the outcome determined using the ranks as reported by Arena at approximately 5:00p.m. ET on 29 May 2026 (Arena - Text Arena Leaderboard, see "Rank"). As of 5 February 2026, Google was ranked first, with its "gemini-3-pro" scoring 1487, followed by xAI's "grok-4.1-thinking" scoring 1475. In the event of a tie for first place by LLMs of different organizations, the LLM with the higher "Score" will be considered first, followed by the "Votes" total. If the named source changes the way it presents the data, further instructions will be provided. Models marked "Preliminary" next to their score will not count.

Confused? Check our FAQ or ask us for help. To learn more about Good Judgment and Superforecasting, click here.

To learn more about how you can become a Superforecaster, see hereFor other posts from our Insights blog, click here.

Possible Answer Crowd Forecast Change in last 24 hours
Anthropic 34.00% +14.00%
Google 29.50% +9.50%
OpenAI 7.25% -12.75%
xAI 22.00% +2.00%
Another organization 7.25% -12.75%

Sign up or sign in to forecast!

Sign Up Sign In
Files
Tip: Mention someone by typing @username