Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on Simple Bench?
Basic
4
Ṁ38Feb 2
66%
chance
1D
1W
1M
ALL
https://simple-bench.com/ Claude 3.5 Sonnet 10/22 achieves 41.4% whereas the best Gemini model scores 27.1%
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Related questions
Related questions
What will be the best score on Cybench by December 31st 2025?
Will Gemini achieve a higher score on the SAT compared to GPT-4?
70% chance
Will Gemini exceed the performance of GPT-4 on the 2022 AMC 10 and AMC 12 exams?
72% chance
Will any model get above human level on the Simple Bench benchmark before September 1st, 2025.
55% chance
Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?
18% chance
How long until one of Gemini, Claude, etc... match the capabilities of O1?
Will Gemini Ultra outperform GPT-4V on visual reasoning by the end of 2024?
59% chance
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
Will Gemini be released before 2024? x Will GPT-5 be released before 2025?
What will be true of Gemini 2?