xAI Grok will beat OpenAI's flagship model on HumanEval benchmarks by the end of 2024. | Manifold

xAI Grok will beat OpenAI's flagship model on HumanEval benchmarks by the end of 2024.

Plus

101

Ṁ14k

resolved Jun 9

Resolved

NO

1D

1W

1M

ALL

This is inclusive of any new models OpenAI unveils in 2024, but the question resolves to "yes" if Grok beats OpenAI at any time in 2024 against their current state of the art model.

This question is managed and resolved by Manifold.

#Technical AI Timelines

Get

1,000

and

3.00

Sort by:

@mods resolves NO

@mods Can this be resolved NO?

DanboughtṀ150YES

https://x.ai/blog/grok-1.5

They claim to have done it:

bought Ṁ10 YES from 69% to 71%

@DanMan314 67.0% is the HumanEval figure from the original GPT-4 report published more than a year ago. The current zero-shot GPT-4 performance, as reported by Papers With Code, is 76.5%, which is from Guo et al. (January 2024).

Note that the market creator is banned, so this will probably be resolved by moderators. Personally, I think the current version of GPT-4 is the more natural interpretation of "OpenAI's flagship model" than the original version of GPT-4.

DansoldṀ212YES

@Jacy Yea I just looked into it and I agree with your assessment.

Related questions

Open-source OpenAI model beats Grok 4 on LMArena?

Will any AI model score above 95% on GRAB by the end of 2025?

Will there be an AI language model that strongly surpasses ChatGPT and other OpenAI models before the end of 2025?

Will OpenAI claim that it has achieved AGI in 2025?

Will OpenAI still be considered one of the top players in AI by end of 2025

When will xAI release Grok 4 (or Grok 3.5)

Will an AI system beat humans in the GAIA benchmark before the end of 2025?

When will xAI release Grok 4?

Will OpenAI be in the lead in the AGI race end of 2026?

Will an AI by OpenAI beat a super grandmaster playing chess by 2028?

Related questions

Open-source OpenAI model beats Grok 4 on LMArena?

When will xAI release Grok 4 (or Grok 3.5)

Will any AI model score above 95% on GRAB by the end of 2025?

Will an AI system beat humans in the GAIA benchmark before the end of 2025?

Will there be an AI language model that strongly surpasses ChatGPT and other OpenAI models before the end of 2025?

When will xAI release Grok 4?

Will OpenAI claim that it has achieved AGI in 2025?

Will OpenAI be in the lead in the AGI race end of 2026?

Will OpenAI still be considered one of the top players in AI by end of 2025

Will an AI by OpenAI beat a super grandmaster playing chess by 2028?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules