🐕 Will Any AI Effectively Achieve Higher than Human Level Reasoning Through Common Sense Questions, By 2023 End?

390Ṁ1888

resolved Jan 4

Resolved

ALL

Preface:

Please read the preface for this type of market and other similar third-party validated AI markets here.

Third-Party Validated, Predictive Markets: AI Theme

Market Description

ARC

The AI2 Reasoning Challenge (ARC) aims to promote research in advanced question-answering, in particular questions that require reasoning, use of commonsense knowledge, and other methods for deeper text comprehension. In particular, the ARC Challenge questions are those that are hard to answer with simple baselines.

Example ARC Question

Which property of a mineral can be determined just by looking at it?

(A) luster
(B) mass
(C) weight
(D) hardness

https://leaderboard.allenai.org/arc/submissions/public

https://paperswithcode.com/sota/common-sense-reasoning-on-arc-challenge

Papers with Code - ARC (Challenge) Benchmark (Common Sense Reasoning)

The current state-of-the-art on ARC (Challenge) is GPT-4 (few-shot, k=25). See a full comparison of 34 papers with code.

Market Resolution

As of the time of market creation in July 2023, the top submission is GPT-4 with 96.3:

Resolution Criteria

We will define Superintelligence for the purposes of this question as, "achieving 99% accuracy on the test in question."
Will any entry from the above two links result in a 99% Accuracy Rating? If so, resolves YES, otherwise NO.

20230727 - Changed title, "Superintelligence" to "Higher than Human Level"

Technical AI Timelines

New Year's Resolutions 2024

Third Party Validated, Predictive Markets

Third Party Validated, Predictive Markets: AI

Get

1,000

to start trading!

🏅 Top traders

#	Name	Total profit
1		Ṁ132
2		Ṁ61
3		Ṁ24
4		Ṁ13
5		Ṁ8

People are also trading

Will AI be smarter than any one human probably around the end of 2025?

16% chance

Will AI top level capabilities generally be judged by question and answer benchmarks in 2029?

25% chance

Will AI surpass human intellect by 2030?

91% chance

Will AI enable a successful conversation between a human and a member of a non-human species by the end of 2030?

81% chance

Will an ai be trained/fine-tuned on Quora by 2035?

Sort by:

We're still at 96.4 so resolving NO.

predictedNO

Can these 2023 markets resolve?

@Hedgehog Working on it!

predictedNO

@PatrickDelaney Thanks!

Argh, your 🐕 emoji in the beginning of the question tricked me into clicking on it, here's a bet

Defining 99% as "superintelligence" seems like nonsense for a test where humans write the answer sheet.

@EliezerYudkowsky Can you suggest a better term? Happy to change it. I'm trying to express the idea that in this particular domain or set of tests, the measurement of human performance has already been surpassed by AI.

@PatrickDelaney I'd write the title as follows:
"Will any AI get a significantly better score than humans on common sense reasoning questions by the end of 2023?"

People are also trading

Will AI be smarter than any one human probably around the end of 2025?

16% chance

Will AI top level capabilities generally be judged by question and answer benchmarks in 2029?

25% chance

Will AI surpass human intellect by 2030?

91% chance

Will AI enable a successful conversation between a human and a member of a non-human species by the end of 2030?

81% chance

Will an ai be trained/fine-tuned on Quora by 2035?

92% chance

ARC

Example ARC Question

🏅 Top traders

People are also trading

People are also trading

Related questions