Will pre-2026 AI out-forecast the Metaculus community?

1kṀ6169

2026

44% chance

Will an AI system out-perform the Metaculus community prediction before 2026? Any amount of scaffolding is allowed.

If this does not happen, and no negative result comes out in the last quarter of 2025, then this question resolves to my subjective credence that this could be done with an existing AI system and scaffolding. Specifically, it resolves to my credence on the proposition 'Using 4 months of individual-engineering time, a pre-2026 AI could be fine-tuned and scaffolded to out-perform, on mean Brier score, over all binary questions on Metaculus'.
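For readers unfamiliar with the metric, here is a minimal sketch of how a mean Brier score comparison on binary questions could be computed. The function names and the forecast numbers are made up for illustration; they are not taken from Metaculus or from any study, and the market creator's actual resolution procedure may differ.

```python
from statistics import mean


def brier(forecast: float, outcome: int) -> float:
    """Brier score for one binary question: squared error, lower is better."""
    return (forecast - outcome) ** 2


def mean_brier(forecasts: list[float], outcomes: list[int]) -> float:
    """Average Brier score over a set of resolved binary questions."""
    if not forecasts or len(forecasts) != len(outcomes):
        raise ValueError("need one forecast per resolved question")
    return mean(brier(p, o) for p, o in zip(forecasts, outcomes))


# Hypothetical data: 1 = question resolved YES, 0 = resolved NO.
outcomes = [1, 0, 1, 0]
ai = [0.8, 0.1, 0.6, 0.3]          # illustrative AI forecasts
community = [0.7, 0.2, 0.7, 0.2]   # illustrative community predictions

ai_score = mean_brier(ai, outcomes)
community_score = mean_brier(community, outcomes)

# The AI "out-performs" under this reading only if its mean score is lower.
ai_outperforms = ai_score < community_score
print(ai_score, community_score, ai_outperforms)
```

With these made-up numbers the AI's mean Brier score is slightly higher (worse) than the community's, so it would not count as out-performing.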

I will not participate in this market.

Technology

Technical AI Timelines

Meta-Forecasting


11 Comments

44 Holders

101 Trades


sold Ṁ21 NO

https://x.com/Simeon_Cps/status/1878022286613074271

I propose adding "Specifically, my credence on the proposition 'Using 4 months of individual-engineering time, a pre-2026 AI could be fine-tuned and scaffolded to out-perform, on mean Brier score, over all binary questions on Metaculus'."

If no one objects within a week, I'll add this.

bought Ṁ100 YES

"If this does not happen, and no negative result comes out in the last quarter of 2025, then this question resolves to my subjective credence that this could be done with an existing AI system and scaffolding."

Does this include finetuning?

@NoaNabeshima Yes, my subjective credence includes limited fine-tuning; things like the Berkeley group's level of fine-tuning are fine.

bought Ṁ150 NO

I think it can be done in principle; it's just not clear it will be done in practice.

Unless anyone objects, I'll clarify the constraint that this out-performance should hold on average for at least 50% of the questions on Metaculus in a prospective study. Obviously if this ends up depending on my credence, I'll be taking into account other results e.g. the below.

@JacobPfau What do you mean by "hold on average for at least 50% of the questions"? If they outperform, it'll be an average of theirs vs. an average of Metaculus's, I would think?

@Bayesian You're right that the question phrasing implied all questions, though I didn't specify binary vs time series etc. I'll come back to this tomorrow.
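To make the distinction raised in this exchange concrete, here is a small sketch contrasting two possible readings of "out-perform": winning on at least 50% of individual questions versus having a better mean score across all of them. This is only one interpretation of the comments above; the helper name `compare` and all numbers are hypothetical.

```python
from statistics import mean


def compare(ai: list[float], community: list[float], outcomes: list[int]) -> dict:
    """Return the AI's per-question win fraction and its mean Brier difference."""
    ai_scores = [(p - o) ** 2 for p, o in zip(ai, outcomes)]
    comm_scores = [(p - o) ** 2 for p, o in zip(community, outcomes)]
    # A "win" is a strictly lower (better) Brier score on a single question.
    wins = sum(a < c for a, c in zip(ai_scores, comm_scores))
    return {
        "win_fraction": wins / len(outcomes),
        # Negative means the AI is better on average; positive means worse.
        "mean_diff": mean(ai_scores) - mean(comm_scores),
    }


# Hypothetical case: the AI is slightly better on two questions
# and confidently wrong on the third.
result = compare(
    ai=[0.9, 0.9, 0.9],
    community=[0.8, 0.8, 0.5],
    outcomes=[1, 1, 0],
)
print(result)
```

In this made-up case the AI wins on two of three questions yet has a worse mean Brier score, which is why the two criteria can disagree and why the choice between them matters for resolution.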