AI Safety Research Futarchy: Exploring more metacognitive capabilities of LLMs

1kṀ1018

Oct 13

53%

A LessWrong post is produced within 6 months and gains 50 upvotes or more within a month of posting.

23%

If a LessWrong post is produced, it gains 150 upvotes or more within a month of posting.

75%

A paper is produced and uploaded to arXiv within 9 months.

33%

If a paper is produced, it is accepted to a top ML conference (ICLR, ICML, or NeurIPS) within 6 months of being uploaded to arXiv.

28%

If a paper is produced, it receives 10 citations or more within one year of being uploaded to arXiv.

If chosen, how successful will the research project "Exploring more metacognitive capabilities of LLMs" be?

Project summary: Investigating whether LLMs can metacognitively monitor their own internal probability distributions and predictive models, with implications for deceptive alignment and AI safety.

Detailed project overview.

Clarifications:

Unless otherwise stated, timeframes are given from when the research begins, i.e. the start of the MARS program, 1st December 2025
Updates to posts and papers will be considered the same entity as the original for purposes of outcome resolution (i.e. If a paper is produced and uploaded to arXiv within 9 months, but it is edited after this before being accepted at a conference, (4) still resolves YES)
Some outcomes are conditional on others as follows: outcome (2) will resolve N/A if (1) resolves NO, outcomes (4)-(6) will resolve N/A if (3) resolves NO
All outcomes are conditioned on the project being selected and will resolve N/A if it is not (see main post below)
Provisionally, market will close and decisions will be made on Monday the 12th of October

For more details on AI Safety Research Futarchy, see here.

Update 2025-10-16 (PST) (AI summary of creator comment): The project has been selected and will proceed. The market will remain locked (no trading allowed) and the relevant metrics will resolve when the resolution criteria are met.

Technical AI Timelines

Get

1,000

to start trading!

1 Comment

18 Holders

54 Trades

Sort by: