Will LLMs be able to formally verify non-trivial programs by the end of 2025?
Closes Dec 31 · 33% chance

This resolves YES if there is reasonable evidence (or my own experimentation) showing that an LLM can reliably write specs/proofs for a deductive verifier for a reasonably complex program with no extra help (access to the verifier to try things, and some scaffolding to make that possible, are OK). The result should be reasonably stable across different verifiers and kinds of programs.
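
For illustration only, here is a minimal sketch of the kind of "verifier access plus scaffolding" setup described above: a loop that asks an LLM to add spec/proof annotations to a program and feeds the verifier's errors back until it verifies. Dafny is used purely as an example deductive verifier (the question is not tied to it), the `Max` method and prompt are made up for the example, and `query_llm` is a hypothetical placeholder for whatever model API is used.

```python
"""Sketch of an LLM + deductive-verifier scaffolding loop (illustrative, not
the author's setup). Assumes the Dafny CLI (`dafny verify`) is installed;
`query_llm` is a hypothetical stand-in for a real LLM API client."""
import subprocess
import tempfile
from pathlib import Path

# The unannotated program the LLM must specify and prove (Dafny syntax).
UNANNOTATED = """
method Max(a: array<int>) returns (m: int)
{
  m := a[0];
  var i := 1;
  while i < a.Length
  {
    if a[i] > m { m := a[i]; }
    i := i + 1;
  }
}
"""

PROMPT = (
    "Add requires/ensures clauses and loop invariants so that the following "
    "Dafny method verifies. Return only the complete annotated Dafny code.\n\n"
)


def query_llm(prompt: str) -> str:
    """Hypothetical LLM call; replace with a real API client."""
    raise NotImplementedError


def run_verifier(code: str) -> tuple[bool, str]:
    """Write the candidate to a temp file and run `dafny verify` on it."""
    with tempfile.TemporaryDirectory() as d:
        path = Path(d) / "candidate.dfy"
        path.write_text(code)
        proc = subprocess.run(
            ["dafny", "verify", str(path)],
            capture_output=True, text=True,
        )
        return proc.returncode == 0, proc.stdout + proc.stderr


def verify_with_retries(max_attempts: int = 5) -> str | None:
    """Ask the LLM for annotations, feeding verifier errors back on failure."""
    prompt = PROMPT + UNANNOTATED
    for _ in range(max_attempts):
        candidate = query_llm(prompt)
        ok, output = run_verifier(candidate)
        if ok:
            return candidate
        prompt = (
            PROMPT + candidate
            + "\n\nThe verifier reported:\n" + output
            + "\nFix the annotations and return the full program again."
        )
    return None
```

What the question asks is whether a loop like this reliably converges on correct, non-trivial specs and proofs across verifiers and kinds of programs, not whether any particular scaffold works.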

My experiments with o1 and o3-mini currently fail on pretty basic examples. This may be because my prompting is not good enough (verification requires a fair amount of explanation). If it turns out that current models are already capable of this, then this also resolves YES. To me, the gap between the level of reasoning they are doing and what is needed feels substantial enough that I would be surprised if current models succeeded out of the box even with better prompting.

This is a somewhat subjective question. I will (and invite you to) collect evidence in the comments. I am also open to a more objective resolution criterion if people have ideas.
