Will SOTA on MATH in Sep 2024 utilize a hard-coded search/amplification procedure?
16
121
680
Oct 1
56%
chance

Jeremy Gillen has bet Eli Lifland (myself) that (no-calculator) SOTA on MATH as of Sep 30, 2024 will utilize a hard-coded search/amplification procedure like MCTS. The bet is at 150:200 odds in Jeremy's favor: Eli will pay Jeremy $200 if this market resolves yes, otherwise Jeremy will pay Eli $150.

For current SOTA, see Minerva.

We've agreed on how a few boundary cases would resolve, and any disagreements about boundary cases will be judged by Thomas Larsen. We're reluctant to share the details of boundary cases publicly due to capability speedup concerns, and generally encourage commenters to be careful about infohazards.

Get Ṁ200 play money
Sort by:

predicts YES

@JeanStanislasDenain This was one of our edge cases, we decided it would resolve in Eli's favor. So no.

@JeremyGillen Sorry about that.

Some amazing hubris in assuming out of the thousands of people working in the area you’re the only ones who had some brilliant idea that might “speed things up”

That said, augmentation and solution-checking aren’t quite mcts but are vastly better than one-shot prompting (even restatement, reordering of answers etc. would be obvious day two work in any serious domain)

“Bet on our weird bet of which we disclose no information about wtf it means, and please don’t talk about what it means because if the AI can do 12-th grade math the world will end 🤔”

some say the world will end in fire, others ice, me I predict when someone leaks details about the qualifier to the qualifier to the people shown here