Current thoughts on resolving; will firm up over coming weeks:
- kicking the can on the actual question(s) to avoid it ending up in the training data (but will stick with the 7 gear question above if there is high confidence it isn't in training data)
- key aspect of the challenge seems to be (a) requires a few steps of deductive reasoning about the physical world (b) superficial similarity to a simpler question of this type (c) a quirk in the question that makes pattern-matching to solving the simpler question wrong
- with be deferent within reason to Yann LeCun as well as the comments when coming up with which question(s) to ask that best capture the intention of this market
- model should get it right >66% of the time; no clever prompting, just straight up asking it
O1-preview gets it correct!
https://chatgpt.com/share/66e3416c-df80-800c-9342-31efa7885616
Closed this for now;
@traders, let me know if anyone objects to resolving yes.