What Puzzle Am I Looking For?
Here are the requirements for the puzzle I want:
Takes hours for a human to solve (10+ hours). Solvable virtually.
Not solvable by a machine — or at least takes a machine significantly longer than a human (e.g., 10× longer than a human solver).
Quick to verify (ideally seconds).
Mass-generatable by either a human or a computer (I should be able to produce many instances easily).
Requirements #1 and #2 are mandatory. Any puzzle that fails either of these is immediately disqualified.
For #3, I can tolerate verification taking up to 5 minutes. I can also tolerate verification procedures that are probabilistic — for example, ones that confirm correctness with 95% confidence instead of 100%.
For #4, Assuming requirements #1–#3 are satisfied, the quality of a puzzle is judged mainly by how easily it can be mass-generated. Ideally:
Tier 1: A puzzle that a human can generate in under 5 minutes, without using code or an LLM.
Tier 2: A puzzle generated by simple code (I know “simple” is subjective, but I'll use my own judgment).
Tire 3: A puzzle that can only be generated with the help of an LLM
Example of simple code: Generally codes that doesnt require the use of LLM, involving simple logics.
For example, code that piece together a bunch of smaller images into a bigger image is rather simple. code that does arithmetic calculations are rather simple.
I will award the person coming up with the best puzzle a 5000 mana prize.
If someone came up with a qualified Tier 1 puzzle I will resolve the market immediately. If no one managed to come up with a Tier 1 puzzle I will resolve the market based on the best (mass-generatable) proposed puzzle at the end of 2025.
People are also trading
Would making someone do 25000 high end auto generateable captchas count
https://manifold.markets/IsaacKing/will-i-be-able-to-vibecode-a-full-f
The puzzle: Make a working fairy chess website. It must have x, y, and z features.
1) Takes a human 10+ hours to do.
2) The machine couldn’t do it, and almost certainly not by itself.
3) it’s verifiable within five minutes whether a fairychess website is working. (And maybe this could be verifiable by a machine. )
4) it’s very easy to create more of these puzzles by just creating different websites and different functionalities.
@TonyBaloney Also, if you want something more conventionally considered a puzzle, my hunch is that @bens puzzle will also fit these requirements, but this market might close before we find out the solution.