Will language models solve cryptic crosswords by end of 2026?
14
220
Ṁ321Ṁ340
2027
72%
chance
1D
1W
1M
ALL
If a large language model can solve >90% of clues from Financial Times cryptic crosswords then yes otherwise no. Crossword layout as context is allowed.
Here's an example FT crossword:
For example, 2 down is "Giant pig, including tail, served up (7)". Pig is "hog", put tail inside to get "htailog" then reverse it to get "goliath".
Get Ṁ200 play money
Sort by:
In isolation or given the crossword layout as context?
I would argue that 90% of the challenge is eliminated by having the crossword as context (number of letters, positions of letters). In fact, I wouldn't be surprised if many cryptic crosswords could be solved without the clues - just the crossword itself.
Without the crossword layout, I think there is a high degree of ambiguity and 90% might be impossible.
Related questions
Will a large language model beat a super grandmaster playing chess by 2028?
49% chance
Will there be an AI language model that surpasses ChatGPT and other OpenAI models before the end of 2024?
30% chance
By the end of 2024, will there be an LLM prompt that can reliably solve the NYT Connections puzzle?
59% chance
Will language models be able to solve simple graphical mazes by the end of 2025?
65% chance
Will natural language based proof assistants be in common use by 2026?
35% chance
Will any language model trained without large number arithmetic be able to generalize to large number arithmetic by 2026?
69% chance
Will any large language model be able to draw a game of TicTacToe against me by the end of 2024?
72% chance
Will a language model that runs locally on a consumer cellphone beat GPT4 by EOY 2026?
48% chance
Will an Open-Ended Embodied Agent with Large Language Models be able to complete The Witness (2016) by 2024?
32% chance