
https://www.twitch.tv/claudeplayspokemon
Claude Plays Pokemon is a Twitch stream where the AI chatbot Claude attempts to beat Pokemon Red. Once the game is reset, all remaining answers resolve NO, even if the stream continues with a new game.
I am N/Aing anything that is annoying to resolve. If I have to pore over multiple days of twitch VODs to figure out which way an answer resolves, I am not going to bother.
Changes to the harness between 3.7's runs and this one: https://docs.google.com/document/d/e/2PACX-1vRIsu2pLI21W4KjfYbN13or8E-8cvJYw570wGMEp4UQU63ZhEh9FPGgj2ark8Yk7Vyrtt9MWq3jnn4h/pub
Some relevant milestones from the second run:
Reached Pewter City between steps 5000-5500
Escaped Mt. Moon at step 21496
Reached Vermilion City between steps 30500-32000
Obtained HM01 Cut between steps 55000-60000
Defeated Surge around step 61000
Obtained HM05 Flash around step 100000? (Unsure)
Update 2025-05-28 (PST) (AI summary of creator comment): For the answer 'Claude uses Dig on the SS Anne', the creator has specified that this refers to Dig being used outside of battle.
Update 2025-05-28 (PST) (AI summary of creator comment): For the answer 'Claude enters Mt. Moon after step 20000':
This condition is met if Claude enters Mt. Moon at any point after step 20,000, including re-entries.
Update 2025-06-09 (PST) (AI summary of creator comment): Regarding a period where the stream was down and VODs are missing:
In the interim, answers for events that must have logically occurred during the downtime to reach the current game state will be resolved to YES (e.g., passing through a necessary town).
Developer logs, once available, will be used to resolve answers affected by the missing VODs.