Resolution Criteria:
"Authored" means that a model is responsible for everything we would expect a human author to do: come up with the plot, write most of the text, revise the paper. A human editor who says "I got bored in chapter 7" / "your writing style is too fanciful" / whatever is entirely fine; real authors get this kind of feedback from their (human) editors. But more than this is not allowed; as an example, a human editor that rewrites the book in their voice is not allowed. "High-quality" here means that the book is actually halfway decent; if, for example, an AI-written book is so bad that it's bought as a novelty, I won't count it. Basically I'll judge "high-quality" as "it's basically the quality of a book I'd expect to be for sale". I won't be obnoxious and call something low-quality because I don't like it.
Motivation and Context:
Current LLMs are pretty good at helping on knowledge tasks and technical tasks because you can just repeat what is known. But they are not very good at creative tasks. They tell the same story, write the same joke, compose the same poem. This kind of uninspired fiction writing wouldn't sell, and so if an AI system managed to get on the NYT fiction best seller list, something must have changed in their abilities to be creative.
Question copied from: https://nicholas.carlini.com/writing/2024/forecasting-ai-future.html