Will I be impressed by someone using RL through self-play to improve model creativity or aesthetics in 2025?
47% chance (closes 2026)

Researchers have used reinforcement learning to improve LLM performance on math problems, among other things (one example). It's also widely known that firms like Midjourney use human feedback to improve the aesthetic quality of their images. Could LLMs bootstrap aesthetic/creative abilities through self-play (e.g. by using other models as judges of quality and deriving a reward function from their ratings)?
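The "other models as judges" loop described above could be sketched roughly as follows. This is a hypothetical toy illustration, not any lab's actual method: `generate_candidates` and `judge_score` are stand-ins for real generator and judge models, and the rank-centering step is just one plausible way to turn judge scores into a reward signal.

```python
def generate_candidates(prompt, n=4):
    # Stand-in for sampling n completions from a generator LLM.
    return [f"{prompt} (draft {i})" for i in range(n)]

def judge_score(text):
    # Stand-in for a judge model rating aesthetic quality in [0, 1].
    # A real system would prompt a separate LLM to score the artifact.
    return (hash(text) % 100) / 100.0

def reward_function(candidates):
    # Center judge scores so a policy update would favor the judge's
    # preferred outputs and penalize the rest (a common RLHF-style trick).
    scores = [judge_score(c) for c in candidates]
    mean = sum(scores) / len(scores)
    return [s - mean for s in scores]

candidates = generate_candidates("Write a haiku about autumn")
rewards = reward_function(candidates)
best = candidates[max(range(len(rewards)), key=lambda i: rewards[i])]
```

The open question the market asks is whether optimizing against such a judge-derived reward actually improves creative quality, rather than just exploiting the judge.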

Resolves YES if someone releases a model that is commonly understood to have used RL through self-play, and that model produces a publicly accessible artifact in 2025 (a poem, text, song, image, video, interpretive dance, etc.) that I find aesthetically or creatively impressive compared to the pre-self-play version of the model that produced it.


@MalachiteEagle good market!
