
I will compare the state of the art LLMs in June 2025 with current LLMs. I will resolve YES if there's not a marked improvement, no credible, verifiable improvement on the horizon and my judgement is that LLMs 'peaked' around end of 2024. I will resolve NO if something like GPT-5 comes out and it's a (subjectively) step improvement from GPT-4. Improvements around the edges and stuff like multimodality don't count. If Google releases something new that catches up with Claude 3.5/GPT-4 that doesn't count.
I use LLMs extensively daily and my friends have described me as having "such great judgment" but this market is still vibes-based at the end of the day so caveat emptor. I will answer any/all relevant clarifying questions in the comments and add stuff to the description if it helps clear things up. While most of the resolution will be mine, I will allow some external consensus to influence me
I will not trade in the market to avoid (appearance of) conflicts of interest
Update 2025-05-20 (PST) (AI summary of creator comment): The creator has indicated they are about to resolve this market NO, earlier than the originally stated timeframe. See the linked comment for their reasoning.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ122 | |
2 | Ṁ121 | |
3 | Ṁ108 | |
4 | Ṁ50 | |
5 | Ṁ48 |
People are also trading
@dlin007 This doesn't make sense. If you think you can predict what models will be like in a month, then place a bet. But don't resolve early. Someone who claimed a peak right before the first reasoning models came out would look pretty foolish. There could be similar jumps in the near future.
(honestly, probably there won't be. still too soon to call)
@CraigDemel wait, wdym? i'm not claiming models have peaked now, just that they're already demonstrably (and subjectively) better than what was around in Nov 2024. do you disagree?
@dlin007 Apologies, I'm so embarrassed. I didn't remember how I had bet, and somehow perceived that I'd bet no and you'd resolved yes.