Resolution is based on the chatbot arena LLM leaderboard (https://lmarena.ai), specifically the company with the highest Arena Score in the Overall category without filters (without style control or show deprecated), at the end of March 31, 2025 11:59PM ET.
See also:
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ7,221 | |
2 | Ṁ2,331 | |
3 | Ṁ1,792 | |
4 | Ṁ954 | |
5 | Ṁ688 |
@jim haha guess we'll see. I think OpenAI has some internal results with o4 (or whatever they're calling it) that are quite far beyond anything Google has.
@jim oh and the image stuff is just with gpt-4o, will be interesting to see how fast they ship gpt-4.5 image capabilities
@MalachiteEagle is it confirmed that 4.5 can generate images natively? i thought 4o was unique in its multimodality?
@SaviorofPlant I haven't seen them explicitly confirm it, but I think the likelihood is very high, especially since it was originally intended to be their gpt-5 model
@MalachiteEagle i think 4o was trained more recently than 4.5, which has an october 2023 knowledge cutoff. seems likely to me that 4o was an experimental model testing this type of multimodality and previous models did not have it
@SaviorofPlant they have a fine-tuning/continual-learning technique allowing them to update the knowledge cutoff from a given base model
@SaviorofPlant they got multimodal working with gpt-4o, released it (without the multimodal output) and then started the work / training run for gpt-4.5. This produced lower-quality results than they hoped, so they went in circles for a few months until the episode with Q* happened
@MalachiteEagle oh hang on, that's not right. They released gpt-4o in first half of 2024. Q* shenanigans happened in late 2023
@SaviorofPlant but point being that I'm pretty sure gpt-4.5 training run was a scaled up version of what they got working for gpt-4o
@SaviorofPlant created a question for this: https://manifold.markets/MalachiteEagle/will-openai-release-true-multimodal?play=true
Oh sorry I misread that as "before March".
Here's a question in case one of these appears tomorrow: https://manifold.markets/MalachiteEagle/will-there-be-a-new-and-smart-anony?play=true