Why is Claude 3.5 Sonnet such a good model for its size?

Question

Claude sonnet (3.5) is a relatively small model (estimated to be 5e24 FLOPs). Yet it beats larger models on GPQA, LMSYS, and many other industry standard benchmarks. While it can’t be known that this market can resolve, it’s possible that academics and OSS will learn in the coming years what was done to achieve this high quality model.

Manifold Markets · Answer

Per Manifold Markets prediction market, Pretraining data composition, followed by Doesn't use any scale.ai training data and Offline policy learning RLHf are most likely. See the market for live updates (8 traders, as of Feb 18, 2025).

People are also trading

Related questions