Is GPT-4-turbo (1106-preview) more capable than GPT-4 (0613)?
Resolved YES on Jul 22.
Resolves using the average over the four best benchmark comparisons published in academic papers or preprints at the time of resolution (e.g., MATH, MMLU, HumanEval, and SWAG).
Will resolve to NA if there are no benchmark comparisons between the 1106-preview and 0613.
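As a rough illustration of how the rule above could be applied, here is a minimal Python sketch. It assumes all scores are on the same scale, that the average is a simple unweighted mean, that the benchmarks passed in are the ones already chosen as the "four best," and that a tie resolves NO; none of these details are specified in the criteria. The function name `resolve` and the score dictionaries are hypothetical.

```python
from statistics import mean


def resolve(scores_1106: dict[str, float], scores_0613: dict[str, float]) -> str:
    """Return "YES", "NO", or "N/A" under the assumed averaging rule.

    Each argument maps a benchmark name (e.g. "MMLU") to that model's score.
    Only benchmarks reported for BOTH models count as comparisons.
    """
    shared = sorted(set(scores_1106) & set(scores_0613))
    if not shared:
        # No head-to-head benchmark comparisons were published.
        return "N/A"
    avg_1106 = mean(scores_1106[name] for name in shared)
    avg_0613 = mean(scores_0613[name] for name in shared)
    # Assumption: a tie does not count as "more capable".
    return "YES" if avg_1106 > avg_0613 else "NO"
```

Calling `resolve({}, {})` returns `"N/A"`, matching the case where no comparisons exist. A weighted or per-benchmark majority rule would be an equally plausible reading of "average," so treat this as one interpretation only.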
This question is managed and resolved by Manifold.
🏅 Top traders
| # | Total profit |
|---|---|
| 1 | Ṁ91 |
| 2 | Ṁ17 |
| 3 | Ṁ15 |
| 4 | Ṁ5 |
| 5 | Ṁ5 |
@EmilyThomas Good question. It's too late to change it to include the stable version. Let's say only the preview.