Is GPT-4-turbo (1106-preview) more capable than GPT-4 (0613)?
Resolved YES on Jul 22

Resolves using the average over the four best benchmark comparisons published in academic papers or preprints at the time of resolution (e.g. MATH, MMLU, HumanEval, SWAG).

Resolves NA if no benchmark comparisons between 1106-preview and 0613 are available.
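
The resolution rule above can be sketched in a few lines of Python. This is only an illustrative reading of the criteria, not the market creator's actual procedure. The function name `resolve`, the input layout, and the tie handling (a tie resolves NO) are assumptions. Choosing the "four best" comparisons is left to the caller, since the description does not define it.

```python
from statistics import mean
from typing import Mapping, Tuple


def resolve(comparisons: Mapping[str, Tuple[float, float]]) -> str:
    """Resolve the market from benchmark comparisons.

    `comparisons` maps a benchmark name to a pair
    (score for 1106-preview, score for 0613), both on the same scale.
    Hypothetical helper: the name and input layout are assumptions.
    """
    # No published comparison at all: the market resolves NA.
    if not comparisons:
        return "NA"

    # Average each model's score over the selected benchmarks.
    avg_turbo = mean(turbo for turbo, _ in comparisons.values())
    avg_base = mean(base for _, base in comparisons.values())

    # Assumption: a tie counts as "not more capable" and resolves NO.
    return "YES" if avg_turbo > avg_base else "NO"


if __name__ == "__main__":
    # Placeholder values for illustration only, NOT real benchmark results.
    example = {
        "MATH": (0.6, 0.5),
        "MMLU": (0.4, 0.45),
    }
    print(resolve(example))
    print(resolve({}))
```

Averaging raw scores assumes all benchmarks report on a comparable scale (for example, accuracy between 0 and 1). If they do not, normalizing each benchmark before averaging would be a reasonable alternative.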

© Manifold Markets, Inc.