Intology’s Locus result on RE-Bench real?
resolved Nov 24
Resolved
NO

Ways this would be real:

  • Result is independently replicated

  • Model is clearly found to be strong SOTA at SWE tasks similar to RE-Bench

Ways this would not be real:

  • They announce that the reported score was caused in part by an error in their setup or by extensive reward hacking by their model (it ‘cheated’)

  • Independently replicated and score is nowhere near human level

Failing these, resolves to the consensus of credible people, let’s say in Feb 2025.

bought Ṁ2,500 NO

@AndrewImpellitteri is this enough for resolution to NO?

@AndrewImpellitteri I’m leaning that way, but I’ll keep it open for a bit and would prefer stronger evidence.
