
Will someone find a truth-telling vector which modifies completions in a range of situations by 2024-10-24?
Resolved Nov 5 as 99.0%
TurnTrout et al. asked people to predict whether they would find a "truth-telling vector" that worked as an algorithmic value edit for a large language model. Here's the post where they asked for predictions:
That resolved NO: they were unable to find one. They also weren't able to find a "speaking French vector". But then a poster in the comments found one:
Will anyone find a "truth-telling vector" by 2024-10-24? I will resolve based on what I know, so if someone finds one, I hope they will tell us about it on Manifold or LessWrong to help me resolve the market. They should provide a similar quality of evidence, such as an explanation of their technique and a link to a Colab.
This question is managed and resolved by Manifold.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | | Ṁ84 |
2 | | Ṁ57 |
3 | | Ṁ34 |
4 | | Ṁ16 |
5 | | Ṁ16 |