
Will "Preventing Language Models from hiding their ..." make the top fifty posts in LessWrong's 2023 Annual Review?
Will "Preventing Language Models from hiding their ..." make the top fifty posts in LessWrong's 2023 Annual Review?
1
1kṀ50resolved Feb 11
Resolved
NO1D
1W
1M
ALL
As part of LessWrong's Annual Review, the community nominates, writes reviews, and votes on the most valuable posts. Posts are reviewable once they have been up for at least 12 months, and the 2023 Review resolves in February 2025.
This market will resolve to 100% if the post Preventing Language Models from hiding their reasoning is one of the top fifty posts of the 2023 Review, and 0% otherwise. The market was initialized to 14%.
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ7 |
What is this?
What is Manifold?
Manifold is the world's largest social prediction market.
Get accurate real-time odds on politics, tech, sports, and more.
Or create your own play-money betting market on any question you care about.
Are our predictions accurate?
Yes! Manifold is very well calibrated, with forecasts on average within 4 percentage points of the true probability. Our probabilities are created by users buying and selling shares of a market.
In the 2022 US midterm elections, we outperformed all other prediction market platforms and were in line with FiveThirtyEight’s performance. Many people who don't like betting still use Manifold to get reliable news.
Why use play money?
Mana (Ṁ) is the play-money currency used to bet on Manifold. It cannot be converted to cash. All users start with Ṁ1,000 for free.
Play money means it's much easier for anyone anywhere in the world to get started and try out forecasting without any risk. It also means there's more freedom to create and bet on any type of question.
Related questions
What is this?
What is Manifold?
Manifold is the world's largest social prediction market.
Get accurate real-time odds on politics, tech, sports, and more.
Or create your own play-money betting market on any question you care about.
Are our predictions accurate?
Yes! Manifold is very well calibrated, with forecasts on average within 4 percentage points of the true probability. Our probabilities are created by users buying and selling shares of a market.
In the 2022 US midterm elections, we outperformed all other prediction market platforms and were in line with FiveThirtyEight’s performance. Many people who don't like betting still use Manifold to get reliable news.
Why use play money?
Mana (Ṁ) is the play-money currency used to bet on Manifold. It cannot be converted to cash. All users start with Ṁ1,000 for free.
Play money means it's much easier for anyone anywhere in the world to get started and try out forecasting without any risk. It also means there's more freedom to create and bet on any type of question.
Related questions
Will "Language Models Model Us" make the top fifty posts in LessWrong's 2024 Annual Review?
14% chance
Will "The case for more ambitious language model evals" make the top fifty posts in LessWrong's 2024 Annual Review?
12% chance
Will "Alignment Faking in Large Language Models" make the top fifty posts in LessWrong's 2024 Annual Review?
94% chance
Will "Takes on "Alignment Faking in Large Language ..." make the top fifty posts in LessWrong's 2024 Annual Review?
19% chance
Will "Mechanistically Eliciting Latent Behaviors in..." make the top fifty posts in LessWrong's 2024 Annual Review?
14% chance
Will "What Goes Without Saying" make the top fifty posts in LessWrong's 2024 Annual Review?
15% chance
Will "The Leopold Model: Analysis and Reactions" make the top fifty posts in LessWrong's 2024 Annual Review?
13% chance
Will "Dialogue introduction to Singular Learning Theory" make the top fifty posts in LessWrong's 2024 Annual Review?
19% chance
Will "Ablations for “Frontier Models are Capable of..." make the top fifty posts in LessWrong's 2024 Annual Review?
15% chance
Will "Frontier Models are Capable of In-context Sch..." make the top fifty posts in LessWrong's 2024 Annual Review?
14% chance