Will "Mechanistically Eliciting Latent Behaviors in..." make the top fifty posts in LessWrong's 2024 Annual Review? | Manifold

Will "Mechanistically Eliciting Latent Behaviors in..." make the top fifty posts in LessWrong's 2024 Annual Review?

1

100Ṁ164

Feb 1

5%

chance

1H

6H

1D

1W

1M

ALL

As part of LessWrong's Annual Review, the community nominates, writes reviews, and votes on the most valuable posts. Posts are reviewable once they have been up for at least 12 months, and the 2024 Review resolves in February 2026.

This market will resolve to 100% if the post Mechanistically Eliciting Latent Behaviors in Language Models is one of the top fifty posts of the 2024 Review, and 0% otherwise. The market was initialized to 14%.

LessWrong Annual Review

Get

1,000

to start trading!

People are also trading

Will "Me, Myself, and AI: the Situational Awareness..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Discriminating Behaviorally Identical Classif..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "The Leopold Model: Analysis and Reactions" make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Behavioral red-teaming is unlikely to produce..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "AIs Will Increasingly Attempt Shenanigans" make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Explore More: A Bag of Tricks to Keep Your Li..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "The Local Interaction Basis: Identifying Comp..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Two easy things that maybe Just Work to impro..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "What Goes Without Saying" make the top fifty posts in LessWrong's 2024 Annual Review?

Will "o1: A Technical Primer" make the top fifty posts in LessWrong's 2024 Annual Review?

Related questions

Will "Me, Myself, and AI: the Situational Awareness..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Discriminating Behaviorally Identical Classif..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "The Leopold Model: Analysis and Reactions" make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Behavioral red-teaming is unlikely to produce..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "AIs Will Increasingly Attempt Shenanigans" make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Explore More: A Bag of Tricks to Keep Your Li..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "The Local Interaction Basis: Identifying Comp..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "Two easy things that maybe Just Work to impro..." make the top fifty posts in LessWrong's 2024 Annual Review?

Will "What Goes Without Saying" make the top fifty posts in LessWrong's 2024 Annual Review?

Will "o1: A Technical Primer" make the top fifty posts in LessWrong's 2024 Annual Review?

© Manifold Markets, Inc.•Terms•Privacy