Will "Discriminating Behaviorally Identical Classif..." make the top fifty posts in LessWrong's 2024 Annual Review?

As part of LessWrong's Annual Review, the community nominates, reviews, and votes on the most valuable posts. Posts are reviewable once they have been up for at least 12 months, and the 2024 Review resolves in February 2026.

This market will resolve to 100% if the post "Discriminating Behaviorally Identical Classifiers: a model problem for applying interpretability to scalable oversight" is one of the top fifty posts of the 2024 Review, and to 0% otherwise. The market was initialized to 14%.
