On 2024/09/13 I will sample one post, weighted by karma, from all posts on the Alignment Forum published between 2023/09/13 and 2024/09/13 that express an opinion on whether prosaic interpretability is net useful for aligning future, dangerous AI. (So a post with 4 karma is twice as likely to get picked as one with 2 karma.)
If the sampled post contributes to prosaic interpretability or is in favor of past/future interpretability research, this question resolves to "yes".
I won't vote on this. I hope, but do not guarantee, to maintain an updated list of the posts I'll sample from, with their labels, somewhere here.
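To make the weighting concrete, here is a tiny sketch (my own illustration, not part of the resolution procedure itself) of how karma-weighted sampling behaves: a 4-karma post should get picked roughly twice as often as a 2-karma one.

```python
import random
from collections import Counter

# Toy illustration of karma-weighted sampling: the 4-karma post should be
# chosen about twice as often as the 2-karma post over many draws.
posts = ["post_a (4 karma)", "post_b (2 karma)"]
counts = Counter(random.choices(posts, weights=[4, 2], k=60_000))
print(counts)  # roughly 40000 vs 20000
```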
Looking back, this question wasn't very thrilling.
I do think there is use in predicting next year's Alignment Forum zeitgeist, but ideally I'd have better questions to ask.
Maybe I will just repeat this exact question again, though.
The chosen post is clearly positive on interp, as it contributes to the field itself.
Also, it includes this snippet encouraging future work:
> It'd be good to investigate a better L1 penalty than L1(sqrt(x)). This can be done empirically by throwing lots of L1 loss terms at the wall, or there may be a more analytical solution. Let me know if you have any ideas! Comments and dms are welcome.
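As I read it, the penalty the post refers to is the standard L1 sparsity loss applied to the square root of the SAE activations rather than to the activations directly. A minimal sketch of that idea (my own illustration with made-up toy values, not code from the post):

```python
import torch

def l1_penalty(acts: torch.Tensor) -> torch.Tensor:
    # Standard L1 sparsity penalty on SAE feature activations.
    return acts.abs().sum()

def sqrt_l1_penalty(acts: torch.Tensor) -> torch.Tensor:
    # L1(sqrt(x)): the same penalty applied to the square root of the
    # activations, which weights small activations relatively more heavily
    # and large activations less heavily than plain L1.
    return acts.abs().sqrt().sum()

acts = torch.tensor([0.01, 0.5, 2.0])  # toy activations, not from the post
print(l1_penalty(acts).item(), sqrt_l1_penalty(acts).item())
```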
I selected text from the interpretability tag, edited out redundant information with vim, and then had Python choose the post I will read.
The chosen post is "Improving SAE's by Sqrt()-ing L1 & Removing Lowest Activating Features".
Python script:
import random

with open("./manifold-afxai-list.txt", "r") as file:
    text = file.read()

weights = []
titles = []
for entry in text.split("\n\n"):
    title_weight = entry.split("\n")
    weights.append(int(title_weight[0]))
    titles.append(title_weight[1])
assert len(weights) == len(titles)
print("There are", len(titles), "new posts on AF with the interp tag since last year.")

# Setting the resolution date as the seed.
# This does not work if you anticipate that I'd do this, but for this time you get a tiny bit of evidence that I'm not cherry-picking a seed I like.
# In future instances it would be nice to use a public RNG.
# Though, I'm not 100% sure your device would get the same result with the same seed...
random.seed(20240913)
# I never ran this before having chosen the above seed.
[title] = random.choices(titles, weights=weights, k=1)
print("The chosen post is:", title)
Extracted and edited text from AF; above every title is the post's karma:
13
AXRP Episode 35 - Peter Hase on LLM Beliefs and Easy-to-Hard Generalization
32
Showing SAE Latents Are Not Atomic Using Meta-SAEs
24
Measuring Structure Development in Algorithmic Transformers
7
Finding Deception in Language Models
30
Calendar feature geometry in GPT-2 layer 8 residual stream SAEs
17
Extracting SAE task features for in-context learning
55
You can remove GPT2’s LayerNorm by fine-tuning for an hour
25
Self-explaining SAE features
94
The ‘strong’ feature hypothesis could be wrong
4
Limitations on the Interpretability of Learned Features from Sparse Dictionary Learning
25
Pacing Outside the Box: RNNs Learn to Plan in Sokoban
29
BatchTopK: A Simple Improvement for TopK-SAEs
25
Feature Targeted LLC Estimation Distinguishes SAE Features from Random Directions
29
JumpReLU SAEs + Early Access to Gemma 2 SAEs
14
Truth is Universal: Robust Detection of Lies in LLMs
54
A List of 45+ Mech Interp Project Ideas from Apollo Research’s Interpretability Team
25
SAEs (usually) Transfer Between Base and Chat Models
20
An Introduction to Representation Engineering - an activation-based paradigm for controlling LLMs
18
Stitching SAEs of different sizes
53
An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2
35
[Interim research report] Activation plateaus & sensitive directions in GPT2
40
Decomposing the QK circuit with Bilinear Sparse Dictionary Learning
43
OthelloGPT learned a bag of heuristics
39
Interpreting Preference Models w/ Sparse Autoencoders
12
Representation Tuning
43
Compact Proofs of Model Performance via Mechanistic Interpretability
103
SAE feature geometry is outside the superposition hypothesis
18
Attention Output SAEs Improve Circuit Analysis
1
Analysing Adversarial Attacks with Linear Probing
10
SAEs Discover Meaningful Features in the IOI Task
59
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network
20
Is This Lie Detector Really Just a Lie Detector? An Investigation of LLM Probe Specificity.
42
Apollo Research 1-year update
22
Announcing Human-aligned AI Summer School
69
EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024
56
The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks
28
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning
4
Visualizing neural network planning
28
Mechanistic Interpretability Workshop Happening at ICML 2024!
33
Transcoders enable fine-grained interpretable circuit analysis for language models
75
Refusal in LLMs is mediated by a single direction
33
Superposition is not "just" neuron polysemanticity
39
Improving Dictionary Learning with Gated Sparse Autoencoders
25
ProLU: A Nonlinearity for Sparse Autoencoders
40
[Full Post] Progress Update #1 from the GDM Mech Interp Team
36
[Summary] Progress Update #1 from the GDM Mech Interp Team
57
Discriminating Behaviorally Identical Classifiers: a model problem for applying interpretability to scalable oversight
144
Transformers Represent Belief State Geometry in their Residual Stream
44
Sparsify: A mechanistic interpretability research agenda
41
A Selection of Randomly Selected SAE Features
30
SAE-VIS: Announcement Post
51
SAE reconstruction errors are (empirically) pathological
43
Announcing Neuronpedia: Platform for accelerating research into Sparse Autoencoders
36
Stagewise Development in Neural Networks
13
AtP*: An efficient and scalable method for localizing LLM behaviour to components
10
Improving SAE's by Sqrt()-ing L1 & Removing Lowest Activating Features
18
Laying the Foundations for Vision and Multimodal Mechanistic Interpretability & Open Problems
25
Understanding SAE Features with the Logit Lens
33
We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To
1
What’s in the box?! – Towards interpretability by distinguishing niches of value within neural networks.
81
Timaeus's First Four Months
6
Difficulty classes for alignment properties
45
Addressing Feature Suppression in SAEs
28
Attention SAEs Scale to GPT-2 Small
39
Open Source Sparse Autoencoders for all Residual Stream Layers of GPT2-Small
84
Toward A Mathematical Framework for Computation in Superposition
35
Sparse Autoencoders Work on Attention Layer Outputs
13
Case Studies in Reverse-Engineering Sparse Autoencoder Features by Using MLP Linearization
10
Mech Interp Challenge: January - Deciphering the Caesar Cipher Model
6
Fact Finding: Do Early Layers Specialise in Local Processing? (Post 5)
12
Fact Finding: How to Think About Interpreting Memorisation (Post 4)
10
Fact Finding: Simplifying the Circuit (Post 2)
48
Fact Finding: Attempting to Reverse-Engineer Factual Recall on the Neuron Level (Post 1)
4
Assessment of AI safety agendas: think about the downside risk
13
Interpreting the Learning of Deceit
27
Finding Sparse Linear Connections between Features in LLMs
34
Refusal mechanisms: initial experiments with Llama-2-7b-chat
55
Deep Forgetting & Unlearning for Safely-Scoped LLMs
32
Intro to Superposition & Sparse Autoencoders (Colab exercises)
16
Incidental polysemanticity
29
Polysemantic Attention Head in a 4-Layer Transformer
32
Growth and Form in a Toy Model of Superposition
6
Mech Interp Challenge: November - Deciphering the Cumulative Sum Model
42
Charbel-Raphaël and Lucius discuss interpretability
13
Machine Unlearning Evaluations as Interpretability Benchmarks
75
Announcing Timaeus
29
Thoughts On (Solving) Deep Deception
1
Can we isolate neurons that recognize features vs. those which have some other role?
41
Revealing Intentionality In Language Models Through AdaVAE Guided Sampling
42
Investigating the learning coefficient of modular addition: hackathon project
38
[Paper] All's Fair In Love And Love: Copy Suppression in GPT-2 Small
37
Paper: Understanding and Controlling a Maze-Solving Policy Network
14
Attributing to interactions with GCPD and GWPD
36
You’re Measuring Model Complexity Wrong
56
Comparing Anthropic's Dictionary Learning to Ours
110
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning
7
Ideation and Trajectory Modelling in Language Models
7
Mech Interp Challenge: October - Deciphering the Sorted List Model
16
New Tool: the Residual Stream Viewer
28
High-level interpretability: detecting an AI's objectives
9
Announcing the CNN Interpretability Competition
16
Impact stories for model internals: an exercise for interpretability researchers
63
Sparse Autoencoders Find Highly Interpretable Directions in Language Models
26
Interpretability Externalities Case Study - Hungry Hungry Hippos
23
Three ways interpretability could be impactful
11
Uncovering Latent Human Wellbeing in LLM Embeddings
14
Mech Interp Challenge: September - Deciphering the Addition Model
I'll probably resolve this by sampling from posts with the interpretability tag: https://www.alignmentforum.org/tag/interpretability-ml-and-ai?sortedBy=new. Since there are 84 posts between now and one year ago, I'll first sample, then read the single post the sample landed on, and resolve this market based on its sentiment.
If it is neither pro nor anti, I'll resample.
Any objections? If not, expect me to resolve tomorrow or the day thereafter.
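In code, the procedure I have in mind looks roughly like this (a sketch with placeholder names; I'm assuming a post that takes no clear stance gets dropped before resampling, which the comment above doesn't actually specify):

```python
import random

def resolve(posts, stance_of):
    """posts: list of (title, karma) pairs; stance_of: title -> 'pro', 'anti', or 'neutral'."""
    remaining = list(posts)
    while remaining:
        # Karma-weighted draw of a single post.
        [picked] = random.choices(remaining, weights=[karma for _, karma in remaining], k=1)
        stance = stance_of(picked[0])
        if stance in ("pro", "anti"):
            return picked[0], stance
        # Assumption: drop a neutral post before resampling.
        remaining.remove(picked)
    return None, "unresolved"
```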