What alignment proposals and research directions will I be excited about by the end of 2023?
Resolved Jan 1
Infra-Bayesianism: 0.7%
Outsourcing alignment of AI to other AI: 0.3%
Reinforcement Learning from Human Feedback (RLHF): 0.3%
Transparency tools: 0.3%
Imitative amplification: 0.3%
Intermittent oversight: 0.3%
Relaxed adversarial training: 0.3%
Approval-based amplification: 0.3%
Microscope AI: 1.1%
STEM AI: 0.3%
Narrow reward modeling: 0.4%
Recursive reward modeling: 0.4%
AI safety via debate with transparency tools: 0.4%
Amplification with auxiliary RL objective: 0.4%
Shard theory mechanistic interpretability: 6%
Hodgepodge alignment: 0.4%
Cyborgism