Will Jesse Hoogland Believe the Project of Scoping Devinterp Has Made Meaningful Progress Towards Alignment by March?
6
86
210
resolved Feb 29
Resolved
YES

From the post Towards Developmental Interpretability:
The high-level near-term plan (as of July 2023) for developmental interpretability:

  • Phase 1: sanity checks (six months). Assemble a library of examples of phase transitions over training, analyze each of them with our existing tools to validate the key ideas.

  • Phase 2: build new tools. Jointly develop theoretical and experimental measures that give more refined information about structure formed in phase transitions.

I will ask @JesseHoogland soon after the resolution date what he thinks.

Get Ṁ200 play money

🏅 Top traders

#NameTotal profit
1Ṁ74
2Ṁ16
Sort by:

Jesse told me I can resolve this market. See also: https://www.lesswrong.com/posts/Quht2AY6A5KNeZFEA/timaeus-s-first-four-months