WhiteBox Research will solve at least four of Neel Nanda's Concrete Open Problems by the end of September 2024.
7
Ṁ736
Sep 30
36%
chance

WhiteBox Research is currently holding an interpretability fellowship in Manila. 11 participants are currently in our fellowship’s guided research phase, 6 of whom will use Neel Nanda's 200 Concrete Open Problems to rapidly upskill in mechanistic interpretability. We are currently funded by the Long-Term Future Fund and Manifund.

Some of the fellows in our first cohort include:
- an IOI silver medalist
- an Iranian Geometry Olympiad bronze medalist
- an AI engineering technical manager and former trainer for the Philippine IOI team

Other fellows have also won first and third places in recent Apart Hackathons.

By 'solve', we mean producing a paper, post, or demonstration with the same amount of rigour or exceeding that of We Found An Neuron in GPT-2. The guided research phase of our fellowship started in 01 Jun and will technically end by 31 Aug, but fellows will be able to use our office to work on alignment-related projects afterwards.

Get Ṁ1,000 play money