
[Redwood Research] Will we submit the bash control project to NeurIPS?
14
1kṀ629resolved May 19
Resolved
NO1H
6H
1D
1W
1M
ALL
We're working on a follow-up to AI Control: Improving Safety Despite Intentional Subversion, in a language model agent shell programming setting. We intend to write this up and submit it to NeurIPS. Will we succeed?
Currently, we have a preliminary dataset, and we've done some back-and-forth on trusted monitoring.
The main reason we wouldn't submit is that we don't think our results are sufficiently solid by then. We won't submit if we think it's very unlikely (<20%) the paper will be accepted.
(Feel free to message me if you want to beta read the paper.)
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ164 | |
2 | Ṁ17 | |
3 | Ṁ14 | |
4 | Ṁ6 | |
5 | Ṁ1 |