
Background on me: I rate myself at somewhere between 75th and 95th percentile skill in machine learning engineering [amongst employed professionals of that job title]. I rate myself at >95th percentile knowledge/skill in the overlap space of machine learning and neuroscience. Disclaimer: I may be wrong about these estimates, this is just information for you to use about my opinion of myself.
Resolution criteria: I must either have successfully done this, or it must be obvious that someone of my equivalent skill level could accomplish this. Coding a seed AI that recursively self-improves to AGI counts, if the improvement is accomplished within budget and timeframe.
Criteria for AGI pretty vauge, but I will try to resolve honestly in way which I believe would be convincing to most neutral parties as of creation of this question. Does not require capability of manipulating robotics for real world tasks, but does require all cogntive skills (long term planning, theory of mind, contextual awareness, episodic and declarative memory, etc). At the very least this would require demonstrating use of a virtual mouse & keyboard to accomplish complex multistage tasks on a web browser (e.g. open an email account and send emails, download a github repo and run the code therein, rent cloud computing resources and utilize them to train ML models, write a research paper about a previously unknown topic), play any Steam game to at least 90th percentile capability, drive a small RC car with a camera through obstacles or on complex navigation tasks to at least 90th percentile human capability, write a 500 page novel with logically cohesive plot, get at least a 4 on any US highschool AP exam without external resources, score at least 90th percentile vs humans on all existing non-robotic AI benchmarks (avalance continual learning benchmarks, minecraft, BIG bench, text adventure games, etc), score at least 90th percentile vs humans on most Kaggle competitions, convincingly engage in protracted dialogue with adversarial questioners (Turing test).
May include open source code developed by others, including libraries and use of pre-trained open source models.