AGI's consensus definition reaches alignment as follows: current tests + new tests in planning+reasoning capabilities
Standard
2
แน€200
Jul 8
41%
chance

By consensus definition referring to what can be both quantitatively and qualitatively accepted by both majority of researchers + the broader population of what and when AGI is just commonly accepted to be.

Get
แน€1,000
and
S1.00
Sort by:

I don't understand what you're trying to say at all, and probably no one else does either, since no one here is betting on it ๐Ÿซค

AGI's consensus definition reaches alignment as follows: current tests + new tests in planning+reasoning capabilities

I don't understand this.

@singer the thesis would be broadly as follows:

1) current common evaluations are good for specific skills/knowledge based narrow intelligence assessments

2) seems like biggest perceived gap on LLM-based capabilities are within reasoning and planning, even when scaffolded with helpful cognitive prompting architectures like chain-of-thought or multi-agent systems
3) there doesn't seem to be evaluations in the current eval mix that would look at tests related to either planning or reasoning
4) there seems to be consensus on the ether that AGI has two definition classes (narrow/quantifiable and broad/qualitative), where the broad seems to be referred to the non-scientific perception of "this feels like AGI"

so when 2 would change and 3 would evolve, could then both cases of 4 be commonly agreed upon to be true, more or less both amongst scientists and culturally.

๐Ÿคทโ€โ™‚๏ธ๐Ÿ™‚๐Ÿ™ƒ๐Ÿ•บ

Your explanation does not make the question clearer, sorry.