πŸ• Will AI Achieve Significantly More, "Embodiment" by end of 2023?
πŸ• Will AI Achieve Significantly More, "Embodiment" by end of 2023?
34
630αΉ€4663
resolved Jan 2
Resolved
NO

Preface:

Please read the preface for this type of market and other similar third-party validated AI markets here.

Third-Party Validated, Predictive Markets: AI Theme

Market Description

AI2-THOR Rearrangement Challenge

This question pertains to the following AI challenge:

https://github.com/allenai/ai2thor-rearrangement#----2022-ai2-thor-rearrangement-challenge

  • The goal of this challenge is to build a model/agent that move objects in a room to restore them to a given initial configuration.

  • Example query:

task involves moving and modifying (i.e. opening/closing) randomly placed objects within a room to obtain a goal configuration. There are 2 phases:

  1. Walkthrough πŸ‘€. The agent walks around the room and observes the objects in their ideal goal state.

  2. Unshuffle πŸ‹. After the walkthrough phase, we randomly change between 1 to 5 objects in the room. The agent's goal is to identify which objects have changed and reset those objects to their state from the walkthrough phase. Changes to an object's state may include changes to its position, orientation, or openness.

The resolution for this market will be here:

https://leaderboard.allenai.org/ithor_rearrangement_1phase_2022/submissions/public

robot rearranging a room

Market Resolution Threshold:

  • If any 2022 AI2-THOR Rearrangement Challenge Submission Get a % Fixed Strict (Test) Score of >0.4 by end of 2023, this resolves as YES, otherwise NO.


Mar 22, 12:25pm: Will Any 2022 AI2-THOR Rearrangement Challenge Submission Get a % Fixed Strict (Test) Score of >0.4 by end of 2023? β†’ Will AI Achieve Significantly More, "Embodiment" by end of 2023?

Get
αΉ€1,000
to start trading!

πŸ… Top traders

#NameTotal profit
1αΉ€156
2αΉ€150
3αΉ€88
4αΉ€50
5αΉ€46


Sort by:
predictedNO 1y

Same question for 2024: https://manifold.markets/PatrickDelaney/-will-ai-achieve-significantly-more ... though I have not set the threshold yet.

predictedNO 1y

the only other contest I could find with a quick search was this one: https://leaderboard.allenai.org/ithor_rearrangement_2phase_2022/submissions/public ... which had a score of 0.2894 as the top one thus far in 2023, so I'm resolving NO. Please let me know if you object.

predictedNO 1y

@PatrickDelaney For this market it doesn't change the outcome, but in general I think you should only use the specific contest variant you were supposed to use for resolution.

predictedNO 1y

@na_pewno I agree. I was just posting that as further evidence for anyone who may have come by and pointing this contest out, but yes, I completely agree, and the purpose of this market was to stay strict to the above metric chosen.

I also think it's interesting (though not relevant to market resolution) to just stay informed on this stuff. AllenAI claims now that there is going to be an, "exponential improvement," in embodiment due to something called ProcThor (question posted here https://manifold.markets/PatrickDelaney/-will-ai-achieve-significantly-more)

predictedNO 2y

Another one, more of a cross section based on Google Big Bench:

predictedNO 2y

Other relevant markets:

What is this?

What is Manifold?
Manifold is the world's largest social prediction market.
Get accurate real-time odds on politics, tech, sports, and more.
Or create your own play-money betting market on any question you care about.
Are our predictions accurate?
Yes! Manifold is very well calibrated, with forecasts on average within 4 percentage points of the true probability. Our probabilities are created by users buying and selling shares of a market.
In the 2022 US midterm elections, we outperformed all other prediction market platforms and were in line with FiveThirtyEight’s performance. Many people who don't like betting still use Manifold to get reliable news.
αΉ€Why use play money?
Mana (αΉ€) is the play-money currency used to bet on Manifold. It cannot be converted to cash. All users start with αΉ€1,000 for free.
Play money means it's much easier for anyone anywhere in the world to get started and try out forecasting without any risk. It also means there's more freedom to create and bet on any type of question.
Β© Manifold Markets, Inc.β€’Termsβ€’Privacy