Is Q* the A star pathfinding search combined with Q-learning, as claimed in a "tin hat time" tweet?
Is Q* the A star pathfinding search combined with Q-learning, as claimed in a "tin hat time" tweet?
36
1kṀ2352Jan 1
25%
chance
1H
6H
1D
1W
1M
ALL
https://twitter.com/natolambert/status/1727474191925182849
Doesn't have to be exact, just close enough to be informative
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
People are also trading
Sort by:
predictedNO 1y
Yes, if Q* includes A*, Q-learning, and other stuff this pays out. "Doesn't have to be exact, just close enough to be informative"
Wouldn’t that basically be a muzero-esque algorithm? Mcts with pruning via your value function (the llm)
I'm against these kinds of markets. I don't think we should have markets that encourage the leaking of details about potentially dangerous capabilities advances.
predictedNO 1y
@ChrisLeong I find it hard to believe that anyone will commit such a serious crime for mana when they could commit the exact same crime but sell the information to Google.