Will LLM hallucination problems on "summarize/use this context to answer a question" be solved by April 2024?
May 1

Specifically, there exists today a common pattern: take an LLM and some search system. When the user asks a question, have the LLM feed it into search, get results, and then use those results to answer the original question. Examples of this include GPT Bing, Stripe Docs AI, etc. Sometimes these systems still run into hallucination problems (coming up with facts not supported by the provided context/text).
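The pattern described above can be sketched in a few lines. This is a toy illustration, not any real system's implementation: `search` and `ask_llm` are hypothetical stand-ins (a production system would call a real search index and an LLM API).

```python
# Minimal sketch of the retrieve-then-answer pattern described above.
# `search` and `ask_llm` are hypothetical placeholders, not a real API.

def search(query, corpus):
    """Naive keyword retrieval: return documents sharing a word with the query."""
    terms = set(query.lower().split())
    return [doc for doc in corpus if terms & set(doc.lower().split())]

def ask_llm(question, context):
    """Placeholder for the LLM call that answers *only* from the context."""
    if not context:
        return "I can't answer from the provided documents."
    return f"Based on: {context[0]!r}"

def answer(question, corpus):
    # 1. Feed the user's question into search.
    results = search(question, corpus)
    # 2. Use the search results as grounding context for the final answer.
    return ask_llm(question, results)

corpus = [
    "Stripe charges are created via the API.",
    "Bing indexes the web.",
]
print(answer("How are Stripe charges created?", corpus))
```

The hallucination problem arises at step 2: the model may emit claims that are not in `results`, which is exactly what this market is about.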

I'm predicting that, in my best judgement, this will be solved as of next year. "Solved" here means that in roughly 99.9% of cases this issue doesn't occur.

predicts YES

Additional detail: this does not include actively *trying* to make a system produce bad answers. The test is: "if I give it a document and a question, will it answer based on the document (or at least refuse to answer if it can't)?"

A system can't cheat by refusing to answer all questions. It should be "reasonably useful" in terms of how often it refuses to answer (sometimes the document simply doesn't contain an answer).
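The resolution criterion above could be checked with a harness like the following. Everything here is an illustrative assumption, not the market's official grading method: an answer "passes" if it is either an explicit refusal or fully grounded in the document, and the overall pass rate is compared against the ~99.9% bar.

```python
# Hypothetical harness for the criterion above: an answer passes if it is
# grounded in the document or is an explicit refusal. The grounding check
# (every content word appears in the document) is a deliberately toy rule.
import re

REFUSAL = "I can't answer from the provided documents."

def words(text):
    """Lowercased word tokens, ignoring punctuation."""
    return set(re.findall(r"\w+", text.lower()))

def passes(answer, document):
    return answer == REFUSAL or words(answer) <= words(document)

def hallucination_free_rate(cases):
    """cases: list of (answer, document) pairs; returns fraction that pass."""
    return sum(passes(a, d) for a, d in cases) / len(cases)

cases = [
    ("Stripe charges are created via the API", "Stripe charges are created via the API."),
    (REFUSAL, "Bing indexes the web."),          # refusal is allowed
    ("The moon is made of cheese", "Bing indexes the web."),  # hallucination
]
print(hallucination_free_rate(cases))  # 2 of 3 cases pass
```

A real evaluation would need an LLM or human grader for the grounding check, and would also track the refusal rate separately so the system can't game the metric by refusing everything.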