Will more than 5% of GPT-4’s training data be YouTube transcripts?
34
1kṀ3629Jun 2
11%
chance
1H
6H
1D
1W
1M
ALL
If there is an estimate as to what the training data of GPT-4, this market will resolve to YES if more than 5% of it contains YouTube transcripts. Raw YouTube videos don't count towards the resolution, if GPT-4 ends up being multimodal.
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
People are also trading
Did OpenAI transcribe Youtube videos to train a GPT model as claimed by NYT?
89% chance
Will OpenAI be sued (with standing) for using transcribed YouTube videos for GPT before 2026?
10% chance
How much compute will be used to train GPT-5?
What will be true about GPT-5?
What hardware will GPT-5 be trained on?
Will the ratio of inference runs to training runs on GPT5 decrease from the ratio on GPT4?
50% chance
Will OpenAI's SearchGPT reach 2% of US search engine market share during any month in 2024/2025?
4% chance
Will manifold be part of GPT5's training data?
76% chance
In 2028, Will a >5 min video completely generated by an AI have more than 1 billion views on Youtube?
33% chance
Will an AI generated YouTube video reach 5B views before 2027?
14% chance
Sort by:
@BionicD0LPH1N how will this resolve if the information is not publicly available? How long will you you wait for it to become available (I expect likely it never will)? is the current close date a deadline?
Useful: https://arxiv.org/abs/2101.00027 includes
youtube transcripts
People are also trading
Related questions
Did OpenAI transcribe Youtube videos to train a GPT model as claimed by NYT?
89% chance
Will OpenAI be sued (with standing) for using transcribed YouTube videos for GPT before 2026?
10% chance
How much compute will be used to train GPT-5?
What will be true about GPT-5?
What hardware will GPT-5 be trained on?
Will the ratio of inference runs to training runs on GPT5 decrease from the ratio on GPT4?
50% chance
Will OpenAI's SearchGPT reach 2% of US search engine market share during any month in 2024/2025?
4% chance
Will manifold be part of GPT5's training data?
76% chance
In 2028, Will a >5 min video completely generated by an AI have more than 1 billion views on Youtube?
33% chance
Will an AI generated YouTube video reach 5B views before 2027?
14% chance