Does training LLMs or LTGMs on copyrighted material violate copyright? | Manifold

Does training LLMs or LTGMs on copyrighted material violate copyright?

20

Never closes

Yes

No

Specifically, do you think it violates US copyright law?

Companies like OpenAI and Stable Diffusion claim that their use of copyrighted material falls under Fair Use because it is transformative, provides many benefits to society, and would not be possible with only public domain training content.

Opponents claim that the scraping process does not fall under Fair Use because the purpose is commercial, creative, and harms the market for the original works. The training process likewise may be considered a copyright violation when the network weights end up containing the training data in a way that can be output verbatim.

Market context

AI Copyright Law

Get

1,000

to start trading!

Sort by:

Anyone voting NO on this want to explain why?

People are also trading

Illegal Agent-like LLM which automatically serves up links to copyrighted texts available by mid 2026

By 2027, will it be generally agreed upon that LLM produced text > human text for training LLMs?

By 2029 end, will it be generally agreed upon that LLM produced text/code > human text/code for training LLMs?

Will Generative AI trained on crawled art be illegal in 2027 because of copyright?

Will any widely used LLM be pre-trained with abstract synthetic data before 2030?

Related questions

Illegal Agent-like LLM which automatically serves up links to copyrighted texts available by mid 2026

By 2027, will it be generally agreed upon that LLM produced text > human text for training LLMs?

By 2029 end, will it be generally agreed upon that LLM produced text/code > human text/code for training LLMs?

Will Generative AI trained on crawled art be illegal in 2027 because of copyright?

Will any widely used LLM be pre-trained with abstract synthetic data before 2030?