OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling’s Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.
that’s not exactly what’s in dispute— the prodcut that LLMs produce. That would probably be ruled as a derivative work under the DMCA’s “Fair Use” clause, and, therefore, public domain.
the issue at hand is that the company accessed the copyrighted material without paying for it and is now using that training to earn more money without fair compensation.
these language models or even proper AI can’t create original creative works the way a human can. The best it can do it create a pastiche or composition that simulates originality but is really just a jumble of recycled ideas that it’s been trained on. There’s a fair argument to be made that the owners of the copyrights of those pesos works are entitled to fair compensation, especially since, otherwise, AI will just be a tool used by companies to churn out profit off the work of others.