'Game of Thrones' author and others accuse ChatGPT maker of 'theft' in lawsuit

Washington Post - Technology News

The lawsuit is the latest salvo in the ongoing debate over how AI tools should be trained and whether the companies behind them owe anything to the original creators of the training data. Large language models are generally trained on billions of sentences of text pulled from the internet, including news stories, Wikipedia and comments on social media sites. OpenAI and other AI companies such as Google and Microsoft do not say specifically what data they use, but AI critics have long suspected that it includes well-known collections of pirated books that have circulated online for years.