OpenAI Faces Copyright Lawsuit Over AI Training Data

The article discusses a copyright lawsuit filed against OpenAI by Suchir Balaji, a former employee, alleging that the company used copyrighted material to train its AI models without proper licensing. Balaji claims that OpenAI scraped large portions of the internet, including copyrighted books, articles, and websites, to create its training datasets. This practice, known as “data laundering,” raises concerns about the legality of using copyrighted material without permission. The lawsuit seeks class-action status and could have significant implications for the AI industry, as many companies rely on web-scraped data for training their models. OpenAI has not yet responded to the allegations, but the case highlights the need for clearer guidelines and regulations around the use of copyrighted material in AI training.

Source: https://www.businessinsider.com/suchir-balaji-named-openai-copyright-court-case-ai-training-2024-12