The Fight Against AI Comes to a Foundational Data Set

Jun-13-2024, 15:21:11 GMT–WIRED

Danish media outlets have demanded that the nonprofit web archive Common Crawl remove copies of their articles from past data sets and stop crawling their websites immediately. Common Crawl plans to comply with the request, first issued on Monday. Executive director Rich Skrenta says the organization is "not equipped" to fight media companies and publishers in court. It made the request on behalf of four media outlets, including Berlingske Media and the daily newspaper Jyllands-Posten. The New York Times made a similar request of Common Crawl last year, prior to filing a lawsuit against OpenAI for using its work without permission.

foundational data set, new york time, publisher, (9 more...)

WIRED

Jun-13-2024, 15:21:11 GMT

News Web Page

Add feedback

Country:
- Europe > Denmark (0.06)
- North America > United States
  - California > San Francisco County > San Francisco (0.06)

Industry:
- Media > News (0.95)
- Law > Litigation (0.57)

Technology:
- Information Technology > Artificial Intelligence
  - Issues > Social & Ethical Issues (0.40)
  - Natural Language
    - Large Language Model (0.39)
    - Chatbot (0.39)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (0.41)