Support the fact-based journalism you rely on with a donation to Marketplace today. Give Now!

The economy and ethics of AI training data

Jan 31, 2024
Many artificial intelligence tools were trained on freely-available digital content. That might be legal, but is it ethical?
By publishing something on the internet without explicitly telling other computers to avoid it, you're consenting to its use by AI, says Common Crawl's Rich Skrenta.
Outflow Designs/Getty Images

New York Times suit may test copyright law's constraints on AI

Dec 28, 2023
Where's the line between fair use and commercial exploitation when it comes to scraping the web to train artificial intelligence models?
Most AI models are trained on data sets scraped from the internet. OpenAI trained its chatbot, ChatGPT, with data that included Times content, the lawsuit says.
Sebastien Bozon/AFP via Getty Images