CBC News analysis finds thousands of Canadian authors, books in controversial dataset used to train AI – CBC

Dec 07, 2023

Margaret Atwood, Gordon Korman, Alice Munro top list of Canadian writers with most books in data trove

A CBC News investigation has found at least 2,500 copyrighted books written by more than 1,200 Canadian authors were shared online as part of a massive — and now defunct — dataset used for artificial intelligence training and research purposes.

The dataset’s existence and general highlights were revealed earlier this year in The Atlantic. That report prompted an avalanche of writers to express shock on social media that their work had been included without their permission, and to share concerns that AI tools could use information from the dataset to generate content in their distinct artistic voices.

A CBC News analysis of the dataset, called Books3, identified thousands of Canadian authors and books in both official languages.

Read More: https://www.cbc.ca/news/canada/canadian-authors-books3-ai-dataset-1.7050243
