Creators urge Ottawa to force disclosure of 'black box' AI system training
CTV
Canadian creators and publishers want the government to do something about the unauthorized and usually unreported use of their content to train generative artificial intelligence systems.
But AI companies maintain that using the material to train their systems doesn’t violate copyright, and say limiting its use would stymie the development of AI in Canada.
The two sides are making their cases in recently published submissions to a consultation on copyright and AI being undertaken by the federal government as it considers how Canada’s copyright laws should address the emergence of generative AI systems like OpenAI’s ChatGPT.
Generative AI can create text, images, videos and computer code based on a simple prompt, but to do that, the systems must first study vast amounts of existing content.
In its submission to the government, Access Copyright argued most and potentially all large language models "are currently profiting from unauthorized use and reproduction of copyright protected works."
That use is taking place in a "black box," according to Access Copyright, which represents writers, visual artists and publishers.
"Rightsholders know it is happening, but due to the information asymmetry between themselves and AI platforms, they cannot determine who is conducting the activity, with whose works, and have no mechanism to stop it from happening."