Google's alleged YouTube scraping could be the biggest AI controversy yet

Google logo, YouTube logo and ChatGPT logo
(Image credit: Google/OpenAI)

AI has arguably become the biggest – and most contentious – topic in the world of art and design in recent years. For every impressive example of prompt-generated images or video, there's a question of ethics and copyright, not to mention the perceived existential threat to creative jobs. 

Many of the biggest brands creating AI tech have been eager to emphasise their ethical credentials, with Adobe Firefly, for example, committing to transparency and authenticity by only training its models on commercially available content. But according to a new exposé, Google appears to consider the entire of YouTube to be fair game. 

iPhone running ChatGPT

The report claims ChatGPT was trained on millions of YouTube videos (Image credit: Apple/Future)

The New York Times claims that OpenAI (of ChatGPT fame) trained its Whisper speech recognition tool on millions of YouTube videos, with the transcripts used to train ChatGPT 4.

The most damning claim, though, is that Google was aware of the practice, but did not intervene, despite it contravening YouTube's own policies on unauthorised content scraping. This, the report claims, is because Google was already training its own AI, Gemini, on YouTube videos. Matt Bryant, a spokesperson for Google, told the New York Times Google did not know OpenAI was training ChatGPT on YouTube videos, but the report suggests several people at Google were aware of it, and did not take action because the company itself was doing the same thing.

The suggestion that two major AI players have trained their AI models on millions of YouTube videos will do nothing to allay the fears of those who AI is committing mass copyright infringement

The report echoes the outcry over recently leaked document which revealed that MidJourney was trained on the work of over 16,000 artists. But with both OpenAI and Google implicated in this new report, we could be looking at the most significant AI controversy yet.

Thank you for reading 5 articles this month* Join now for unlimited access

Enjoy your first month for just £1 / $1 / €1

*Read 5 free articles per month without a subscription

Join now for unlimited access

Try first month for just £1 / $1 / €1

Daniel John
Design Editor

Daniel John is Design Editor at Creative Bloq. He reports on the worlds of design, branding and lifestyle tech, and has covered several industry events including Milan Design Week, OFFF Barcelona and Adobe Max in Los Angeles. He has interviewed leaders and designers at brands including Apple, Microsoft and Adobe. Daniel's debut book of short stories and poems was published in 2018, and his comedy newsletter is a Substack Bestseller.