OpenAI Secretly Trained GPT-4 With More Than a Million Hours of Transcribed YouTube Videos

from Futurism 3 weeks ago

We used publicly available data and licensed data. So, videos on YouTube?
Futurism: https://futurism.com/openai-gpt4-youtube

It's yet another data point illustrating how AI companies are relying on massive amounts of murky and possibly copyright-infringing data to train their models.

The practice has already led to a number of lawsuits, with rightsholders accusing companies including OpenAI and Microsoft of misattributing their practices to 'fair use,' a doctrine of US copyright law.

If OpenAI had in fact trained Sora on YouTube videos, that would be a 'clear violation' of the video platform's terms.


#openai #ai-training-data #copyright-infringement #legal-issues #youtube

