Begin typing your search above and press return to search.

UAE

Dubai police honour DXB Airport workers

3 May 2024 8:45 PM GMT

UAE

Sharjah's Mleiha National Park promises sustainable heritage,...

3 May 2024 8:34 PM GMT

UAE

Indian Consulate in Dubai resumes 'open house' forum for grievances

3 May 2024 8:20 PM GMT

Saudi Arabia

Saudi Arabia to issue entry permits for Makkah ahead of hajj season

3 May 2024 8:10 PM GMT

UAE

Air India Express launches new route connecting Ras Al Khaimah to...

3 May 2024 8:02 PM GMT

UAE

Dubai's Global Village extends opening hours for final days

3 May 2024 7:56 PM GMT

OPINION

Editorial

One year of Manipur and administrative negligence

3 May 2024 11:00 AM GMT

Editorial

Secular credential should be proven through actions

1 May 2024 8:38 AM GMT

Editorial

When the Brij Bhushans and Revannas are protected

30 April 2024 12:10 PM GMT

Editorial

The core of democracy is not mere trusting, but questioning

29 April 2024 4:01 AM GMT

Editorial

The disillusionment of the saffron brigades

27 April 2024 4:43 AM GMT

Editorial

The pro-Palestine protests on American campuses

26 April 2024 4:00 AM GMT

DEEP READ

Deep Read

The future of Oil and Gas - an industry that is not going to die...

5 April 2024 1:56 PM GMT

Deep Read

Towards Totalitarianism: from NJAC to Electoral Bonds

27 March 2024 9:20 AM GMT

Article

Ramadan: Its essence and lessons

13 March 2024 9:24 AM GMT

Deep Read

Hyderabad's Barkas community: solidarity amidst Palestinian tragedy

1 March 2024 11:55 AM GMT

Posted On

8 April 2024 2:24 PM GMT

Updated On

8 April 2024 2:24 PM GMT

OpenAI utilized YouTube videos to train GPT-4 AI model

This move, if proven true, could pose legal challenges for the AI firm, which is already entangled in multiple lawsuits regarding the use of copyrighted data.

New reports suggest that OpenAI employed data from YouTube videos, amounting to over a million hours, to train its latest AI model, GPT-4.

It's alleged that OpenAI resorted to utilizing transcribed data from YouTube videos after exhausting its existing text-word resources for training AI models.

This move, if proven true, could pose legal challenges for the AI firm, which is already entangled in multiple lawsuits regarding the use of copyrighted data. Recently, a report shed light on mini chatbots in OpenAI's GPT Store that reportedly violated the company's guidelines.

According to The New York Times, OpenAI developed an automatic speech recognition tool named Whisper to transcribe YouTube videos and utilize the data for training its models, after facing a shortage of unique text words. Whisper was publicly launched by OpenAI in September 2022, and the firm stated that it was trained on 6,80,000 hours of "multilingual and multitask supervised data collected from the web".

Unnamed sources familiar with the matter claimed that OpenAI employees deliberated over the potential breach of YouTube's guidelines and the risk of legal consequences. Notably, Google prohibits the use of its videos for applications external to the platform.

Despite the concerns, OpenAI allegedly proceeded with the plan, transcribing over a million hours of YouTube videos to feed the text into GPT-4. The report also alleges direct involvement from OpenAI President Greg Brockman, who reportedly assisted in data collection from videos.

OpenAI spokesperson Matt Bryant responded to the reports, calling them unconfirmed and denying any unauthorized scraping or downloading of YouTube content, citing the company's robots.txt files and Terms of Service.

Another spokesperson, Lindsay Held, mentioned that OpenAI utilizes various sources, including publicly available and non-public data partnerships, for its data sources. Additionally, Held stated that the AI firm is exploring the potential use of synthetic data for training future AI models.

Show Full Article

TAGS:AI Artificial Intelligence OpenAI

Uddhav Thackeray's resignation is not a matter of joy for us: Rebel...

Dubai police honour DXB Airport workers

Sharjah's Mleiha National Park promises sustainable heritage,...

Indian Consulate in Dubai resumes 'open house' forum for grievances

Saudi Arabia to issue entry permits for Makkah ahead of hajj season

Air India Express launches new route connecting Ras Al Khaimah to...

Dubai police honour DXB Airport workers

Sharjah's Mleiha National Park promises sustainable heritage,...

Indian Consulate in Dubai resumes 'open house' forum for grievances

Saudi Arabia to issue entry permits for Makkah ahead of hajj season

Air India Express launches new route connecting Ras Al Khaimah to...

Dubai's Global Village extends opening hours for final days

One year of Manipur and administrative negligence

Secular credential should be proven through actions

When the Brij Bhushans and Revannas are protected

The core of democracy is not mere trusting, but questioning

The disillusionment of the saffron brigades

The pro-Palestine protests on American campuses

Schools breeding hatred

Racial underpinnings of war

The future of Oil and Gas - an industry that is not going to die...

Towards Totalitarianism: from NJAC to Electoral Bonds

Ramadan: Its essence and lessons

Hyderabad's Barkas community: solidarity amidst Palestinian tragedy

OpenAI utilized YouTube videos to train GPT-4 AI model