Firstpost
Worrying times for AI ahead? Major tech companies are running out of data to train LLMs

FP Staff • November 14, 2023, 12:08:25 IST

AI needs a lot of data to stay up to date. However, most AI studios developing LLMs are running out of natural, human-generated data and now risk having to work with AI-generated data instead. One of the problems facing AI studios is the refusal to pay for new natural data.

Data is the linchpin of the AI economy. It is the lifeblood of AI models, shaping both their basic functionality and their overall quality: the more abundant and diverse the human-generated data an AI system is exposed to, the more capable it becomes. Natural data, however, is finite, and that casts a shadow over AI companies.

For nearly a year, AI researchers have warned that the well of natural data needed to train AI systems is running dry. Rita Matulionyte, a professor of information technology law at Macquarie University in Australia, highlights this concern in an essay for The Conversation. A study by the AI forecasting organisation Epoch AI puts a timeline on the problem: it estimates that AI companies could face a shortage of high-quality textual training data as early as 2026, with low-quality text and image data potentially running out between 2030 and 2060.

This scarcity poses a substantial threat to AI firms that rely on a continuous influx of data to improve their models. Progress in AI has so far tracked the increasing volumes of data fed into models. If that supply stagnates, the consequences could be felt throughout the industry.

Matulionyte suggests a potential remedy in synthetic data, which is generated by AI models. The viability of this solution is contested, however: research indicates a risk of an “inbreeding effect” that distorts a model trained on AI-generated content. Despite these challenges, some companies are already exploring synthetic training sets.

A more pragmatic alternative is the data partnership. Companies or institutions that hold vast repositories of high-quality data could agree to share it with AI companies, often in exchange for financial compensation. OpenAI, a prominent Silicon Valley AI firm, recently launched a Data Partnership initiative. In a blog post, the company underscores the significance of such collaborations in steering the future of AI and creating models that are more relevant to diverse organizations.

As the race for data intensifies, the practicality of data partnerships becomes a focal point. Many AI datasets currently derive from internet-scraped data created by online users, which makes data partnerships a plausible solution. Yet as data grows more valuable, competition for datasets is set to intensify, raising questions about how willing institutions and individuals will be to share their data with AI companies.

Even with data partnerships, it remains uncertain whether the data supply is sustainable. However boundless the internet may seem, the prospect of dwindling data reserves forces a reassessment of the assumption that this critical resource is endless.

(With input from agencies)

Tags
artificial intelligence Datasets Large Language Models