Firstpost
  • Home
  • Video Shows
    Vantage Firstpost America Firstpost Africa First Sports
  • World
    US News
  • Explainers
  • News
    India Opinion Cricket Tech Entertainment Sports Health Photostories
  • Asia Cup 2025
Apple Incorporated Modi ji Justin Trudeau Trending

Sections

  • Home
  • Live TV
  • Videos
  • Shows
  • World
  • India
  • Explainers
  • Opinion
  • Sports
  • Cricket
  • Health
  • Tech/Auto
  • Entertainment
  • Web Stories
  • Business
  • Impact Shorts

Shows

  • Vantage
  • Firstpost America
  • Firstpost Africa
  • First Sports
  • Fast and Factual
  • Between The Lines
  • Flashback
  • Live TV

Events

  • Raisina Dialogue
  • Independence Day
  • Champions Trophy
  • Delhi Elections 2025
  • Budget 2025
  • US Elections 2024
  • Firstpost Defence Summit
Trending:
  • PM Modi in Manipur
  • Charlie Kirk killer
  • Sushila Karki
  • IND vs PAK
  • India-US ties
  • New human organ
  • Downton Abbey: The Grand Finale Movie Review
fp-logo
OpenAI’s GPT-4o update: ChatGPT, other AI apps can now talk, laugh, sing and read like a human
Whatsapp Facebook Twitter
Whatsapp Facebook Twitter
Apple Incorporated Modi ji Justin Trudeau Trending

Sections

  • Home
  • Live TV
  • Videos
  • Shows
  • World
  • India
  • Explainers
  • Opinion
  • Sports
  • Cricket
  • Health
  • Tech/Auto
  • Entertainment
  • Web Stories
  • Business
  • Impact Shorts

Shows

  • Vantage
  • Firstpost America
  • Firstpost Africa
  • First Sports
  • Fast and Factual
  • Between The Lines
  • Flashback
  • Live TV

Events

  • Raisina Dialogue
  • Independence Day
  • Champions Trophy
  • Delhi Elections 2025
  • Budget 2025
  • US Elections 2024
  • Firstpost Defence Summit
  • Home
  • Tech
  • OpenAI’s GPT-4o update: ChatGPT, other AI apps can now talk, laugh, sing and read like a human

OpenAI’s GPT-4o update: ChatGPT, other AI apps can now talk, laugh, sing and read like a human

Mehul Reuben Das • May 14, 2024, 10:24:38 IST
Whatsapp Facebook Twitter

The new AI model is a massive step up from GPT-4, despite not being a generational update. The new update makes the AI model more like a human, in the sense that it can accept inputs in any form — images, texts, audio — and answer in as many formats

Advertisement
Subscribe Join Us
Add as a preferred source on Google
Prefer
Firstpost
On
Google
OpenAI’s GPT-4o update: ChatGPT, other AI apps can now talk, laugh, sing and read like a human
OpenAI's GPT-4o is a massive leave in their GPT model, and far more capable and powerful than anything we have seen till now, even though, this is not a generational upgrade. Image Credit: OpenAI

OpenAI made waves on Monday with the announcement of their latest flagship model, GPT-4o, or GPT-4 Omni. This new model promises enhanced capabilities for all users, boasting smarter and faster real-time voice interactions.

Why is this important? Well, tech giants like Google, Microsoft, and Apple are all pivoting towards a future powered by generative AI. With OpenAI leading the charge, they’re determined to maintain their position at the forefront of this rapidly evolving landscape.

STORY CONTINUES BELOW THIS AD

According to CTO Mira Murati, the standout feature of GPT-4o is its accessibility — it brings the formidable intelligence of GPT-4 to all users, including those on the free tier. During a livestream presentation, Murati emphasized that GPT-4o, denoted by the letter “o” for “Omni,” represents a significant leap forward in terms of user-friendliness and speed.

More from Tech
How ChatGPT is becoming everyone’s BFF and why that’s dangerous How ChatGPT is becoming everyone’s BFF and why that’s dangerous America ready for self-driving cars, but it has a legal problem America ready for self-driving cars, but it has a legal problem

The latest model from OpenAI, GPT-4o, is breaking boundaries by accepting input in various forms—text, audio, and images—and seamlessly generating output in all three formats. But that’s not all—this advanced AI is equipped with emotion recognition, allowing it to gauge the feelings behind your input. Plus, you can interrupt it mid-speech, and it responds with lightning speed, almost keeping pace with human conversation. With its multifaceted abilities and human-like responsiveness, GPT-4o is revolutionizing the way we interact with AI.

Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: https://t.co/MYHZB79UqN

Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks. pic.twitter.com/uuthKZyzYx

— OpenAI (@OpenAI) May 13, 2024

Impact Shorts

More Shorts
America ready for self-driving cars, but it has a legal problem

America ready for self-driving cars, but it has a legal problem

Alibaba, Baidu begin using own AI chips as China shifts away from US tech amid Nvidia row

Alibaba, Baidu begin using own AI chips as China shifts away from US tech amid Nvidia row

But the excitement doesn’t stop there. Murati hinted at an even more substantial update to the model—think GPT-5 — set to be unveiled later this year, promising further advancements.

In a demonstration, OpenAI showcased the real-time capabilities of ChatGPT’s voice assistant, highlighting faster responses and the ability to seamlessly interrupt the AI. This glimpse into the future of AI-driven interactions underscores the potential of GPT-4o to revolutionize how we engage with technology.

STORY CONTINUES BELOW THIS AD

During one demonstration, OpenAI showcased a real-time tutorial on deep breathing, illustrating the potential for practical guidance using GPT-4o.

In another demo, ChatGPT displayed its versatility by reading an AI-generated story in various voices, ranging from dramatic recitals to robotic tones and even singing.

A third demonstration showcased ChatGPT’s problem-solving prowess, as it helped a user work through an algebra equation instead of simply providing an answer.

Throughout the demos, GPT-4o exhibited significantly enhanced personality and conversational abilities compared to previous iterations.

OpenAI also demonstrated the chatbot’s ability to seamlessly switch between languages, facilitating translations between English and Italian in real-time.

These demonstrations underscored ChatGPT’s multimodal capabilities, spanning visual, audio, and text interactions. The AI assistant was able to utilize a phone’s camera to read written notes and even attempt to discern the emotions of individuals.

In the broader context, the online event coincided with Google’s upcoming I/O developer conference, where advancements in generative AI are expected to take centre stage.

STORY CONTINUES BELOW THIS AD

OpenAI has also announced the release of a desktop version of ChatGPT, initially catering to Mac users, with access rolling out to paid users starting today. While a Windows version is in the pipeline, OpenAI chose to prioritize Mac users initially due to their larger user base.

In addition, OpenAI revealed plans to grant free users access to custom GPTs and its GPT store. These capabilities will be phased in over the coming weeks.

The rollout of GPT-4o’s text and image capabilities has commenced for paid ChatGPT Plus and Team users, with availability for Enterprise users on the horizon. Free users will also begin gaining access, albeit with rate limits.

Furthermore, the voice version of GPT-4o is set to become available “in the coming weeks,” expanding its utility beyond text-based interactions.

Developers can expect to leverage GPT-4o’s text and vision modes, with audio and video capabilities slated for release to “a small group of trusted partners” in the near future.

STORY CONTINUES BELOW THIS AD

In a tweet, OpenAI confirmed that the enigmatic GPT2-chatbot spotted on a benchmarking site is indeed GPT-4o, shedding light on its identity.

End of Article
Latest News
Find us on YouTube
Subscribe
End of Article

Impact Shorts

America ready for self-driving cars, but it has a legal problem

America ready for self-driving cars, but it has a legal problem

US self-driving cars may soon ditch windshield wipers as the NHTSA plans to update regulations by 2026. State-level rules vary, complicating nationwide deployment. Liability and insurance models are also evolving with the technology.

More Impact Shorts

Top Stories

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Top Shows

Vantage Firstpost America Firstpost Africa First Sports
Latest News About Firstpost
Most Searched Categories
  • Web Stories
  • World
  • India
  • Explainers
  • Opinion
  • Sports
  • Cricket
  • Tech/Auto
  • Entertainment
  • IPL 2025
NETWORK18 SITES
  • News18
  • Money Control
  • CNBC TV18
  • Forbes India
  • Advertise with us
  • Sitemap
Firstpost Logo

is on YouTube

Subscribe Now

Copyright @ 2024. Firstpost - All Rights Reserved

About Us Contact Us Privacy Policy Cookie Policy Terms Of Use
Home Video Shorts Live TV