Firstpost
  • Home
  • Video Shows
    Vantage Firstpost America Firstpost Africa First Sports
  • World
    US News
  • Explainers
  • News
    India Opinion Cricket Tech Entertainment Sports Health Photostories
  • Asia Cup 2025
Apple Incorporated Modi ji Justin Trudeau Trending

Sections

  • Home
  • Live TV
  • Videos
  • Shows
  • World
  • India
  • Explainers
  • Opinion
  • Sports
  • Cricket
  • Health
  • Tech/Auto
  • Entertainment
  • Web Stories
  • Business
  • Impact Shorts

Shows

  • Vantage
  • Firstpost America
  • Firstpost Africa
  • First Sports
  • Fast and Factual
  • Between The Lines
  • Flashback
  • Live TV

Events

  • Raisina Dialogue
  • Independence Day
  • Champions Trophy
  • Delhi Elections 2025
  • Budget 2025
  • US Elections 2024
  • Firstpost Defence Summit
Trending:
  • PM Modi in Manipur
  • Charlie Kirk killer
  • Sushila Karki
  • IND vs PAK
  • India-US ties
  • New human organ
  • Downton Abbey: The Grand Finale Movie Review
fp-logo
Google DeepMind unveils a new AI model V2A that can generate soundtrack and dialogues for videos
Whatsapp Facebook Twitter
Whatsapp Facebook Twitter
Apple Incorporated Modi ji Justin Trudeau Trending

Sections

  • Home
  • Live TV
  • Videos
  • Shows
  • World
  • India
  • Explainers
  • Opinion
  • Sports
  • Cricket
  • Health
  • Tech/Auto
  • Entertainment
  • Web Stories
  • Business
  • Impact Shorts

Shows

  • Vantage
  • Firstpost America
  • Firstpost Africa
  • First Sports
  • Fast and Factual
  • Between The Lines
  • Flashback
  • Live TV

Events

  • Raisina Dialogue
  • Independence Day
  • Champions Trophy
  • Delhi Elections 2025
  • Budget 2025
  • US Elections 2024
  • Firstpost Defence Summit
  • Home
  • Tech
  • Google DeepMind unveils a new AI model V2A that can generate soundtrack and dialogues for videos

Google DeepMind unveils a new AI model V2A that can generate soundtrack and dialogues for videos

FP Staff • June 19, 2024, 13:00:51 IST
Whatsapp Facebook Twitter

V2A is designed to work seamlessly with Veo, Google’s text-to-video model that was showcased at Google I/O 2024. V2A uses a diffusion model trained on a mix of sounds, dialogue transcripts, and videos

Advertisement
Subscribe Join Us
Add as a preferred source on Google
Prefer
Firstpost
On
Google
Google DeepMind unveils a new AI model V2A that can generate soundtrack and dialogues for videos
One of the coolest features of V2A is its ability to generate an unlimited number of soundtracks for any video. Users can tweak the audio output with 'positive prompts' and 'negative prompts' to get the sound just right. Image Credit: Pexels

Creating videos from text prompts is becoming more simple thanks to models like Sora, Dream Machine, Veo, and Kling. However, many of these tools have a major drawback: they can’t generate sound, leaving us with silent videos. But Google DeepMind is stepping up to tackle this issue with their latest innovation: a new AI model that can create soundtracks and dialogue for videos.

In a recent blog post, Google DeepMind introduced V2A (Video-to-Audio), an exciting AI model that merges video visual cues with text prompts to create rich sound and audio. This new technology aims to transform the way we create and experience AI-generated videos, by adding dramatic music, realistic sound effects, and matching dialogue.

STORY CONTINUES BELOW THIS AD

V2A is designed to work seamlessly with Veo, Google’s text-to-video model that was showcased at Google I/O 2024. This combination allows users to enhance their videos not just visually but also audibly. V2A can add sound to anything from modern videos created with Veo to silent films and old archival footage, bringing them to life in a whole new way.

More from Tech
How ChatGPT is becoming everyone’s BFF and why that’s dangerous How ChatGPT is becoming everyone’s BFF and why that’s dangerous America ready for self-driving cars, but it has a legal problem America ready for self-driving cars, but it has a legal problem

One of the coolest features of V2A is its ability to generate an unlimited number of soundtracks for any video. Users can tweak the audio output with ‘positive prompts’ and ’negative prompts’ to get the sound just right. Plus, every piece of generated audio is watermarked with SynthID technology to ensure it’s original and authentic.

V2A uses a diffusion model trained on a mix of sounds, dialogue transcripts, and videos. While the model is powerful, it wasn’t trained on a massive number of videos, so sometimes the audio might come out a bit off. Because of this, and to prevent any potential misuse, Google isn’t planning to release V2A to the public anytime soon.

Impact Shorts

More Shorts
America ready for self-driving cars, but it has a legal problem

America ready for self-driving cars, but it has a legal problem

Alibaba, Baidu begin using own AI chips as China shifts away from US tech amid Nvidia row

Alibaba, Baidu begin using own AI chips as China shifts away from US tech amid Nvidia row

The introduction of V2A by Google DeepMind is a significant step forward in video creation technology. By adding sound and dialogue, V2A fills a crucial gap, making videos more immersive and engaging. Although it’s still in the works and not yet available for public use, V2A shows incredible promise for the future of video production.

STORY CONTINUES BELOW THIS AD
End of Article
Latest News
Find us on YouTube
Subscribe
End of Article

Impact Shorts

America ready for self-driving cars, but it has a legal problem

America ready for self-driving cars, but it has a legal problem

US self-driving cars may soon ditch windshield wipers as the NHTSA plans to update regulations by 2026. State-level rules vary, complicating nationwide deployment. Liability and insurance models are also evolving with the technology.

More Impact Shorts

Top Stories

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

Russian drones over Poland: Trump’s tepid reaction a wake-up call for Nato?

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

As Russia pushes east, Ukraine faces mounting pressure to defend its heartland

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Why Mossad was not on board with Israel’s strike on Hamas in Qatar

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Turkey: Erdogan's police arrest opposition mayor Hasan Mutlu, dozens officials in corruption probe

Top Shows

Vantage Firstpost America Firstpost Africa First Sports
Latest News About Firstpost
Most Searched Categories
  • Web Stories
  • World
  • India
  • Explainers
  • Opinion
  • Sports
  • Cricket
  • Tech/Auto
  • Entertainment
  • IPL 2025
NETWORK18 SITES
  • News18
  • Money Control
  • CNBC TV18
  • Forbes India
  • Advertise with us
  • Sitemap
Firstpost Logo

is on YouTube

Subscribe Now

Copyright @ 2024. Firstpost - All Rights Reserved

About Us Contact Us Privacy Policy Cookie Policy Terms Of Use
Home Video Shorts Live TV