Stunning new AI adds realistic sound effects to any video

"What if you could describe a sound and generate it with AI?" teases ElevenLabs as AI-generated sound effects and dialog are added to video footage generated by Sora from OpenAI

Last week, OpenAI unveiled a new AI model called Sora that can generate high-resolution video clips from text prompts. But the clips are all essentially clever silent films. Now ElevenLabs has added background sounds to Sora-created footage.

AI voice-cloning startup ElevenLabs was co-founded in 2022 by former Google machine-learning engineer Piotr Dabkowski and ex-Palantir deployment strategist Mati Staniszewski. It has since launched AI-powered text-to-speech software and an AI dubbing tool that "maintains the original voice tone and style" while auto-translating speech from a video into more than 20 languages.

Now the company is working on something new, which can reportedly generate sounds to accompany otherwise silent video footage based on descriptions of a scene given by a user.

It's a sound effects/foley team in a box, and to demonstrate its prowess, ElevenLabs has unleashed it on some Sora-generated content.

"We used text prompts like 'waves crashing,' 'metal clanging,' 'birds chirping,' and 'racing car engine' to generate audio that we overlaid onto some of our favorite clips from the OpenAI Sora announcement," explained the company in a blog post.

Sound Effects are Coming Soon to ElevenLabs

The nitty-gritty of Sound Effects by ElevenLabs has yet to be revealed, but the demo shows a bunch of Sora-generated video clips accompanied by fairly realistic background sounds – from footsteps on a busy street along with the hum of the city, to the beeps and mechanical drone of a bipedal robot of the future, to a film-like narrative in a Hollywood-style promo voice. All this apparently comes from text-to-audio prompts.

As with Sora, there will doubtless be some kinks that need working out, as well as protections against fraud and safety protocols to cook in, but with the pace of AI development so rapid, can we perhaps expect the Oscars for best everything to be awarded to an AI in the near future? Interesting (and possibly scary) times ahead.

There's been no word as yet on when we can expect the Sound Effects tech to land, but folks interested in learning more are being invited to register their interest.

Source: ElevenLabs (X/Twitter)

1 comment
Brian M
Quite eerie hearing the sound added to the same AI video clips previously shown in an earlier article here. Makes them seem so much more real.

As they leave, will the last human please switch off the light!