Meta introduces generative AI model for speech 'Voicebox'

Jun 18, 2023, 14:10 IST

San Francisco, June 18 (IANS) Meta has developed a cutting-edge generative AI model 'Voicebox', designed to revolutionise the field of speech generation.

"We've developed Voicebox, the first model that can generalise to speech-generation tasks it was not specifically trained to accomplish with state-of-the-art performance," Meta said in a blogpost.

According to the company, Voicebox generates images and text in a variety of styles, and it can create outputs from scratch or modify samples provided to it.

However, instead of creating a picture or a passage of text, Voicebox produces high-quality audio clips.

The model supports speech synthesis across six languages, including English, French, German, Spanish, Polish, and Portuguese, as well as performs noise removal, content editing, style conversion, and diverse sample generation.

Moreover, Meta said that Voicebox uses a new approach to learn just from raw audio and an accompanying transcription.

Unlike autoregressive models for audio generation, Voicebox can modify any part of a given sample, not just the end of an audio clip it is given.

Further, the tech giant said that Voicebox is trained to predict a speech segment when given the surrounding speech and the transcript of the segment.

Once the model has learned to infill speech from context, it can be applied across a wide range of speech generation tasks, including generating portions of an audio recording without re-creating the entire recording.

This versatility enables Voicebox to perform well across a variety of tasks, including -- in-context text-to-speech synthesis, cross-lingual style transfer, speech denoising and editing, and diverse speech sampling.

Disclaimer: This story has not been edited by the Sakshi Post team and is auto-generated from syndicated feed.

Read More:

Vavilala Chidvilas Reddy from Telangana Tops JEE Advanced 2023

Tags:

Technology News

Meta introduces generative AI model for speech 'Voicebox'

Salesforce India CEO Arundhati Bhattacharya to lead operations in ASEAN from Feb 1

Sugary drinks can raise risk of stroke, heart failure: Study

IIT Guwahati’s new tech to convert methane, CO2 to biofuel using bacteria

Cyber Fraud: Hyderabad Man Arrested After Chasing for 2500 Kms

Google to tie up with NCERT, launch YouTube channels in 29 Indian languages

Salesforce India CEO Arundhati Bhattacharya to lead operations in ASEAN from Feb 1

Holidays: Anti Scam Tools Launched by Meta

Infosys Narayana Murthy Buys Rs 50 Crore Luxury Apartment in Bengaluru

Zepto HR Head Resigns, CEO Aadit Palicha Takes Over HR Duties

Sugary drinks can raise risk of stroke, heart failure: Study

Jagan Launches All-Out Attack on Chandrababu and the Coalition Government in AP

What Are The Main Challenges In India’s Real Estate Sector? | NAR India 2019

What Are The Main Challenges In India’s Real Estate Sector? | NAR India 2019

You are here

Meta introduces generative AI model for speech 'Voicebox'

What Are The Main Challenges In India’s Real Estate Sector? | NAR India 2019