Tech | Visa | Scholarship/School | Info Place

Resemble AI launches tool to create AI voice clones in one minute

Join us in Atlanta on April 10 to explore the future of a safe workforce. We’ll explore the vision, benefits, and use cases of artificial intelligence for security teams. Request an invitation here.


Resemble AI is launching Rapid Voice Cloning, a new feature of its platform that significantly speeds up the process of generating voice clones. The company specializes in the elusive AI voice category, focusing on enterprise users.

Rapid Speech Cloning, launched today, can copy speech from relatively short data sets and produce output in about a minute. Resemble said the move marks a major development that will make voice cloning technology more accessible, allowing more users to create custom voices for their applications. The company believes it will have an impact in areas such as content creation, personalization and accessibility.

Resemble has released multiple voice clone samples, demonstrating the power of the new technology. VentureBeat also tested the feature to see how well it works in practice.

How does the new AI voice cloning feature work?

Using Resemble’s web platform, users can create a digital copy of their voice by uploading an audio sample or recording a series of sentences. The company has been offering this feature for a while, but the process took time. Users must record about 25 sentences or upload at least three minutes of voice content to set up the system, and then it takes another hour or so before the clones are available.

VB event

Artificial Intelligence Impact Tour – Atlanta

Continuing our tour, we will head to Atlanta for the AI ​​Impact Tour stop on April 10th. This exclusive, invitation-only event in partnership with Microsoft will discuss how generative AI is transforming the security workforce. Space is limited, please request an invitation now.

request an invitation

Now, with the launch of Rapid Voice Cloning, it’s even easier for users to start using the technology. All they have to do is provide clear audio samples of the target speech, ranging in duration from 10 seconds to 1 minute. The company’s model instantly captures all parameters in a sample, including accents, and gives results for downstream use cases within a minute.

“While other state-of-the-art models often struggle to replicate the nuances and subtleties of different accents, Resemble AI’s advanced machine learning algorithms excel in this area. By analyzing and learning from 10 seconds of speech samples, our fast speech clones AI-generated speech can be created that faithfully mimics the unique intonation, pronunciation, and rhythm of the original speaker’s accent.” This feature.

The company released a series of samples comparing its product to Microsoft’s VALL-E and XTTS-v2 speech cloning models, which included input speech samples and text used for cloning. The results are very impressive. However, when we created a free test account to see how the technology really works, we discovered some glaring gaps.

In our tests, the system required recording at least three long sentences, and there was no option to record shorter 10-second samples. The processing is fast, but it fails to recognize the speaker’s Indian accent and defaults to the input as a speech sample of American English. This affects the accent of the output speech. However, the issue is expected to be fixed as, according to the company, Rapid Voice Cloning will support most English accents.

Notably, the company will continue to offer the original cloning feature under the name Professional Voice Cloning. This option has longer input requirements and takes time, but supports all English accents and supports text-to-speech and speech-to-speech use cases. Quick Clone only supports text-to-speech generation.

Used across different categories

With the speed of rapid voice cloning and dramatically reduced sample requirements, Resemble AI anticipates more users using the technology and enabling faster iteration and deployment. The biggest adopters are expected to be content creators, who can use the technology to generate voiceovers, voiceovers, narration and dialogue for their podcasts, videos, audiobooks or e-learning materials. The company also said businesses can use the technology to create enhanced accessibility and personalized experiences.

“For example, fitness apps could use rapid voice cloning to create personalized AI coaches that speak to each user in a familiar voice, providing encouragement and guidance. Likewise, virtual assistants could adjust their voices to match the user’s preferences, thereby Create more intimate and customized interactions,” the company said.

While it remains to be seen how the technology will be adopted, it’s worth noting that Resemble isn’t the only company reducing the time it takes to generate voice clones. ElevenLabs, another major player in this category, offers a feature called Instant Voice Clone, which requires at least a minute of clear audio to generate clones almost instantly. Like Resemble, ElevenLabs offers a professional version of the tool that covers more languages ​​and accents.

As of now, Resemble AI allows users to create a free voice clone. For more services, users must choose the company’s paid plans, which start at $29 per month and go up to $499 per month. There’s also the option of a pay-as-you-go personal plan or a larger enterprise plan with custom pricing.

#Resemble #launches #tool #create #voice #clones #minute

Leave a Reply

Your email address will not be published. Required fields are marked *

Index