sb-au logo
Story image

New rapid voice cloning tech could spell end for voice authentication

10 Mar 2018

A new innovation has been announced that can clone and reproduce a pretty believable version of your voice using less than four seconds of audio.

Chinese tech giant Baidu unveiled its latest advancements within Deep Voice, a project the company began around a year ago.

In its infancy, Deep Voice required 30 minutes of audio to create a new fake clip, which is a perfect representation of the rapid development within machine learning and artificial intelligence (AI).

The more audio samples the system is provided, the better the quality and believability of the fake voice. At its bare minimum of 3.7 seconds the output does sound a little distorted but certainly not much worse than what a low quality audio file might sound.

Furthermore, the system can change a male voice to female or a British accent to American (and vice versa) showcasing the power of AI to learn and imitate different styles of speaking and bringing the act of text-to-speech to a entirely new level.

Technologies similar to this have emerged in recent years with Adobe revealing its VoCo software in 2016 that could generate speech from text after listening to a voice for 20 minutes. Likewise, Lyrebird (a Montreal-based AI startup) affirms it can do the same with just one minute of audio.

Of course, there is a fair amount of apprehension surrounding the rapid advancements of text-to-speech, especially given the rise of machine learning software that has enabled the easy creation of fake videos – making any media on the web increasingly harder to believe.

AI researchers and theorists have expressed their concerns as essentially if all that’s needed is a few seconds of someone’s voice and a dataset of their face it wouldn’t be hard at all to concoct an entire interview, press conference or news segment.

Aeriandi chief product officer and co-founder Tom Harwood says there is a huge security issue given we are in the era of voice biometric authentication.

"This technology is poised to transform personalisation in human-machine interfaces, but it raises serious concerns about voice biometric security systems,” Harwood says.

“Soon, criminals will need just a few seconds of someone’s voice to cheat a voice recognition security system – voice biometric authentication will be rendered useless.”

Harwood says organisations need to be thinking now about how they can implement new technologies to ensure they stay ahead of the curve

“Voice fraud detection technology is the primary candidate, as it looks at far more than the user's voice print; it considers hundreds, if not thousands, of other parameters,” Harwood says.

“For example, is the phone number being used legitimate? Has it been used fraudulently before? Increasingly, phone fraud attacks come from overseas. Voice fraud technology has been proven to protect against this as well as domestic threats."

Story image
Rising to the contact centre security challenge in the era of COVID-19
Cloud based contact centres have enabled Australian organisations to keep on working through the coronavirus pandemic but, in a climate of heightened risk, ensuring the security of your solutions and customer data is a critical imperative.More
Story image
Why DX is not complete without a transformed security architecture
Secure Access Services Edge (SASE) is the process by which core WAN edge capabilities like SD-WAN, routing, and WAN optimisation at branch locations are integrated with cloud-based security services like secure web gateways, firewall-as-a-service, cloud access security brokers, and more.More
Link image
Phishing attacks skyrocket as criminals cash in on COVID fears
Attackers are imitating the WHO as part of malicious phishing attacks - is your workforce protected?More
Story image
Australians ignoring cybersecurity policies in favour of productivity
Trend Micro has found that 67% of remote workers have increased their cybersecurity awareness during COVID-19 related lockdowns. However, despite greater awareness people may still engage in risky behaviour, the survey finds.More
Download image
NFV touted as go-to method of simplifying corporate networks
Enterprises outline their considerations in this study, built on evidence from those directly involved with building a strong networking infrastructure.More
Story image
Whither quantum computing in a rapidly digitising Australia?
While widespread digitisation continues to transform the way Australians live, work and play, the real revolution is occurring in the background, in the quantum computing sphere.More