Scarily, AI Can Generate Accurate Faces Just From a Person's Voice

Scientists at the Massachusetts Institute of Technology (MIT) in the United States have, for the first time, successfully used an algorithm to recreate a person's portrait from only a short voice recording.

The AI algorithm, called Speech2Face, was first introduced in 2019.

First, the researchers designed a deep neural network. The AI was then trained on millions of videos of people talking, taken from YouTube and elsewhere on the internet, to learn the correlation between the sound of a voice and the speaker's appearance. From that, it makes its best guesses about the speaker's age, gender, and ethnicity.

Once trained, the AI was able to come up with portraits based on voice recordings alone.

The researchers also built a 'face decoder' that creates a standardized representation of a person's face from a still image, ignoring lighting and pose. They then compared this standardized face with the face the AI generated from the voice. The results showed that the AI-generated face closely resembled the real one across a wide range of cases covering a variety of ages, genders, and ethnicities.

Researchers say the AI-generated portraits could be used to give a face to the machine-generated voices of home appliances and virtual assistants. The AI could also help law enforcement create a portrait of a suspect when a voice recording is the only evidence. That, however, could raise privacy concerns.

Update 09 December 2024
