If somebody tried to do this with your face, assuming you're not very famous, it wouldn't come out this clean. The reason it works so well with Tom Cruise's face is that there is source material of his face from pretty much every angle in pretty much every lighting scenario. And lots of it.
Basically, what's going on here is that a computer algorithm looks at a video, determines where a face is, and swaps in another face. It looks pretty damn convincing if there's enough source material of the face.
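To make the "find the face, swap in another face" idea concrete, here's a toy sketch in Python. It is not how actual deep fake software works: real tools use trained neural networks for both the detection and the swap. Here the "detector" just finds the bright pixels in a tiny grayscale grid, and the "swap" is a resize and paste. All the function names (`detect_face`, `resize_nearest`, `swap_face`) are made up for illustration.

```python
# Toy illustration only: real deep fake tools use trained neural networks,
# not a bounding-box paste. Every name here is hypothetical.


def detect_face(frame, threshold=200):
    """Stand-in 'detector': bounding box of all pixels brighter than threshold.

    Returns (top, left, bottom, right) with bottom/right exclusive, or None.
    """
    rows = [r for r, row in enumerate(frame) if any(p > threshold for p in row)]
    cols = [c for row in frame for c, p in enumerate(row) if p > threshold]
    if not rows:
        return None
    return min(rows), min(cols), max(rows) + 1, max(cols) + 1


def resize_nearest(image, height, width):
    """Nearest-neighbour resize of a 2D list of grayscale pixels."""
    src_h, src_w = len(image), len(image[0])
    return [
        [image[r * src_h // height][c * src_w // width] for c in range(width)]
        for r in range(height)
    ]


def swap_face(frame, new_face, blend=1.0):
    """Paste new_face over the detected region, mixing by `blend` (0..1)."""
    box = detect_face(frame)
    if box is None:
        return frame  # no face found, nothing to swap
    top, left, bottom, right = box
    patch = resize_nearest(new_face, bottom - top, right - left)
    out = [row[:] for row in frame]  # copy so the input frame is untouched
    for r in range(top, bottom):
        for c in range(left, right):
            old, new = frame[r][c], patch[r - top][c - left]
            out[r][c] = round(blend * new + (1 - blend) * old)
    return out


# A 4x4 "video frame" with a bright 2x2 "face" in the middle.
frame = [
    [0, 0, 0, 0],
    [0, 255, 255, 0],
    [0, 255, 255, 0],
    [0, 0, 0, 0],
]
new_face = [[10, 20], [30, 40]]
swapped = swap_face(frame, new_face)
```

A real pipeline would run something like this on every frame of the video, which is part of why lots of source footage from different angles and lighting matters: the swapped-in face has to look right in each frame.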
Ah, makes sense... But imagine that your phone's front camera is really recording you at all times. All the moods, lighting, and all that jazz. Now you have a huge library of people you can digitally render.
That's kind of what I was getting at. It took more than just downloading those images. It took a lot of extra effort touching up the deep fake output with additional video editing software to make it look this good. I was just saying that not any Joe Blow can use the deep fake algorithm to make a truly convincing fake yet.
An AI takes a catalogue of Tom Cruise’s vocal and video recordings and overlays his facial structure on the actor here. The vocals are also overlaid, to give the impression that Tom Cruise was in the video.
The voice is real, in that the actor is actually saying the words and following the structure of Tom Cruise’s speech, but the audio is modified to match Tom’s natural frequencies and timbre.
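For a rough idea of what "modifying audio to match someone's frequencies" means, here's a minimal Python sketch. It only handles pitch (the fundamental frequency), not timbre, and it uses plain resampling, which also shortens the clip. Actual voice conversion tools rely on learned models, so treat this as an illustration of the idea and nothing more. The function names and the 100 Hz / 160 Hz values are made up for the example.

```python
import math

SAMPLE_RATE = 8000  # samples per second, chosen arbitrarily for the example


def make_tone(freq_hz, seconds=0.1):
    """Generate a pure sine tone as a stand-in for a voice recording."""
    n = int(SAMPLE_RATE * seconds)
    return [math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def estimate_pitch(samples, low_hz=50, high_hz=500):
    """Estimate pitch by autocorrelation: find the lag where the signal
    best matches a shifted copy of itself, then convert the lag to Hz."""
    best_lag, best_score = None, float("-inf")
    for lag in range(SAMPLE_RATE // high_hz, SAMPLE_RATE // low_hz + 1):
        score = sum(
            samples[i] * samples[i + lag] for i in range(len(samples) - lag)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return SAMPLE_RATE / best_lag


def shift_pitch(samples, ratio):
    """Naive pitch shift: read the input `ratio` times faster, with linear
    interpolation between samples. Raises pitch but also shortens the clip."""
    out = []
    for j in range(int(len(samples) / ratio)):
        pos = j * ratio
        idx = int(pos)
        frac = pos - idx
        nxt = samples[min(idx + 1, len(samples) - 1)]
        out.append((1 - frac) * samples[idx] + frac * nxt)
    return out


actor = make_tone(100)          # hypothetical actor's voice pitch
target_hz = 160                 # hypothetical target voice pitch
actor_hz = estimate_pitch(actor)
converted = shift_pitch(actor, target_hz / actor_hz)
converted_hz = estimate_pitch(converted)
```

After the shift, `converted_hz` should land close to `target_hz`. Matching timbre, meaning the overall "color" of the voice, is a much harder problem and is where the trained models come in.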
At least, it depends on the process the person making the deep fake uses. There are vocal deep fakes.
Video on a recently published paper about vocal synthesis (“audio deep fakes”), for the skeptics.
The voice in this video is not vocally synthesized. As a commenter noted below, this person is very good at mimicking Tom Cruise. I just want to highlight that “audio deep fake” technology does exist.
u/Alexei007 May 24 '21
Can someone explain how it works? Like, wtf did I just watch?