Lip-to-Speech Synthesis in the Wild with Multi-task Learning

Overview

This repository contains a video demo of IEEE International Conference on Acoustics, Speech and Signal Processing submitted paper titled "Lip-to-Speech Synthesis in the Wild with Multi-task Learning."

The official code is now available in here.

Demo video

A demo video contains the original speech, the generated speech from previous state-of-the-art work [1], and the generated speech from the proposed method from three different speakers on both LRS2 and LRS3 datasets, respectively, and six speakers on LRW dataset. The video demo is located in demo-video folder in our repository, and it is also available in Youtube:

LRS2 and LRS3 [Demo Video]
LRW [Demo Video]

References

[1] Rodrigo Mira, Alexandros Haliassos, Stavros Petridis, Bj̈orn W Schuller, and Maja Pantic, “Svts: Scalable video-to-speech synthesis,” arXiv preprint arXiv:2205.02058, 2022.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
demo-video		demo-video
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

demo-video

demo-video

README.md

README.md

Repository files navigation

Lip-to-Speech Synthesis in the Wild with Multi-task Learning

Overview

Demo video

References

About

Releases

Packages

joannahong/Lip-to-Speech-Synthesis-in-the-Wild

Folders and files

Latest commit

History

demo-video

demo-video

README.md

README.md

Repository files navigation

Lip-to-Speech Synthesis in the Wild with Multi-task Learning

Overview

Demo video

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages