Demos
- Synthesizing Speech from Silent Video (GRID)
- Synthesizing Speech from Silent Video (Chemistry)
Niu Zhe, Brian Mak, "On the Audio-visual Synchronization for Lip-to-Speech Synthesis." ICCV 2023. [pdf]
- TTS Style Transfer by Speech Imitation
Raymond Chung, Brian Mak, "On-The-Fly Data Augmentation for Text-to-Speech Style Transfer." ASRU 2021: 634-641. [pdf]
- Non-parallel Many-to-many Voice Conversion
Xinyuan Yu, Brian Mak, "Non-Parallel Many-To-Many Voice Conversion by Knowledge Transfer from a Text-To-Speech Model." ICASSP 2021: 5924-5928. [pdf]
- Multilingual Multi-speaker Neural TTS
Zhaoyu Liu, Brian Mak, "Multi-Lingual Multi-Speaker Text-to-Speech Synthesis for Voice Cloning with Online Speaker Enrollment." INTERSPEECH 2020: 2932-2936. [pdf]