I am a Postdoctor at the Technical University of Munich (TUM), under Prof. Bjorn Schuller. I received my Ph.D. degree from National University of Singapore, supervised by Prof. Li Haizhou and Prof. Robby T. Tan. Prior to that, I received the Master’s Degree from Zhejiang University in 2019, supervised by Prof. Li Ping and Prof. Ren Qinyuan. I was awarded a Bachelor’s Degree by Northeastern University (China) in 2016.

My research interests are audio-visual speech recognition, talking face generation, and audio-visual sound source localization. I have 15 papers published or under review at top international conferences and journals, including CVPR, TASLP, TNNLS, TMM, AAAI, ICRA, and ICASSP.

📜 Research Area

Audio-Visual Speech Processing :
   Audio-visual speech recognition; Sound Source localization
Video Synthesize :
   Talking Face Generation

💼 Employment

  • 2024.04 - Now, Postdoctor in MRI, Technical University of Munich, Germany.
  • 2023.08 - 2024.03, Research Assistant in Chinese University of Hong kong, Shenzhen, China.

🏫 Education

  • 2019.08 - 2024.02, Ph.D. in Electrical and Computer Engineering, National University of Singapore, Singapore.
  • 2016.08 - 2019.06, M.Sc. in Control Engineering, Zhejiang Univerisity, China.
  • 2012.09 - 2016.06, B.Eng. in Automation, Northeastern University, China.

📝 Publication

2024

2023

2022

2021

💻 Open Source Code

  • TalkLip, Talking Face Generation
  • AVRI, Dataset and Codes, Audio-Visual Speaker Tracking

👔 Internship and Visiting Experience

  • 2018.07 - 2018.12, Visiting Student, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore.
  • 2022.02 - 2022.08, Visiting Student, Chinese University of Hong Kong (CUHKSZ), Shenzhen, China.

Reviewer

  • Reviewer of CVPR, ICCV, ECCV, ACM MM, TMM, SIGGRAPH ASIA, IROS.