👤 About Me

I am about to join King Abdullah University of Science and Technology (KAUST) as a Postdoctoral Researcher under Prof. Jürgen Schmidhuber. I earned my Ph.D. from Tokyo Institute of Technology (TIT), advised by Prof. Hideki Koike. Before joining TIT, I worked as a computer vision researcher at SenseTime. I received my Master's degree from Shanghai Jiao Tong University (SJTU) in 2020.

My research focuses on image and video generative models, as well as the creation of virtual humans. I’m always interested in novel applications and open to collaboration, so feel free to reach out!

📝 Publications

One at a Time: Progressive Multi-step Volumetric Probability Learning for Reliable 3D Scene Perception

Bohan Li, Yasheng Sun, Jingxin Dong, Zheng Zhu, Jinming Liu, Xin Jin, Wenjun Zeng

AAAI Conference on Artificial Intelligence (AAAI) 2024.

Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion

Bohan Li, Yasheng Sun, Zhujin Liang, Dalong Du, Zhuanghui Zhang, Xiaofeng Wang, Yunnan Wang, Xin Jin, Wenjun Zeng

International Joint Conference on Artificial Intelligence (IJCAI) 2024.

Project

ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation

Yasheng Sun, Yifan Yang, Houwen Peng, Yifei Shen, Yuqing Yang, Han Hu, Lili Qiu, Hideki Koike

Advances in Neural Information Processing Systems (NeurIPS) 2023.

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation

Yasheng Sun, Qianyi Wu, Hang Zhou, Kaisiyuan Wang, Tianshu Hu, Chen-Chieh Liao, Shio Miyafuji, Ziwei Liu, Hideki Koike

International Conference on Multimodal Interaction (ICMI) 2023 (Oral).

Project

Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers

Yasheng Sun, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Zhibin Hong, Jingtuo Liu, Errui Ding, Jingdong Wang, Ziwei Liu, Hideki Koike

SIGGRAPH Asia 2022 Conference.

Project

Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation

Yasheng Sun, Hang Zhou, Ziwei Liu, Hideki Koike

International Joint Conference on Artificial Intelligence (IJCAI) 2021.

Project

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation

Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, Ziwei Liu

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021.

Project

🎖 Honors and Awards

📖 Education

  • 2020.09 - 2024.12, Tokyo Institute of Technology, Ph.D. student in Computer Science, School of Computing.
  • 2017.09 - 2020.06, Shanghai Jiao Tong University, Master of Engineering.
  • 2013.09 - 2017.06, Nanjing University of Aeronautics and Astronautics, Bachelor of Engineering.

📡 Collaborators

💻 Academic Activities

  • Journal reviewer for Transactions on Multimedia, TPAMI, etc.
  • Conference reviewer for CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, etc.
  • Program committee member for AAAI, etc.