👤 About Me

I am about to join King Abdullah University of Science and Technology University (KAUST) as a Postdoctoral Researcher Under Prof. Jürgen Schmidhuber. I earned my Ph.D. from Tokyo Institute of Technology (TIT), advised by Prof. Hideki Koike. Before joining TIT, I worked as a computer vision researcher at Sensetime. I received my Master degree at Shanghai Jiaotong University (SJTU) in 2020.

My research focuses on image and video generative models, as well as the creation of virtual humans. I’m always interested in novel applications and open to collaboration—feel free to reach out!

📝 Publications

One at a Time: Progressive Multi-step Volumetric Probability Learning for Reliable 3D Scene Perception

Bohan Li, Yasheng Sun, Jingxin Dong, Zheng Zhu, Jinming Liu, Xin Jin, Wenjun Zeng.

AAAI Conference on Artificial Intelligence (AAAI) 2024.

Bridging Stereo Geometry and BEV Representation with Reliable Mutual Interaction for Semantic Scene Completion

Bohan Li, Yasheng Sun, Zhujin Liang, Dalong Du, Zhuanghui Zhang, Xiaofeng Wang, Yunnan Wang, Xin Jin, Wenjun Zeng

International Joint Conference on Artificial Intelligence (IJCAI) 2024.

Project

ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation

Yasheng Sun, Yifan Yang, Houwen Peng, Yifei Shen, Yuqing Yang, Han Hu, Lili Qiu, Hideki Koike

Advances in Neural Information Processing Systems (NeurlPS) 2023.

Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation

Yasheng Sun, Qianyi Wu, Hang Zhou, Kaisiyuan Wang, Tianshu Hu, Chen-Chieh Liao, Shio Miyafuji, Ziwei Liu, Hideki Koike

International Conference on Multimodal Interaction (ICMI) 2023 (Oral).

Project

Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers

Yasheng Sun, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Zhibin Hong, Jingtuo Liu, Errui Ding, Jingdong Wang, Ziwei Liu, Hideki Koike

SIGGRAPH Asia 2022 Conference.

Project

Speech2Talking-Face: Inferring and Driving a Face with Synchronized Audio-Visual Representation

Yasheng Sun, Hang Zhou, Ziwei Liu, Hideki Koike International Joint Conference on Artificial Intelligence (IJCAI) 2021.

Project

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation

Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, Ziwei Liu

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021.

Project

🎖 Honors and Awards

2017.02 . Outstanding Winner (1/8085) of MCM/ICM competition held by American Consortium for Mathematics and Its Application (COMAP). A nice collaboration with Teng Xue and Zhu Taihang!
2015.07 . Principal’s Special Award – Qiushi Award (Top 1%).
2014.12 . National Scholoarship (Top 5%).

📖 Educations

2020.09 - 2024.12, Tokyo Institute of Technology, Ph.D student of Computer Science, School of Computing.
2017.09 - 2020.06, Shanghai Jiao Tong University, Master of Engineering.
2013.09 - 2017.06, Nanjing University of Aeronautics and Astronautics, Bachelor of Engineering.

📡 Collaborators

I am fortunate to have closely collaborated with Kaisiyuan Wang, Wenqing Chu, Qianyi Wu, Zhiliang Xu, Bohan Li, Hang Zhou and Prof. Ziwei Liu.
I am also fortunate to have had excellent discussions with researchers from industry, including Wayne Wu, Borong Liang, Yi Yuan, Jiangke Lin, Yifan Yang from orgnizations such as SenseTime, Baidu VIS, NetEase Fuxi AI Lab and Microsoft Research Asia.

💻 Academic Activities

Journal Reviewer of Transactions on Multimedia, TPAMI, etc
Conference Reviewer of CVPR, ICCV, ECCV, NeurIPS, ICLR, ICML, etc
Program Committee of AAAI, etc