About Me

I am currently an assistant professor at the Research Institute of Multiple Agents and Embodied Intelligence, Peng Cheng Laboratory. I received my PhD from Xi’an Jiaotong University in March 2023, advised by Prof. Jihua Zhu. From 2023 to 2024, I was a postdoctoral researcher at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), working with Prof. Xiaodan Liang. From 2019 to 2021, I was a visiting scholar at Monash University, working with Prof. Xiaojun Chang.

My current research interest is Embodied AI. I have also worked on 2D/3D generation, multimodal large language models, image denoising, and point set registration.

News

  • [09/2024]: I have been appointed as an Assistant Professor at Peng Cheng Laboratory.
  • [09/2024]: One paper was accepted to NeurIPS 2024.
  • [07/2024]: One paper was accepted to ECCV 2024.
  • [12/2023]: One paper was accepted to AAAI 2024.
  • [03/2023]: I started my postdoc journey at MBZUAI.
  • [11/2022]: One paper was accepted to AAAI 2023.
  • [10/2022]: One paper was accepted to TIP.

Selected Publications

Multimodal Large Language Model
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen
NeurIPS, 2024 [Code]



2D Generation
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang
ECCV, 2024 [Code]


3D Generation
PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping
Luoyang Lin, Zutao Jiang, Xiaodan Liang, Liqian Ma, Michael C. Kampffmeyer, Xiaochun Cao
AAAI, 2024

3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
Zutao Jiang, Guansong Lu, Xiaodan Liang, Jihua Zhu, Wei Zhang, Xiaojun Chang, Hang Xu
AAAI, 2023

Image Denoising
Dynamic Slimmable Denoising Network
Zutao Jiang, Changlin Li, Xiaojun Chang, Ling Chen, Jihua Zhu, Yi Yang
TIP, 2023


Point Set Registration
Merging Grid Maps in Diverse Resolutions by the Context-based Descriptor
Zhiyang Lin, Jihua Zhu, Zutao Jiang, Yujie Li, Yaochen Li, Zhongyu Li
ACM Transactions on Internet Technology, 2021

3D mapping of outdoor environments by scan matching and motion averaging
Zutao Jiang, Jihua Zhu, Zhiyang Lin, Zhongyu Li, Guo Rui
Neurocomputing, 2020

Efficient registration of multi-view point sets by K-means clustering
Jihua Zhu, Zutao Jiang, Georgios D Evangelidis, Changqing Zhang, Shanmin Pang, Zhongyu Li
Information Sciences, 2019 [Code]

Experiences

Research Intern, Peng Cheng Laboratory

Oct. 2021 - Mar. 2023

Selected Awards

First Prize of Shaanxi Higher Education Natural Science Award

2024

Second Prize of Science and Technology Award of Shaanxi Computer Society

2023

Teaching

CV803: Advanced Techniques in Visual Object Recognition and Detection, MBZUAI (Teaching Assistant)

2024

Reviewer

  • CVPR 2023, 2024
  • NeurIPS 2023, 2024
  • ECCV 2024
  • ICLR 2023, 2024
  • ICML 2023
  • ACM MM 2024
  • IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • IEEE Transactions on Cybernetics (TCYB)
  • IEEE Transactions on Systems, Man, and Cybernetics
  • ACM Transactions on Multimedia Computing, Communications, and Applications
  • Neural Networks
  • Knowledge-Based Systems (KBS)