Haotian Zhang
I'm a Senior Research Scientist at NVIDIA Spatial Intelligence Lab (SIL) .
I received my Ph.D. in Computer Science from Stanford University in 2023, where I was advised by Prof. Kayvon Fatahalian . I obtained my B.E. in Computer Science and Technology from Tsinghua University in 2017, where I was advised by Prof. Shi-Min Hu .
My current research focuses on 3D human motion perception, modeling and generation. In particular, I am interested in leveraging human motion data from videos to solve problems in humanoid robotics.
Email  | 
Google Scholar  | 
Twitter
Synthesizing High-Quality and Controllable Tennis Animation From Real-World Video Collections
Haotian Zhang
Ph.D. in Computer Science, Stanford University, 2023
thesis
Your browser does not support the video tag.
Kimodo: Scaling Controllable Human Motion Generation
Davis Rempe *,
Mathis Petrovich *,
Ye Yuan ,
Haotian Zhang ,
Xue Bin Peng ,
Yifeng Jiang ,
Tingwu Wang ,
Umar Iqbal ,
David Minor ,
Michael de Ruyter ,
Jiefeng Li ,
Chen Tessler ,
Edy Lim,
Eugene Jeong,
Sam Wu,
Ehsan Hassani,
Michael Huang,
Jin-Bey Yu,
Chaeyeon Chung,
Lina Song,
Olivier Dionne,
Jan Kautz ,
Simon Yuen ,
Sanja Fidler (*Equal Contribution)
arXiv , 2026
project page |
code |
tech report
Your browser does not support the video tag.
GENMO: A GENeralist Model for Human MOtion
Jiefeng Li ,
Jinkun Cao ,
Haotian Zhang ,
Davis Rempe ,
Jan Kautz ,
Umar Iqbal ,
Ye Yuan
ICCV , 2025   (Highlight)
project page |
code |
paper |
video |
Two Minute Papers
HIL: Hybrid Imitation Learning of Diverse Parkour Skills from Videos
Jiashun Wang ,
Yifeng Jiang ,
Haotian Zhang ,
Chen Tessler ,
Davis Rempe ,
Jessica Hodgins ,
Xue Bin Peng ,
arXiv , 2025
paper |
video
Generative Motion Infilling from Imprecisely Timed Keyframes
Purvi Goel ,
Haotian Zhang ,
Karen Liu ,
Kayvon Fatahalian
Eurographics , 2025
project page |
paper |
video
Your browser does not support the video tag.
HumanoidOlympics: Sports Environments for Physically Simulated Humanoids
Zhengyi Luo* ,
Jiashun Wang* ,
Kangni Liu* ,
Haotian Zhang ,
Chen Tessler ,
Jingbo Wang ,
Ye Yuan ,
Jinkun Cao
Zihui Lin ,
Fengyi Wang ,
Jessica Hodgins ,
Kris Kitani
arXiv , 2025
project page |
paper |
code
Your browser does not support the video tag.
COIN:Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
Jiefeng Li ,
Ye Yuan ,
Davis Rempe ,
Haotian Zhang ,
Pavlo Molchanov ,
Cewu Lu ,
Jan Kautz ,
Umar Iqbal
ECCV , 2024
project page |
paper
Your browser does not support the video tag.
Learning Physically Simulated Tennis Skills from Broadcast Videos
Haotian Zhang ,
Ye Yuan ,
Viktor Makoviychuk ,
Yunrong Guo ,
Sanja Fidler ,
Xue Bin Peng ,
Kayvon Fatahalian
SIGGRAPH , 2023   (Best Paper Honorable Mention)
project page |
paper |
video |
code |
Two Minute Papers
Spotting Temporally Precise, Fine-Grained Events in Video
James Hong ,
Haotian Zhang ,
Michaël Gharbi ,
Matthew Fisher ,
Kayvon Fatahalian
European Conference on Computer Vision (ECCV) , 2022
project page |
paper |
code
Your browser does not support the video tag.
Vid2Player: Controllable Video Sprites that Behave and Appear like Professional Tennis Players
Haotian Zhang ,
Cristobal Sciutto ,
Maneesh Agrawala ,
Kayvon Fatahalian
ACM Transactions on Graphics (TOG) , 2021
project page |
paper |
video |
Two Minute Papers
Analysis of Faces in a Decade of US Cable TV News
James Hong ,
Will Crichton ,
Haotian Zhang ,
Dan Fu ,
Jacob Ritchie ,
Jeremy Barenholtz ,
Ben Hannel ,
Xinwei Yao ,
Michaela Murray ,
Geraldine Moriba ,
Maneesh Agrawala ,
Kayvon Fatahalian
ACM Conference on Knowledge Discovery and Data Mining (KDD) , 2021
project page |
paper |
demo
Coherent Video Generation for Multiple Hand-held Cameras with Dynamic Foreground
Fang-Lue Zhang ,
Connelly Barnes ,
Haotian Zhang ,
Junhong Zhao ,
Gabriel Salas
Computational Visual Media , 2020
paper
An Internal Learning Approach to Video Inpainting
Haotian Zhang ,
Long Mai ,
Ning Xu ,
Zhaowen Wang ,
John Collomosse
Hailin Jin
International Conference on Computer Vision (ICCV) , 2019
project page |
paper |
video |
code
TextureNet: Consistent Local Parametrizations for Learning from High-Resolution Signals on Meshes
Jingwei Huang ,
Haotian Zhang ,
Li Yi ,
Thomas Funkhouser ,
Matthias Nießner
Leonidas Guibas
Computer Vision and Pattern Recognition (CVPR) , 2019   (Oral presentation)
project page |
paper |
code
Rekall: Specifying Video Events using Compositions of Spatiotemporal Labels
Dan Fu ,
Will Crichton ,
James Hong ,
Xinwei Yao ,
Haotian Zhang ,
Anh Truong ,
Avanika Narayan ,
Maneesh Agrawala ,
Christopher Ré ,
Kayvon Fatahalian
ACM Symposium on Operating Systems Principles (SOSP), workshop on AI Systems , 2019
project page |
paper |
code
Image-based Clothes Changing System
Zhao-Heng Zheng ,
Haotian Zhang ,
Fang-Lue Zhang ,
Tai-Jiang Mu
Computational Visual Media , 2017
paper
Robust Background Identification for Dynamic Video Editing
Fang-Lue Zhang ,
Xian Wu ,
Haotian Zhang ,
Jue Wang ,
Shi-Min Hu
SIGGRAPH Asia , 2016
paper
Last updated: March, 2026
Template adapted from Ye Yuan .