Qing Shuai | 帅青
I am currently working at Tencent (2024.7-), where my focus is on human motion capture and generation under multimodal inputs. Prior to this, I was a Ph.D. student in Computer Science at Zhejiang University from 2019 to 2024, under the supervision of Xiaowei Zhou. My research interests lie at the intersection of computer vision and computer graphics, with a particular emphasis on 3D human pose estimation and generation, 3D reconstruction, and novel view synthesis.
During my past career, my main focus was on the EasyMoCap repository. The goal of this repository is to make human motion capture more accessible and straightforward. It encompasses a collection of code from my work over the past few years and includes essential tools for the field of human motion capture, such as camera calibration, interactive keypoint annotation, visualization, and more.
Demos
Professional Motion Capture with Multi-Camera Systems
Simple Motion Capture from Complex Internet Videos
Novel View Synthesis
4D Scene Reconstruction and Editing
All Publications
2026
-
AnchorCrafter: Animate Cyber-Anchors Selling Your Products via Human-Object Interacting Video GenerationIEEE Transactions on Visualization and Computer Graphics 2026
2025
-
HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion GenerationarXiv preprint arXiv:2512.23464 2025 -
Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion GenerationsIn Proceedings of the IEEE/CVF International Conference on Computer Vision 2025 -
Idol: Instant photorealistic 3d human creation from a single imageIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 -
Dyn-e: Local appearance editing of dynamic neural radiance fieldsComputers & Graphics 2025 -
Ready-to-react: Online reaction policy for two-character interaction generationarXiv preprint arXiv:2502.20370 2025
2024
-
Animatable implicit neural representations for creating realistic avatars from videosIEEE Transactions on Pattern Analysis and Machine Intelligence 2024 -
Anidress: animatable loose-dressed avatar from sparse views using garment rigging modelarXiv preprint arXiv:2401.15348 2024
2023
-

-

- Reconstructing Close Human Interactions from Multiple ViewsACM Transactions on Graphics (TOG) Jun 2023
- Motion capture method based on unsynchorized videosAug 2023
-
Learning human mesh recovery in 3D scenesIn Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Aug 2023
2022
-
Efficient Neural Radiance Fields for Interactive Free-viewpoint VideoIn SIGGRAPH Asia Conference Proceedings Aug 2022 -
