I’m currently an undergraduate student at Shanghai Jiao Tong University, majoring in Computer Science and Technology (IEEE Pilot Class). I’m now a research intern in Machine Vision and Intelligence Group (MVIG), under the supervision of Prof. Cewu Lu and Prof. Lixin Yang. My research interests include computer vision, multi-modal models as well as embodied intelligence. I welcome any invitation for collaboration or discussion. If you are interested, please drop me an email! (CV last updated: 2025.1.14)
🔥 News
- 2024.8: We release POEM-v2 as a generalizable multi-view hand mesh reconstruction model.
- 2024.6: I will work as an intern in the AI department of bilibili this summer.
- 2023.12: One paper FAVOR was accepted by AAAI 2024
- 2023.07: I have spent wonderful two weeks attending the summer course IDS2301 at HKU-IDS.
- 2023.03: 🎉🎉 I joined MVIG and began exploring the field of Computer Vision.
📝 Publications
PoseAnything6D: Generalized Object Pose Estimation via Cousin Reference
In submission to ICCV 2025

Multi-view Hand Reconstruction with a Point-Embedded Transformer
Lixin Yang, Licheng Zhong, Pengxiang Zhu, Xinyu Zhan, Junxiao Kong, Jian Xu, Cewu Lu
POEM is a generalizable multi-view hand mesh reconstruction model which embeds a static basis point within the multi-view stereo space. To infer accurate 3D hand mesh from multi-view images, POEM introduce a point-embedded transformer decoder. By employing a combination of five large-scale multi-view datasets and sufficient data augmentation, POEM demonstrates superior generalization ability in real-world applications.

FAVOR: Full-body AR-driven Virtual Object Rearrangement Guided by Textual Instructions
AAAI 2024 | Paper
Kailin Li*, Lixin Yang*, Zenan Lin, Jian Xu, Xinyu Zhan, Yifei Zhao, Pengxiang Zhu, Wenxiong Kang, Kejian Wu, Cewu Lu
FAVOR is a novel dataset for Full-body AR-driven Virtual Object Rearrangement that uniquely employs motion capture systems and VR. A pipeline for producing digital human rearrangement motion sequences is also presented.
💻 Undergraduate Projects

In this project, we build our CUT++ model for image style transfer on the renowned CUT model. By introducing attention into the GAN-based model and modifying the PatchNCE loss, we achieve decent result on the given dataset.

Chinese Slot Language Understanding
In this project, we build a BERT-based pipeline for Chinese slot understanding. We incorporated Lexicon information into the BERT backbone and achieved descent result on the given noisy dataset.
📇 Experiences
Machine Vision and Intelligence Group
Undergraduate Research Intern, 2023.3 - (now)
Advisor: Lixin Yang, Cewu Lu
🎖 Honors and Awards
- 2024 Academic Excellence Scholarship of SJTU (top 30%)
- 2023 Academic Excellence Scholarship of SJTU (top 10%)
- 2022 Academic Excellence Scholarship of SJTU (top 10%)
- 2022.3 Finalist of Mathematical Contest of Modeling (top 5%)
📖 Educations
- 2021.09 - (now), Shanghai Jiao Tong University, Shanghai, China.
- 2018.09 - 2021.06, Shanghai High School, Shanghai, China (Outstanding Graduate).