I’m currently an undergraduate student at Shanghai Jiao Tong University, majoring in Computer Science and Technology (IEEE Pilot Class). I’m also a research intern in the Machine Vision and Intelligence Group (MVIG), under the supervision of Prof. Cewu Lu and Prof. Lixin Yang. My research interests lie at the intersection of Computer Vision and Robotics, and I welcome any invitation for collaboration or discussion.

I intend to gain some experience as an intern at a company or research institution while preparing my MS applications in Fall 2025. If you are interested, please drop me an email! (CV last updated: 2024.8.26)

🔥 News

  • 2024.8: We release POEM-v2 as a generalizable multi-view hand mesh reconstruction model.
  • 2024.6: I will work as an intern in the AI department of bilibili this summer.
  • 2023.12: One paper, FAVOR, was accepted to AAAI 2024.
  • 2023.07: I spent two wonderful weeks attending the summer course IDS2301 at HKU-IDS.
  • 2023.03:  🎉🎉 I joined MVIG and began exploring the field of Computer Vision.

📝 Publications

Arxiv

Multi-view Hand Reconstruction with a Point-Embedded Transformer

Arxiv | Paper | Code

Lixin Yang, Licheng Zhong, Pengxiang Zhu, Xinyu Zhan, Junxiao Kong, Jian Xu, Cewu Lu

POEM is a generalizable multi-view hand mesh reconstruction model that embeds a static basis point within the multi-view stereo space. To infer an accurate 3D hand mesh from multi-view images, POEM introduces a point-embedded transformer decoder. Trained on a combination of five large-scale multi-view datasets with sufficient data augmentation, POEM demonstrates superior generalization ability in real-world applications.

AAAI 2024

FAVOR: Full-body AR-driven Virtual Object Rearrangement Guided by Textual Instructions

AAAI 2024 | Paper

Kailin Li*, Lixin Yang*, Zenan Lin, Jian Xu, Xinyu Zhan, Yifei Zhao, Pengxiang Zhu, Wenxiong Kang, Kejian Wu, Cewu Lu

FAVOR is a novel dataset for Full-body AR-driven Virtual Object Rearrangement that uniquely employs motion capture systems and VR. A pipeline for producing digital human rearrangement motion sequences is also presented.

💻 Undergraduate Projects

AI3603

CUT++: Image Style Transfer

GitHub | PDF

In this project, we build our CUT++ model for image style transfer on top of the well-known CUT model. By introducing attention into the GAN-based model and modifying the PatchNCE loss, we achieve decent results on the given dataset.

CS3602

Chinese Slot Language Understanding

GitHub | PDF

In this project, we build a BERT-based pipeline for Chinese slot understanding. We incorporate lexicon information into the BERT backbone and achieve decent results on the given noisy dataset.

📇 Experiences

Machine Vision and Intelligence Group

Undergraduate Research Intern, 2023.3 - (now)

Advisors: Lixin Yang, Cewu Lu

Department of AI Technology, bilibili

Algorithm Intern, 2024.6 - 2024.9

Mentor: Jun Xu

🎖 Honors and Awards

  • 2023 Academic Excellence Scholarship of SJTU (top 10%)
  • 2022 Academic Excellence Scholarship of SJTU (top 10%)
  • 2022.3 Finalist in the Mathematical Contest in Modeling (top 5%)

📖 Education

  • 2021.09 - (now), Shanghai Jiao Tong University, Shanghai, China.
  • 2018.09 - 2021.06, Shanghai High School, Shanghai, China (Outstanding Graduate).