I am an incoming CS PhD student at Stanford University, starting in Fall 2024.

If you share my interests in Robotics/Learning/Vision, don't hesitate to shoot me an email. I'm always up for exploring potential collaborations and/or engaging in insightful conversations.

Research Interests

My research lies at the intersection of Robotics, Learning, and Vision. My focus is on learning generalizable {3D, 2D, language, dynamics} representations from {images, videos, texts, interaction data} for robotics, as in 3D Diffusion Policy (DP3) and GNFactor. I like to explore the natural supervision signals that come from the data itself.


3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
Yanjie Ze*, Gu Zhang*, Kangning Zhang, Chenyuan Hu, Muhan Wang, Huazhe Xu
Robotics: Science and Systems (RSS), 2024 (Oral)
project page / arXiv / code / bibtex
GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields
Yanjie Ze*, Ge Yan*, Yueh-Hua Wu*, Annabella Macaluso, Yuying Ge, Jianglong Ye, Nicklas Hansen, Li Erran Li, Xiaolong Wang
Conference on Robot Learning (CoRL), 2023 (Oral)
project page / arXiv / code / poster / bibtex

Recent Talks

Visual Representations for Generalizable Robotic Manipulation
November 2, 2023
Presented at TechBeat, Shanghai AI Lab, and Tsinghua University.
slides / YouTube / Bilibili