I am currently a Ph.D. candidate at HMI Lab, NERCV²T, School of Computer Science, Peking University, supervised by Prof. Shanghang Zhang. I received my Bachelor’s degree in Artificial Intelligence (Turing Honor Degree) from Peking University in 2023, where I also obtained a Bachelor’s degree in Economics.
My research interests lie in computer vision and multimodal learning, including visual foundation models, multimodal large language models, visual complex reasoning, visual token compression, visual continual learning, and embodied artificial intelligence. The overall goal of my research is to develop a large-scale efficient visual perception system with human-like expression, adaptation, and generalization, equipped with powerful abilities including fundamental perception, cognitive reasoning, and autonomous creativity.
📧 Email: theia@pku.edu.cn, theia4869@gmail.com
Feel free to reach out for collaboration!