I am a PhD candidate at Princeton University and a Princeton AI Lab Fellow, working with Prof. Mengdi Wang, Prof. Andy Yao, and Prof. Quanquan Gu, where my research focuses on building scalable and capable large language models (LLMs) and multimodal foundation models. My work explores methods to improve LLM reasoning via reinforcement learning, data curation and algorithms for foundation models, as well as the development of new attention mechanisms, positional encodings, and model architectures. Previously, I was a visiting PhD student at the University of California, Los Angeles.
About Me
Research Interests
- Language Modeling and Pretraining
- LLM Reasoning and Reinforcement Learning
- Physics of Deep Learning
You can find my publications on Google Scholar.
You can find my blog posts at Yifan's Blog.
Selected Works
arXiv:2601.00417
International Conference on Learning Representations (ICLR 2026)
International Conference on Learning Representations (ICLR 2026); See also Thinking Machines Tinker
Conference on Neural Information Processing Systems (NeurIPS 2025 Spotlight)
International Conference on Machine Learning (ICML 2025)
Findings of the Association for Computational Linguistics (ACL 2025 Findings)
AAAI Conference on Artificial Intelligence (AAAI 2025)
Transactions on Machine Learning Research (TMLR)
(* denotes equal contribution)
Recent Publications
International Conference on Learning Representations (ICLR 2026)
International Conference on Learning Representations (ICLR 2026); See also Thinking Machines Tinker and DeepSeek V3.2
Conference on Neural Information Processing Systems (NeurIPS 2025 Spotlight)
Findings of the Association for Computational Linguistics (ACL 2025 Findings)
International Conference on Machine Learning (ICML 2025)
AAAI Conference on Artificial Intelligence (AAAI 2025)
International Conference on Learning Representations (ICLR 2025 Spotlight)
(* denotes equal contribution)
Blog Highlights
Yifan's Blog, January 12, 2026
Yifan's Blog, December 27, 2025
Yifan's Blog, December 16, 2025
Yifan's Blog, December 15, 2025
Professional Activities
Teaching
- Teaching Assistant, Machine Learning for Yao class, IIIS, Tsinghua University
Academic Services
- Conference Reviewer: NeurIPS, ICLR, ICML, COLM, AAAI, AISTATS
- Journal Reviewer: ACM TKDD, Neural Computing, Neural Networks