I am a senior research scientist at Sea AI Lab. I received my Ph.D. from Tsinghua University, where I was advised by Prof. Jun Zhu, and my undergraduate degree from the Yao Class at Tsinghua University.
I work on machine learning. My research goal is to understand what intelligence is and how it arises, and to answer these questions from the perspective of probability and statistics. Currently, I focus on developing probabilistic methods for generative modeling from first principles, seeking to explain how structured representations and knowledge emerge from data.
We are hiring Research Scientists and Research Interns to work on generative models (e.g., diffusion models, LLMs). Feel free to reach out if you are interested.
Selected Publications
Generative Modeling
Nonparametric Data Attribution for Diffusion Models
Preprint, 2025
On Memorization in Diffusion Models
Transactions on Machine Learning Research (TMLR), 2025
Scaling up Masked Diffusion Models on Text
International Conference on Learning Representations (ICLR), Singapore, 2025
Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows
International Conference on Machine Learning (ICML), Hawaii, USA, 2023
Reinforcement Learning
Continual Reinforcement Learning by Planning with Online World Models
International Conference on Machine Learning (ICML), Vancouver, Canada, 2025
Locality Sensitive Sparse Encoding for Learning World Models Online
International Conference on Learning Representations (ICLR), Vienna, Austria, 2024
RL for Language Models
Reinforcing General Reasoning Without Verifiers
International Conference on Learning Representations (ICLR), Rio de Janeiro, Brazil, 2026
Sample-Efficient Alignment for LLMs
Preprint, 2024
Service
- Area Chair ICLR 2026, ACL 2026
- Reviewer ICML, NeurIPS, ICLR, AISTATS, CVPR, ICCV, BMVC, COLM, AAAI, SIGKDD, TPAMI