I'm a Computer Science PhD student at the University of Southern California, advised by Willie Neiswanger. I received my Master's and Bachelor's degrees from ShanghaiTech University.
My research centers on reinforcement learning and reasoning models / agents.
[email protected] · Scholar · GitHub · X · LinkedIn
Research Intern · Qwen · Summer 2026
Stable and efficient RL post-training for large language models.
Hosts: Guoyin Wang (Qwen Pilot Team), Hao Zhou (Qwen 3.7 Team)
Research Intern · Bespoke Labs · Winter 2025
Agentic RL post-training acceleration via reward / environment curation.
Hosts: Mahesh Sathiamoorthy, Alex Dimakis, Greg Durrett, Shreyas Pimpalgaonkar
Student Researcher · BluelightAI · Summer 2025
Pre-training cross-layer transcoders over reasoning models.
Hosts: John Carlsson, Gunnar Carlsson, Jakob Hansen
Chiyu Ma, Shuo Yang, Kexin Huang, Jinda Lu, Haoming Meng, Shangshang Wang, Bolin Ding, Soroush Vosoughi, Guoyin Wang, Jingren Zhou.
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization.