I'm a Computer Science PhD student at the University of Southern California, advised by Willie Neiswanger. I received my Master's and Bachelor's degrees from ShanghaiTech University.

My research centers on reinforcement learning and reasoning models / agents.

[email protected] · Scholar · GitHub · X · LinkedIn


Experience

Research Intern · Qwen · Summer 2026

Stable and efficient RL post-training for large language models.

Hosts: Guoyin Wang (Qwen Pilot Team), Hao Zhou (Qwen 3.7 Team)

Research Intern · Bespoke Labs · Winter 2025

Agentic RL post-training acceleration via reward / environment curation.

Hosts: Mahesh Sathiamoorthy, Alex Dimakis, Greg Durrett, Shreyas Pimpalgaonkar

Student Researcher · BluelightAI · Summer 2025

Pre-training cross-layer transcoders over reasoning models.

Hosts: John Carlsson, Gunnar Carlsson, Jakob Hansen


Selected Publications

Release

Reasoning LLM

Chiyu Ma, Shuo Yang, Kexin Huang, Jinda Lu, Haoming Meng, Shangshang Wang, Bolin Ding, Soroush Vosoughi, Guoyin Wang, Jingren Zhou.

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization.