arxiv:2602.06079
Liangyu Wang
ly4096
AI & ML interests
Efficient reinforcement learning (RL) for LLMs reasoning
Distributed training and inference of LLMs
Efficient algorithm and infrastructure design for LLMs
Recent Activity
upvoted a paper about 2 hours ago
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training authored a paper 3 months ago
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers submitted a paper 3 months ago
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers