sian cao
sonald
AI & ML interests
AI, big data, OS
Recent Activity
upvoted
an
article
2 days ago
Deriving the DPO Loss from First Principles
upvoted
an
article
5 days ago
Deriving the PPO Loss from First Principles
upvoted
an
article
7 days ago
From GRPO to DAPO and GSPO: What, Why, and How