Weijing Huang
waleking
ยท
AI & ML interests
Language Models
Recent Activity
upvoted
a
paper
26 days ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
upvoted
a
paper
9 months ago
VAPO: Efficient and Reliable Reinforcement Learning for Advanced
Reasoning Tasks
liked
a dataset
11 months ago
OpenStellarTeam/Chinese-SimpleQA
Organizations
None yet