arxiv:2412.01800
hangyu guo
Rosiness
AI & ML interests
Natural Language Processing
Recent Activity
upvoted
a
paper
1 day ago
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
upvoted
a
paper
7 days ago
Scaling Laws for Code: Every Programming Language Matters