arxiv:2412.01800
hangyu guo
Rosiness
AI & ML interests
Natural Language Processing
Recent Activity
upvoted
a
paper
3 days ago
mHC: Manifold-Constrained Hyper-Connections
upvoted
a
paper
5 days ago
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
upvoted
a
paper
11 days ago
Scaling Laws for Code: Every Programming Language Matters