arxiv:2412.01800
hangyu guo
Rosiness
AI & ML interests
Natural Language Processing
Recent Activity
upvoted a paper 2 days ago
mHC: Manifold-Constrained Hyper-Connections
upvoted a paper 4 days ago
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
upvoted a paper 10 days ago
Scaling Laws for Code: Every Programming Language Matters