xunzhou

AI & ML interests

None yet

authored a paper 10 months ago

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21, 2025 • 15

authored a paper 11 months ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28, 2025 • 31

authored 2 papers over 1 year ago

Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 25

Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs

Paper • 2406.08657 • Published Jun 12, 2024 • 10

authored a paper over 2 years ago

GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond

Paper • 2309.16583 • Published Sep 28, 2023 • 12