- ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality
  Paper • 2510.22037 • Published • 17
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 462
- The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
  Paper • 2509.26507 • Published • 522
- Scaling Language-Centric Omnimodal Representation Learning
  Paper • 2510.11693 • Published • 97
Clément Castellon
Clemspace
AI & ML interests
Reinforcement learning, Neural Architecture Search, Transformers
Recent Activity
Updated a collection 5 days ago: Bangers 2025