Julien Klauss
zucco
·
AI & ML interests
None yet
Recent Activity
updated
a collection
about 2 months ago
VQ
updated
a collection
8 months ago
LLM
updated
a collection
over 1 year ago
LLM
Organizations
None yet
Better LLM datasets
MoE
Transformers
RAG
LLM
-
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper • 2402.13753 • Published • 116 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 78 -
Larimar: Large Language Models with Episodic Memory Control
Paper • 2403.11901 • Published • 33 -
Evolutionary Optimization of Model Merging Recipes
Paper • 2403.13187 • Published • 58
SSL
VQ
Better LLM datasets
Efficient
MoE
Speed
Transformers
ViT
RAG
Transfer
LLM
-
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper • 2402.13753 • Published • 116 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 78 -
Larimar: Large Language Models with Episodic Memory Control
Paper • 2403.11901 • Published • 33 -
Evolutionary Optimization of Model Merging Recipes
Paper • 2403.13187 • Published • 58
Agents