Improving LLMs' Generalized Reasoning Abilities by Graph Problems Paper • 2507.17168 • Published Jul 23 • 1 • 1
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 10 days ago • 98 • 5
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 9 days ago • 93 • 4
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Paper • 2512.13586 • Published 9 days ago • 87 • 5
s21mind/HexaMind-Llama-3.1-8B-v25-Generalist Text Generation • 8B • Updated 17 days ago • 52 • 1
s21mind/HexaMind-Llama-3.1-8B-v25-Generalist Text Generation • 8B • Updated 17 days ago • 52 • 1