Gaperon: A Peppered English-French Generative Language Model Suite Paper • 2510.25771 • Published 4 days ago • 12 • 2
Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs Paper • 2510.20475 • Published 11 days ago • 1 • 2
The Art of Asking: Multilingual Prompt Optimization for Synthetic Data Paper • 2510.19806 • Published 11 days ago • 1 • 1
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models Paper • 2504.14366 • Published Apr 19 • 1 • 1
The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models Paper • 2510.13996 • Published 18 days ago • 6 • 2
Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian Paper • 2509.05668 • Published Sep 6 • 5 • 2
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications Paper • 2503.17247 • Published Mar 21 • 1 • 2
German4All - A Dataset and Model for Readability-Controlled Paraphrasing in German Paper • 2508.17973 • Published Aug 25 • 1 • 5
Tokens with Meaning: A Hybrid Tokenization Approach for NLP Paper • 2508.14292 • Published Aug 19 • 1 • 2
GLiClass: Generalist Lightweight Model for Sequence Classification Tasks Paper • 2508.07662 • Published Aug 11 • 9 • 2
Do Construction Distributions Shape Formal Language Learning In German BabyLMs? Paper • 2503.11593 • Published Mar 14 • 1 • 1
GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface Paper • 2507.18546 • Published Jul 24 • 24 • 11