There is no such thing as a tokenizer-free lunch By catherinearnett • Sep 25 • 84
An Analysis of Multilingual Models on Hugging Face By catherinearnett and 1 other • Sep 18 • 4
Best Practices for Open Multilingual LLM Evaluation By catherinearnett • May 7 • 3
Releasing the largest multilingual open pretraining dataset By Pclanglais and 2 others • Nov 13, 2024 • 104
wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett • Sep 27, 2024 • 50