Seq vs Seq: An Open Suite of Paired Encoders and Decoders Paper ⢠2507.11412 ⢠Published Jul 15 ⢠28
BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP Paper ⢠2506.10896 ⢠Published Jun 12 ⢠4
Multitask Prompted Training Enables Zero-Shot Task Generalization Paper ⢠2110.08207 ⢠Published Oct 15, 2021 ⢠2
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper ⢠2211.05100 ⢠Published Nov 9, 2022 ⢠34
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper ⢠2412.13663 ⢠Published Dec 18, 2024 ⢠156
Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling Paper ⢠2409.14683 ⢠Published Sep 23, 2024 ⢠12
Three Bricks to Consolidate Watermarks for Large Language Models Paper ⢠2308.00113 ⢠Published Jul 26, 2023 ⢠14