Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 48
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 66
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback Paper • 2410.19133 • Published Oct 24, 2024 • 11
Do NLP Models Know Numbers? Probing Numeracy in Embeddings Paper • 1909.07940 • Published Sep 17, 2019
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs Paper • 1903.00161 • Published Mar 1, 2019
One Embedder, Any Task: Instruction-Finetuned Text Embeddings Paper • 2212.09741 • Published Dec 19, 2022 • 4
HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation Paper • 2212.10315 • Published Dec 20, 2022 • 1
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering Paper • 2303.11897 • Published Mar 21, 2023
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics Paper • 2009.10795 • Published Sep 22, 2020
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging Paper • 2310.11564 • Published Oct 17, 2023 • 2
Fine-grained Hallucination Detection and Editing for Language Models Paper • 2401.06855 • Published Jan 12, 2024 • 4
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models Paper • 2310.01329 • Published Oct 2, 2023
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 Paper • 2311.10702 • Published Nov 17, 2023 • 20
Self-Instruct: Aligning Language Model with Self Generated Instructions Paper • 2212.10560 • Published Dec 20, 2022 • 9
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Paper • 2204.07705 • Published Apr 16, 2022 • 2
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Paper • 2310.11511 • Published Oct 17, 2023 • 78
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources Paper • 2306.04751 • Published Jun 7, 2023 • 5