Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face Paper • 2302.14534 • Published Feb 28, 2023 • 1
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages Paper • 2305.06897 • Published May 11, 2023 • 9
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration Paper • 2306.01481 • Published Jun 2, 2023 • 2
MasakhaNEWS: News Topic Classification for African languages Paper • 2304.09972 • Published Apr 19, 2023
AfroBench: How Good are Large Language Models on African Languages? Paper • 2311.07978 • Published Nov 14, 2023
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue Paper • 2204.10757 • Published Apr 22, 2022 • 1
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages Paper • 2210.09984 • Published Oct 18, 2022 • 2
NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation Paper • 2312.11361 • Published Dec 18, 2023 • 1
When Chosen Wisely, More Data Is What You Need: A Universal Sample-Efficient Strategy For Data Augmentation Paper • 2203.09391 • Published Mar 17, 2022 • 1
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution Paper • 2307.16883 • Published Jul 31, 2023