MastermindEval: A Simple But Scalable Reasoning Benchmark Paper • 2503.05891 • Published Mar 7 • 1
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models Paper • 2504.14366 • Published Apr 19 • 1
Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements Paper • 2511.05560 • Published Nov 4 • 1
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data Paper • 2412.10121 • Published Dec 13, 2024 • 2
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models Paper • 2412.15978 • Published Dec 20, 2024 • 1
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 35
OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs Paper • 2309.03876 • Published Sep 7, 2023 • 3
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs Paper • 2309.09582 • Published Sep 18, 2023 • 4
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Paper • 2206.15076 • Published Jun 30, 2022 • 5