Patrick Haller PRO

PatrickHaller · HallerPatrick

AI & ML interests

NLP, Language Models, Autoregressive Models

Recent Activity

upvoted a paper 8 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

updated a collection 8 days ago

Distillled SmolLM2-1.7B

authored 3 papers about 1 month ago

MastermindEval: A Simple But Scalable Reasoning Benchmark

Paper • 2503.05891 • Published Mar 7 • 1

Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models

Paper • 2504.14366 • Published Apr 19 • 1

Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements

Paper • 2511.05560 • Published Nov 4 • 1

authored 2 papers 9 months ago

Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data

Paper • 2412.10121 • Published Dec 13, 2024 • 2

BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models

Paper • 2412.15978 • Published Dec 20, 2024 • 1

authored 5 papers over 1 year ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 35

PECC: Problem Extraction and Coding Challenges

Paper • 2404.18766 • Published Apr 29, 2024

OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs

Paper • 2309.03876 • Published Sep 7, 2023 • 3

Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

Paper • 2309.09582 • Published Sep 18, 2023 • 4

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

Paper • 2206.15076 • Published Jun 30, 2022 • 5