LEMUR Decoding

AI & ML interests

None defined yet.

Recent Activity

pminervini authored a paper about 2 months ago

OpenSIR: Open-Ended Self-Improving Reasoner

pminervini authored a paper 3 months ago

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

pminervini authored a paper 3 months ago

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization

View all activity

pminervini

authored a paper about 2 months ago

OpenSIR: Open-Ended Self-Improving Reasoner

Paper • 2511.00602 • Published Nov 1, 2025 • 20

pminervini

authored 9 papers 3 months ago

Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs

Paper • 2410.15438 • Published Oct 20, 2024

PosterSum: A Multimodal Benchmark for Scientific Poster Summarization

Paper • 2502.17540 • Published Feb 24, 2025 • 3

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published Feb 9, 2025

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Paper • 2503.02812 • Published Mar 4, 2025 • 10

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain

Paper • 2307.03042 • Published Jul 6, 2023

An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering

Paper • 2503.23415 • Published Mar 30, 2025 • 1

MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction

Paper • 2204.04779 • Published Apr 10, 2022

PiCSAR: Probabilistic Confidence Selection And Ranking

Paper • 2508.21787 • Published Aug 29, 2025 • 4

Learning GUI Grounding with Spatial Reasoning from Visual Feedback

Paper • 2509.21552 • Published Sep 25, 2025 • 11

yuzhaouoe

authored a paper 3 months ago

Learning GUI Grounding with Spatial Reasoning from Visual Feedback

Paper • 2509.21552 • Published Sep 25, 2025 • 11

aryopg

authored a paper 4 months ago

PiCSAR: Probabilistic Confidence Selection And Ranking

Paper • 2508.21787 • Published Aug 29, 2025 • 4

aryopg

authored 4 papers 5 months ago

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published Feb 9, 2025

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain

Paper • 2307.03042 • Published Jul 6, 2023

Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them

Paper • 2507.10616 • Published Jul 13, 2025 • 1

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 27

pminervini

authored a paper 5 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 27

pminervini

authored 2 papers 8 months ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15, 2025 • 54

Neurosymbolic Diffusion Models

Paper • 2505.13138 • Published May 19, 2025 • 36

rohitsaxena

authored a paper 8 months ago

What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations

Paper • 2502.08279 • Published Feb 12, 2025 • 1