1 36 12

Naman Anand

naman5a

AI & ML interests

RAG , LLMs

Recent Activity

upvoted a paper 4 days ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

upvoted an article 9 days ago

Automatic Prompt Optimization with DSPy and Cross Encoders

upvoted an article 15 days ago

We Got Claude to Fine-Tune an Open Source LLM

View all activity

Organizations

upvoted a paper 4 days ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published 10 days ago • 82

upvoted an article 9 days ago

Article

Automatic Prompt Optimization with DSPy and Cross Encoders

Aug 2

•

upvoted an article 15 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

17 days ago

•

520

commented on Continuous batching from first principles 24 days ago

Love this article :) @ArthurZ

upvoted an article 24 days ago

Article

Continuous batching from first principles

26 days ago

•

281

upvoted 2 articles 25 days ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Jun 3

•

Article

20x Faster TRL Fine-tuning with RapidFire AI

30 days ago

•

upvoted a collection 4 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 103

commented a paper 4 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263 •

upvoted a paper 4 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263

liked a model 5 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26 • 7.35M • • 4.09k

upvoted 4 articles 6 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

Feb 14, 2020

•

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

Apr 16

•

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

•

186

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

•

712

upvoted a paper 7 months ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26 • 91

liked a model 7 months ago

nvidia/parakeet-tdt-0.6b-v2

Automatic Speech Recognition • Updated 23 days ago • 557k • 1.39k

liked a model 8 months ago

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17 • 82.6k • 1.6k

upvoted a collection 8 months ago

GLM-4-0414

Collection

GLM-4-0414 series model • 8 items • Updated Jun 30 • 133

liked a model 8 months ago

nari-labs/Dia-1.6B

Text-to-Speech • Updated Jun 1 • 120k • • 2.81k

Naman Anand

AI & ML interests

Recent Activity

Organizations

naman5a's activity

Automatic Prompt Optimization with DSPy and Cross Encoders

We Got Claude to Fine-Tune an Open Source LLM

Continuous batching from first principles

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

20x Faster TRL Fine-tuning with RapidFire AI

How to train a new language model from scratch using Transformers and Tokenizers

Introducing HELMET: Holistically Evaluating Long-context Language Models

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Finally, a Replacement for BERT: Introducing ModernBERT