Eason shi

Easonshi

Lightblues

AI & ML interests

None yet

Recent Activity

upvoted a paper about 19 hours ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

authored a paper 1 day ago

ToNER: Type-oriented Named Entity Recognition with Generative Language Model

authored a paper 1 day ago

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

View all activity

Organizations

None yet

upvoted a paper about 19 hours ago

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published 6 days ago • 81

authored 12 papers 1 day ago

ToNER: Type-oriented Named Entity Recognition with Generative Language Model

Paper • 2404.09145 • Published Apr 14, 2024 • 1

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Paper • 2408.02085 • Published Aug 4, 2024 • 19

Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration

Paper • 2408.11416 • Published Aug 21, 2024

P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models

Paper • 2405.04960 • Published May 8, 2024 • 1

AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction

Paper • 2409.01854 • Published Sep 3, 2024 • 1

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published Jun 2, 2025 • 16

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 29

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9, 2025 • 44

LTD-Bench: Evaluating Large Language Models by Letting Them Draw

Paper • 2511.02347 • Published Nov 4, 2025 • 8

RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning

Paper • 2509.25958 • Published Sep 30, 2025

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Paper • 2512.22322 • Published 10 days ago • 37

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published 6 days ago • 81

upvoted a paper 7 days ago

SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents

Paper • 2512.22322 • Published 10 days ago • 37

upvoted a paper 3 months ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 29

liked a Space 3 months ago

Academia MСP

👁

MCP tools related to the search of scientific papers

upvoted an article 3 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark 🏅

Jul 1, 2024

•

upvoted 2 papers 7 months ago

Agent AI: Surveying the Horizons of Multimodal Interaction

Paper • 2401.03568 • Published Jan 7, 2024 • 1

Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Paper • 2506.01413 • Published Jun 2, 2025 • 16

upvoted a collection 10 months ago

MoEs papers reading list

Collection

60 items • Updated Nov 4, 2024 • 145

Eason shi

AI & ML interests

Recent Activity

Organizations

Easonshi's activity

Academia MСP

Our Transformers Code Agent beats the GAIA benchmark 🏅