Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 6 days ago • 81
ToNER: Type-oriented Named Entity Recognition with Generative Language Model Paper • 2404.09145 • Published Apr 14, 2024 • 1
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models Paper • 2408.02085 • Published Aug 4, 2024 • 19
Subgoal-based Hierarchical Reinforcement Learning for Multi-Agent Collaboration Paper • 2408.11416 • Published Aug 21, 2024
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models Paper • 2405.04960 • Published May 8, 2024 • 1
AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction Paper • 2409.01854 • Published Sep 3, 2024 • 1
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models Paper • 2506.01413 • Published Jun 2, 2025 • 16
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning Paper • 2509.22601 • Published Sep 26, 2025 • 29
LTD-Bench: Evaluating Large Language Models by Letting Them Draw Paper • 2511.02347 • Published Nov 4, 2025 • 8
RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning Paper • 2509.25958 • Published Sep 30, 2025
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents Paper • 2512.22322 • Published 10 days ago • 37
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 6 days ago • 81
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents Paper • 2512.22322 • Published 10 days ago • 37
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning Paper • 2509.22601 • Published Sep 26, 2025 • 29
Agent AI: Surveying the Horizons of Multimodal Interaction Paper • 2401.03568 • Published Jan 7, 2024 • 1
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models Paper • 2506.01413 • Published Jun 2, 2025 • 16