-
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
Paper • 2604.02268 • Published • 102 -
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning
Paper • 2603.05863 • Published • 6 -
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning
Paper • 2604.02721 • Published • 634 -
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 180
glenn ba
glennba
·
AI & ML interests
None yet
Recent Activity
liked a model about 1 month ago
openbmb/MiniCPM-V-4.6 liked a model about 1 month ago
Supertone/supertonic-3 updated a collection about 1 month ago
texts papersOrganizations
None yet