Jianhao Yan's picture

Jianhao Yan

Elliott

·

ElliottYan

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

upvoted a paper about 1 month ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

upvoted a paper 4 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

View all activity

Organizations

None yet

commented 2 papers about 1 year ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88 •

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29, 2025 • 99 •

New activity in Elliott/Qwen2.5-Math-7B-16k-think about 1 year ago

Add library name, pipeline tag, link to Github

#1 opened about 1 year ago by

New activity in Elliott/Openr1-Math-46k-8192 about 1 year ago

Add task category

#2 opened about 1 year ago by

New activity in Elliott/LUFFY-Qwen-Math-7B-Zero about 1 year ago

Correct pipeline tag and add Github link

#1 opened about 1 year ago by

commented 2 papers about 1 year ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88 •

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88 •