tiny-reasoning LLM trained via RL for reasoning tasks. CaptainHPY/Qwen2.5-7B-R1-Zero Text Generation • 5B • Updated Sep 16 • 6 CaptainHPY/Qwen2.5-7B-R1 Text Generation • 5B • Updated Sep 17 • 10 CaptainHPY/Qwen2.5-7B-R1-Zero-GGUF Text Generation • 8B • Updated Sep 17 • 23 CaptainHPY/Qwen2.5-7B-R1-GGUF Text Generation • 8B • Updated Sep 18 • 69
tiny-reasoning LLM trained via RL for reasoning tasks. CaptainHPY/Qwen2.5-7B-R1-Zero Text Generation • 5B • Updated Sep 16 • 6 CaptainHPY/Qwen2.5-7B-R1 Text Generation • 5B • Updated Sep 17 • 10 CaptainHPY/Qwen2.5-7B-R1-Zero-GGUF Text Generation • 8B • Updated Sep 17 • 23 CaptainHPY/Qwen2.5-7B-R1-GGUF Text Generation • 8B • Updated Sep 18 • 69