arxiv:2506.18701
CHUNLI PENG
chunli-peng
AI & ML interests
None yet
Organizations
None yet
models
17
chunli-peng/Qwen2.5-1.5B-NS-GRPO
Text Generation
•
2B
•
Updated
•
3
chunli-peng/DeepSeek-R1-Distill-Qwen-1.5B-NS-GRPO
Text Generation
•
2B
•
Updated
•
10
chunli-peng/OpenRS-GRPO-sft-8-e10
Text Generation
•
2B
•
Updated
•
4
chunli-peng/OpenRS-GRPO-sft-8.5
Text Generation
•
2B
•
Updated
•
4
chunli-peng/OpenRS-GRPO-sft-9
Text Generation
•
2B
•
Updated
•
2
chunli-peng/OpenRS-GRPO-sft-8
Text Generation
•
2B
•
Updated
•
2
chunli-peng/OpenRS-GRPO-sft
Text Generation
•
2B
•
Updated
•
3
chunli-peng/OpenRS-GRPO-sft-7-wr15
Text Generation
•
2B
•
Updated
•
2
chunli-peng/OpenRS-GRPO-sft-7-wr5
Text Generation
•
2B
•
Updated
•
1
chunli-peng/OpenRS-GRPO-sft-6.9
Text Generation
•
2B
•
Updated
•
2