arxiv:2502.13943
Junjie Lu
Lux0926
models 18
Lux0926/MetaMath-Mistral-7B-CGPO • 7B • Updated • 3
Lux0926/Qwen1.5-32B-SFT-CGPO • 33B • Updated • 5
Lux0926/MetaMath-Llama-8B-CGPO • 8B • Updated • 3
Lux0926/Qwen2-7B-SFT-CGPO • 8B • Updated • 3
Lux0926/MetaMath-Mistral-7B-Step-DPO • 7B • Updated • 5
Lux0926/MetaMath-Llama-8B-Step-DPO • 8B • Updated • 6
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO • 7B • Updated • 4
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO • 7B • Updated • 2
Lux0926/ASPRM-D-ORM • 7B • Updated • 6
Lux0926/LCD-DS • 7B • Updated • 7
datasets 16
Lux0926/Qwen2-7B-SFT-CGPO-10k • Viewer • Updated • 10.8k • 10
Lux0926/Qwen1.5-32B-SFT-CGPO-10k • Viewer • Updated • 10.8k • 8
Lux0926/DeepSeekMath-Base-7B-SFT-CGPO-10k • Viewer • Updated • 10.8k • 6
Lux0926/Deepseek-Coder-7B-Instruct-v1.5-CGPO-10k • Viewer • Updated • 10.6k • 5
Lux0926/MetaMath-Llama-8B-CGPO-10k • Viewer • Updated • 10.8k • 5
Lux0926/MetaMath-Mistral-7B-CGPO-10k • Viewer • Updated • 10.8k • 6
Lux0926/ASPRM-BON-Evaluation-Dataset-Code • Preview • Updated • 52
Lux0926/ASPRM-BON-Evaluation-Dataset-Math • Preview • Updated • 245
Lux0926/ASPRM-Math-Rollout-Result • Viewer • Updated • 215k • 35
Lux0926/ASPRM-MATHCODE-DeepSeek-Training-Dataset • Viewer • Updated • 99.8k • 9