James-WYang 's Collections

Language Imbalance Driven Rewarding

All checkpoints for our work "Language Imbalance Driven Rewarding for Multilingual Self-improving", https://arxiv.org/abs/2410.08964