icon
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? w/ Yang Yue of Tsinghua University
Arxiv_id: arXiv:2504.13837