Post_deepseek_r1
My new technical post, "From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning" (千呼万唤始出来:DeepSeek-R1 如何通过强化学习实现复杂推理), is now online! Both English and Chinese versions are available.