Tag: loss-functions
All the articles with the tag "loss-functions".
- 6.5
How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum
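Defines a family of losses via the Tsallis q-logarithm that interpolates between two poles, RLVR and density estimation, giving a unified view of post-training for reasoning models.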