Tag: reasoning-models
All the articles with the tag "reasoning-models".
- 6.0
ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation
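ReflectMT: a two-stage reflection-internalization algorithm that internalizes a reasoning model's explicit reflection into the MT model, balancing quality and efficiency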
- 7.0
TEMPO: Scaling Test-time Training for Large Reasoning Models
TEMPO: scales the test-time training (TTT) capability of large reasoning models, addressing the rapid plateau and reward drift problems of TTT methods