Tag: reasoning-models

All the articles with the tag "reasoning-models".

6.0
ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation
2026年04月22日
· cs.CL updates on arXiv.org· 04/22 14:31 采集
ReflectMT：两阶段反思内化算法，将推理模型的显式反思内化到MT模型以兼顾质量和效率
7.0
TEMPO: Scaling Test-time Training for Large Reasoning Models
2026年04月22日
· cs.LG updates on arXiv.org· 04/22 14:31 采集
TEMPO：扩展大推理模型的test-time training能力，解决TTT方法快速plateau和奖励漂移问题

ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation