评分依据:RL-based framework for conversational QA with interleaved reasoning-retrieval
ChatR1: Reinforcement Learning for Conversational Reasoning and Retrieval Augmented Question Answering
原文: arxiv.org
评分依据:RL-based framework for conversational QA with interleaved reasoning-retrieval