OThink-SRR1: Search, Refine and Reasoning with Reinforced Learning for Large Language Models

发布

2026年04月23日

采集 2026年04月23日 00:00

行业动态 6.5 分 — Dynamic retrieval with refinement and RL-reasoning. Addresses noise and cost issues in multi-hop RAG.

原文： arXiv

评分 6.5 · 来源： · 发布于

评分依据：Dynamic retrieval with refinement and RL-reasoning. Addresses noise and cost issues in multi-hop RAG.