Skip to content
星际流动

OThink-SRR1: Search, Refine and Reasoning with Reinforced Learning for Large Language Models

发布
采集
行业动态 6.5 分 — Dynamic retrieval with refinement and RL-reasoning. Addresses noise and cost issues in multi-hop RAG.
原文: arXiv

评分 6.5 · 来源: · 发布于

评分依据:Dynamic retrieval with refinement and RL-reasoning. Addresses noise and cost issues in multi-hop RAG.