Skip to content
星际流动

Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs

发布
采集
学术前沿 7.0 分 — Novel parallel self-refinement approach for test-time scaling beyond Best-of-N
原文: arxiv.org

评分 7 · 来源: · 发布于 2026-04-28

评分依据:Novel parallel self-refinement approach for test-time scaling beyond Best-of-N