RAGEN-2: Reasoning Collapse in Agentic RL

发布

2026年04月09日

采集 2026年04月09日 00:00

学术前沿 8.0 分 — Agentic RL中的推理崩溃——Agent RL训练的核心警告：RL优化可能损害推理能力

原文： arXiv

评分 8 · 来源：arXiv · 发布于 2026-04-09

评分依据：Agentic RL中的推理崩溃——Agent RL训练的核心警告：RL优化可能损害推理能力

Invisible to Humans, Triggered by Agents: Stealthy Jailbreak on Mobile Vision-Language Agents

Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents