评分 7 · 来源:cs.AI updates on arXiv.org · 发布于 2026-04-29
评分依据:Rigorous measurement of how much harness contributes vs LLM in planning agents, critical for understanding agent design tradeoffs
How Much Heavy Lifting Can an Agent Harness Do?: Measuring LLM Residual Role in Planning Agent
发布
采集
学术前沿 7.0 分
— Rigorous measurement of how much harness contributes vs LLM in planning agents, critical for understanding agent design tradeoffs