Skip to content
星际流动

Are Video Reasoning Models Ready to Go Outside?

发布
采集
学术前沿 3.3 分 — Moderate AI relevance +novelty(2) +practical(2)
原文: cs.AI updates on arXiv.org

评分 3.3 · 来源:cs.AI updates on arXiv.org · 发布于 2026-04-15

评分依据:Moderate AI relevance +novelty(2) +practical(2)

arXiv:2603.10652v2 Announce Type: replace-cross Abstract: In real-world deployment, vision-language models often encounter disturbances such as weather, occlusion, and camera motion. Under such conditions, their understanding and reasoning degrade substantially, revealing a gap between clean, controlled (i.e., unperturbed) evaluation settings and real-world robustness. To address this limitation, we propose ROVA, a novel training framework that improves robustness by modeling a robustness-aware…