Variational Visual Question Answering for Uncertainty-Aware Selective Prediction

Published

2026-04-14

Collected 2026-04-14 04:31

Academic Frontier, score 5.3. Medium quality: a routine academic paper of moderate reference value.

Score 5.3 · Source: cs.AI updates on arXiv.org · Published 2026-04-14

arXiv:2505.09591v3 (announce type: replace-cross)

Abstract: Despite remarkable progress in recent years, Vision Language Models (VLMs) remain prone to overconfidence and hallucinations on tasks such as Visual Question Answering (VQA) and Visual Reasoning. Bayesian methods can potentially improve reliability by helping models predict selectively, that is, models respond only when they are sufficiently confident. Unfortunately, such approaches can be costly and ineffective for large models, and…
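
To make "predicting selectively" concrete, here is a minimal sketch of confidence-thresholded answering. It is not the paper's variational method: it simply abstains when the top softmax probability falls below a cutoff, then reports coverage (fraction of questions answered) and selective risk (error rate among answered questions). The function names, the threshold value, and the toy logits are all hypothetical, chosen only for illustration.

```python
import math


def softmax(logits):
    # Numerically stable softmax over a list of raw scores.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def selective_predict(logits, threshold=0.7):
    # Return (answer_index, confidence), or (None, confidence) to abstain.
    # Confidence here is the max softmax probability, an assumed proxy;
    # a Bayesian model would typically derive it from a posterior instead.
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    if probs[best] >= threshold:
        return best, probs[best]
    return None, probs[best]


def coverage_and_risk(batch_logits, labels, threshold=0.7):
    # Coverage: share of inputs answered. Risk: error rate on answered inputs.
    answered = 0
    errors = 0
    for logits, label in zip(batch_logits, labels):
        pred, _ = selective_predict(logits, threshold)
        if pred is None:
            continue  # abstained, so it counts against coverage only
        answered += 1
        errors += int(pred != label)
    coverage = answered / len(labels) if labels else 0.0
    risk = errors / answered if answered else 0.0
    return coverage, risk


if __name__ == "__main__":
    # Toy logits for three questions with three candidate answers each.
    toy_logits = [[5.0, 0.0, 0.0], [0.1, 0.0, 0.0], [0.0, 5.0, 0.0]]
    toy_labels = [0, 1, 0]
    print(coverage_and_risk(toy_logits, toy_labels, threshold=0.7))
```

Raising the threshold generally lowers coverage and, if the confidence scores are well calibrated, lowers risk as well; an overconfident model breaks that trade-off, which is the failure mode the abstract points to.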