Skip to content
星际流动

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

发布
采集
行业动态 7.5 分 — High-relevance AI research with significant insights or practical impact
原文: cs.LG updates on arXiv.org

评分 7.5 · 来源:cs.LG updates on arXiv.org · 发布于 2026-04-13

评分依据:High-relevance AI research with significant insights or practical impact