SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models

发布

2026年04月14日

采集 2026年04月14日 04:31

学术前沿 5.5 分 — 中等偏上：有一定信息增量和参考价值

评分 5.5 · 来源：cs.AI updates on arXiv.org · 发布于 2026-04-14

评分依据：中等偏上：有一定信息增量和参考价值

SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models

arXiv:2604.11530v1 Announce Type: cross Abstract: Vision-Language Models (VLM) have revolutionized multimodal learning by jointly processing visual and textual information. Yet, they face significant challenges due to the high computational and memory demands of processing long sequences of vision tokens. Many existing methods rely on local heuristics, such as attention scores or token norms. However, these criteria suffer from positional bias and information dispersion, limiting their ability…