Skip to content
星际流动

The Illusion of Equivalence: Systematic FP16 Divergence in KV-Cached Autoregressive Inference

发布
采集
学术前沿 7.7 分 — Important systems finding: 100% FP16 token divergence in KV-cached inference
原文: cs.LG updates on arXiv.org

评分 7.7 · 来源:cs.LG updates on arXiv.org · 发布于 2026-04-20

评分依据:Important systems finding: 100% FP16 token divergence in KV-cached inference