Tag: Speculative Decoding
All the articles with the tag "Speculative Decoding".
- 6.0
z-lab / dflash
DFlash — 基于 Block Diffusion 的 Flash Speculative Decoding 推理加速方案,Python 实现
All the articles with the tag "Speculative Decoding".
DFlash — 基于 Block Diffusion 的 Flash Speculative Decoding 推理加速方案,Python 实现