Tag: efficient-inference
All the articles with the tag "efficient-inference".
- 7.0
Highly Efficient and Effective LLMs with Multi-Boolean Architectures
用多核布尔参数表示LLM的新型二值化框架,无需全精度潜权重即可实现高效推理
All the articles with the tag "efficient-inference".
用多核布尔参数表示LLM的新型二值化框架,无需全精度潜权重即可实现高效推理