Tag: distribution-matching
All the articles with the tag "distribution-matching".
- 6.0
Guiding Distribution Matching Distillation with Gradient-Based Reinforcement Learning
将梯度RL融入Distribution Matching Distillation(DMD),解决样本评分与蒸馏轨迹冲突问题
All the articles with the tag "distribution-matching".
将梯度RL融入Distribution Matching Distillation(DMD),解决样本评分与蒸馏轨迹冲突问题