Efficient Rationale-based Retrieval: On-policy Distillation from Generative Rerankers based on JEPA

发布

2026年04月28日

采集 2026年04月28日 10:31

学术前沿 7.0 分 — Revisits OPD formulation with theoretical analysis and empirical fixes for post-training

评分 7 · 来源： · 发布于 2026-04-28

评分依据：Revisits OPD formulation with theoretical analysis and empirical fixes for post-training