Bringing Value Models Back: Generative Critics for Value Modeling in LLM Reinforcement Learning

发布

2026年04月14日

采集 2026年04月14日 04:31

学术前沿 7.0 分 — Strong position paper arguing for generative critics over discriminative ones in LLM RL. Directly relevant to current RLHF/RLVF trends and the shift away from value models.

原文： cs.AI updates on arXiv.org

评分 7 · 来源：cs.AI updates on arXiv.org · 发布于 2026-04-14

评分依据：Strong position paper arguing for generative critics over discriminative ones in LLM RL. Directly relevant to current RLHF/RLVF trends and the shift away from value models.

Skill-SD: Skill-Conditioned Self-Distillation for Multi-turn LLM Agents

Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models