星际流动

FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models

Academic Frontier, score 6.0: FineSteer, a unified framework for fine-grained inference-time steering
Original source: cs.LG updates on arXiv.org

Score: 6 · Source: cs.LG updates on arXiv.org · Published: 2026-04-20

Scoring basis: FineSteer, a unified framework for fine-grained inference-time steering

Key points

arXiv:2604.15488v1 Announce Type: new Abstract: Large language models (LLMs) often exhibit undesirable behaviors, such as safety violations and hallucinations. Although inference-time steering offers a cost-effective way to adjust model behavior without updating its parameters, existing methods often fail to be simultaneously effective, utility-preserving, and training-efficient due to their rigid, one-size-fits-all designs and limited adaptability. In this work, we present FineSteer, a novel steering framework that decomposes inference-time steering into two complementary stages: conditional …

🤖 AI commentary

This paper offers important information for the AI field and merits the attention of industry practitioners.

