Test-Time Safety Alignment

发布

2026年04月30日

采集 2026年04月30日 06:33

学术前沿 6.0 分 — Shows input embeddings can control aligned model behavior at test time, extends beyond simple profanity reduction

原文： cs.LG updates on arXiv.org

评分 6 · 来源：cs.LG updates on arXiv.org · 发布于 2026-04-30

评分依据：Shows input embeddings can control aligned model behavior at test time, extends beyond simple profanity reduction

Unifying Sparse Attention with Hierarchical Memory for Scalable Long-Context LLM Serving

Lifting Embodied World Models for Planning and Control