Tag: jailbreak-detection
All the articles with the tag "jailbreak-detection".
- 6.0
An Empirical Study of Multi-Generation Sampling for Jailbreak Detection in Large Language Models
真实条件下输出型jailbreak检测的实证研究,对比TF-IDF和生成不一致性检测器
All the articles with the tag "jailbreak-detection".
真实条件下输出型jailbreak检测的实证研究,对比TF-IDF和生成不一致性检测器