评分依据:Long-term memory benchmark going beyond fact retrieval. Fills gap in agent memory evaluation.
From Recall to Forgetting: Benchmarking Long-Term Memory for Personalized Agents
发布
采集
行业动态 6.0 分
— Long-term memory benchmark going beyond fact retrieval. Fills gap in agent memory evaluation. 原文: arXiv