From Recall to Forgetting: Benchmarking Long-Term Memory for Personalized Agents

Published: April 23, 2026

Collected: April 23, 2026, 00:00

Industry news · Score 6.0: A long-term memory benchmark that goes beyond fact retrieval. It fills a gap in agent memory evaluation.

Original: arXiv

Scoring rationale: A long-term memory benchmark that goes beyond fact retrieval. It fills a gap in agent memory evaluation.

Avoiding Overthinking and Underthinking: Curriculum-Aware Budget Scheduling for LLMs

TriEx: A Game-based Tri-View Framework for Explaining Internal Reasoning in Multi-Agent LLMs