Scoring rationale: First continual learning benchmark for agent skill generation. Fills an important gap in agent evaluation.
SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks
Industry news, score: 6.0
Source: arXiv