评分依据:First benchmark for true corpus-level reasoning beyond sparse retrieval assumption
CorpusQA: A 10 Million Token Benchmark for Corpus-Level Analysis and Reasoning
发布
采集
学术前沿 7.0 分
— First benchmark for true corpus-level reasoning beyond sparse retrieval assumption 原文: arxiv.org