Skip to content
星际流动

Sample Complexity of Autoregressive Reasoning: Chain-of-Thought vs. End-to-End

发布
采集
学术前沿 3.2 分 — Moderate AI relevance +novelty(1) +practical(2)
原文: cs.LG updates on arXiv.org

评分 3.2 · 来源:cs.LG updates on arXiv.org · 发布于 2026-04-15

评分依据:Moderate AI relevance +novelty(1) +practical(2)

arXiv:2604.12013v1 Announce Type: new Abstract: Modern large language models generate text autoregressively, producing tokens one at a time. To study the learnability of such systems, Joshi et al. (COLT 2025) introduced a PAC-learning framework for next-token generators, the primitive underlying autoregressive models. In this framework, an unknown next-token generator maps a sequence of tokens to the next token and is iteratively applied for $T$ steps, producing a chain of tokens whose final…