Manufacturing lines, service journeys, supply chains, and AI reasoning chains share a common challenge: attributing a terminal outcome to the intermediate stage that caused it. We establish an information-theoretic barrier to this credit assignment problem: the signal connecting early steps to final outcomes decays exponentially with depth, creating a critical horizon beyond which no algorithm can learn from endpoint data alone. We prove four results. First, a Signal Decay Bound: sample complexity for attributing outcomes to early stages grows exponentially in the number of intervening steps. Second, Width Limits: parallel rollouts provide only logarithmic relief, with correlation capping the effective number of independent samples. Third, an Objective Mismatch: additive reward aggregation optimizes the wrong quantity when sequential validity requires all steps to be correct. Fourth, Optimal Inspection Design: uniform checkpoint spacing is minimax-optimal under homogeneous signal attenuation, while a greedy algorithm yields optimal non-uniform schedules under heterogeneous attenuation. Together, these results provide a common analytical foundation for inspection design in operations and supervision design in AI.
翻译:制造流水线、服务流程、供应链与人工智能推理链面临一个共同挑战:如何将终端结果归因于引发该结果的中间阶段。我们为此信用分配问题建立了信息论屏障:连接早期步骤与最终结果的信号会随深度呈指数级衰减,从而形成一个关键视界,超出此视界后任何算法都无法仅凭端点数据进行学习。我们证明了四个结果。其一,信号衰减界限:将结果归因于早期阶段所需的样本复杂度会随中间步骤数量呈指数增长。其二,宽度限制:并行推演仅能提供对数级缓解,相关性会限制有效独立样本的数量。其三,目标失配:当序列有效性要求所有步骤均正确时,累加式奖励聚合所优化的目标量是错误的。其四,最优检验设计:在均匀信号衰减条件下,均匀检查点间距是极小极大最优的;而在非均匀衰减条件下,贪心算法可生成最优的非均匀调度方案。这些结果共同为操作领域的检验设计与人工智能领域的监督设计提供了统一的分析基础。