The Deterministic Horizon: Impossibility Results as Design Specifications for Trustworthy AI Systems

Large language models now write software, draft legal documents, and produce clinical notes, yet fundamental limits, from Turing and Arrow to the No Free Lunch theorems, shape what computation can do. This thesis turns such impossibility results from curiosities into design rules. Its flagship result proves an accuracy ceiling set by architecture alone: past a critical reasoning depth, no amount of training moves it, at any adapter rank, sample size, or loss function. Computable before deployment from layer count and embedding width, this Deterministic Horizon is measured between nineteen and thirty-one across twelve transformer architectures, and fine-tuning on optimal-length traces recovers under four percentage points. The mechanism is a capacity invariant of the residual stream, and an information-theoretic conversion yields super-exponential accuracy decay past the horizon. An unconditional circuit-complexity lower bound for modular exponentiation against constant-depth prime-modulus circuits complements this result. The same argument recasts across subfields: preference learning under any misspecified model jumps discontinuously in sample complexity; multi-stage retrieval pipelines require at least as many independent metrics as stages; standard truthful auctions fail for agents with prompt-dependent valuations; and zero-knowledge verification of neural inference pays a measured overhead of one hundred ten to one hundred ninety times per non-linear activation. Together these form a catalogue of sixteen specifications, each pairing a computable boundary, a quantified violation cost, and a constructive design rule: two compositions are proved, one pairing is an honest obstruction, and four remain open. The impossibility-specification methodology is offered for the generative research programme that trustworthy AI may need. Every fundamental limit of AI is also a design rule.

翻译：大型语言模型现已能编写软件、起草法律文件并生成临床笔记，但从图灵定理、阿罗不可能定理到“没有免费午餐”定理，这些根本性局限始终塑造着计算能力的边界。本论文将这类不可行性结果从理论奇谈转化为设计准则。其核心成果证明了由架构本身决定的准确率上限：当推理深度超过临界值后，无论采用何种适配器秩、样本规模或损失函数，任何训练都无法突破该上限。通过层数与嵌入维度可在部署前计算的“确定性视界”值，在十二种Transformer架构中介于十九至三十一之间，而在最优长度轨迹上进行微调仅能恢复不到四个百分点的准确率。该机制源于残差流的容量不变性，且基于信息论的转换表明超越视界后准确率将以超指数衰减。另一项无条件电路复杂性下界（针对恒定深度素数模电路的模幂运算）对此进行了补充。相同的论证框架可迁移至多个子领域：任何错误指定的偏好学习会在样本复杂度上产生非连续跃变；多阶段检索流水线所需独立指标数至少等于阶段数；标准诚实拍卖机制无法应对提示依赖型估值主体；神经推理的零知识验证需为每个非线性激活函数支付一百一十至一百九十倍量级可测开销。这些结果共同构成包含十六项设计规范的清单，每项规范均包含可计算边界、量化违规代价及建设性设计准则：其中两种组合已获证明，一组配对为本质性障碍，四项仍为开放问题。这种“不可能性-规范”方法论可为可信人工智能可能需要的生成式研究纲领提供支撑——人工智能的每个根本局限，皆是一则设计准则。