Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning

Verifying whether a language model is genuinely reasoning or pattern-matching remains an open problem: learned verifiers are expensive, and output-based heuristics are brittle. We show that valid mathematical reasoning induces a measurable, training-free spectral signature in transformer attention. By treating each attention matrix as a weighted token graph, we extract four diagnostics: Fiedler value, High-Frequency Energy Ratio (HFER), spectral entropy, and smoothness, that require no learned parameters. Experiments across seven models from four architectural families yield effect sizes up to Cohen's $d = 3.30$ ($p < 10^{-116}$), enabling $85$--$96\%$ single-threshold classification accuracy. Two findings sharpen the interpretation. First, \emph{Platonic validity}: the spectral signal tracks logical coherence rather than compiler acceptance, proofs rejected for timeouts or missing imports are correctly classified as valid, a distinction confirmed by a manual audit ($κ= 0.82$, $n = 51$). Second, \emph{architectural determinism}: Sliding Window Attention shifts the discriminative feature from HFER to smoothness ($d = 2.09$, $p < 10^{-48}$), showing that attention design governs which spectral channel encodes reasoning quality. Causal ablation confirms the signature traces induction-head circuits. The method generalises to informal chain-of-thought ($d = 0.78$, $p < 10^{-3}$), and in proof search, HFER reranking improves Best-of-16 Pass@1 by $+4.4$--$6.6$\%, matching $98\%$ of the AUC of fully supervised probes with zero labels. Spectral graph analysis is a principled, architecture-aware primitive for reasoning verification.

翻译：验证语言模型是真正推理还是模式匹配仍是一个悬而未决的问题：学习型验证器成本高昂，且基于输出的启发式方法脆弱。我们证明，有效数学推理会在Transformer注意力中诱导出可测量、无需训练的谱特征。通过将每个注意力矩阵视为加权令牌图，我们提取了四个诊断指标：Fiedler值、高频能量比（HFER）、谱熵和平滑度，这些指标无需学习参数。在来自四个架构家族的七个模型上的实验表明，效应量高达Cohen's $d = 3.30$（$p < 10^{-116}$），实现了$85$--$96\%$的单阈值分类准确率。两个发现深化了理解。首先，*柏拉图式有效性*：谱信号追踪逻辑连贯性而非编译器接受度——因超时或缺失导入而被拒绝的证明被正确分类为有效，这一区分通过人工审计得到确认（$κ= 0.82$，$n = 51$）。其次，*架构决定论*：滑动窗口注意力将判别特征从HFER转移至平滑度（$d = 2.09$，$p < 10^{-48}$），表明注意力设计决定了哪个谱通道编码推理质量。因果消融实验确认该特征追踪归纳头电路。该方法泛化至非形式化思维链（$d = 0.78$，$p < 10^{-3}$），在证明搜索中，HFER重排序将Best-of-16 Pass@1提升了$+4.4$--$6.6\%$，在零标签情况下达到了完全监督探针AUC的$98\%$。谱图分析是一种有原则且架构感知的推理验证原语。