Entropy-Gradient Inversion: Moving Toward Internal Mechanism of Large Reasoning Models

From arXiv: The authors are withdrawing this manuscript due to fundamental inaccuracies in the institutional affiliations and administrative attributions provided at the time of submission. As this version cannot be validated under the correct institutional framework, the authors request its formal withdrawal from the repository. No immediate replacement is intended.

The advancement of Large Reasoning Models (LRMs) has catalyzed a paradigm shift from reactive "fast thinking" text generation to systematic, step-by-step "slow thinking" reasoning, unlocking state-of-the-art performance in complex mathematical and logical tasks. However, the field faces *the fundamental gap between token-level behavioral analysis and internal reasoning mechanisms, and the instability of reinforcement learning (RL) for reasoning optimization relying on costly external verifiers*. We identify and formally define **Entropy-Gradient Inversion**, a robust negative correlation between token entropy and logit gradients that acts as a definitive geometric fingerprint for LRM reasoning capability. Building on this, we propose **Correlation-Regularized Group Policy Optimization (CorR-PO)**, which embeds this inversion signature into RL reward regularization. Extensive experiments on various reasoning benchmarks across multiple model scales show CorR-PO consistently outperforms state-of-the-art baselines, confirming that stronger inversion directly correlates with superior reasoning performance.
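The abstract describes the inversion only as a negative correlation between token entropy and logit gradients, without giving formulas. The sketch below shows one way such a correlation could be measured per token. Everything specific in it is an assumption rather than the authors' definition: the "logit gradient" is taken to be the L2 norm of the cross-entropy gradient with respect to the logits (softmax probabilities minus the one-hot target), the statistic is the Pearson coefficient, and the function names are hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over one token's logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def token_entropy(probs):
    """Shannon entropy (in nats) of one token's predictive distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def logit_grad_norm(probs, target):
    """L2 norm of d(cross-entropy)/d(logits) = probs - one_hot(target).

    Assumption: this stands in for the 'logit gradient' named in the abstract.
    """
    return math.sqrt(sum(
        (p - (1.0 if i == target else 0.0)) ** 2
        for i, p in enumerate(probs)
    ))

def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0.0 or vy == 0.0:
        return 0.0
    return cov / math.sqrt(vx * vy)

def entropy_gradient_correlation(logits_per_token, targets):
    """Correlate per-token entropy with per-token logit-gradient norm."""
    entropies, grad_norms = [], []
    for logits, target in zip(logits_per_token, targets):
        probs = softmax(logits)
        entropies.append(token_entropy(probs))
        grad_norms.append(logit_grad_norm(probs, target))
    return pearson(entropies, grad_norms)

# Toy input: three token positions over a four-word vocabulary (made-up values).
toy_logits = [[0.0, 0.0, 0.0, 0.0], [5.0, 0.0, 0.0, 0.0], [2.0, 1.0, 0.0, 0.0]]
toy_targets = [0, 0, 0]
print(entropy_gradient_correlation(toy_logits, toy_targets))
```

Under this reading, a value near -1 would indicate a strong inversion. The toy logits are illustrative only and are not expected to reproduce the negative correlation the abstract reports for trained LRMs.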

