Constructing the Umwelt: Cognitive Planning through Belief-Intent Co-Evolution

This paper challenges a prevailing epistemological assumption in End-to-End Autonomous Driving: that high-performance planning necessitates high-fidelity world reconstruction. Inspired by cognitive science, we propose the Mental Bayesian Causal World Model (MBCWM) and instantiate it as the Tokenized Intent World Model (TIWM), a novel cognitive computing architecture. Its core philosophy posits that intelligence emerges not from pixel-level objective fidelity, but from the Cognitive Consistency between the agent's internal intentional world and physical reality. By synthesizing von Uexküll's $\textit{Umwelt}$ theory, the neural assembly hypothesis, and the triple causal model (integrating symbolic deduction, probabilistic induction, and force dynamics) into an end-to-end embodied planning system, we demonstrate the feasibility of this paradigm on the nuPlan benchmark. Experimental results in open-loop validation confirm that our Belief-Intent Co-Evolution mechanism effectively enhances planning performance. Crucially, in closed-loop simulations, the system exhibits emergent human-like cognitive behaviors, including map affordance understanding, free exploration, and self-recovery strategies. We identify Cognitive Consistency as the core learning mechanism: during long-term training, belief (state understanding) and intent (future prediction) spontaneously form a self-organizing equilibrium through implicit computational replay, achieving semantic alignment between internal representations and physical world affordances. TIWM offers a neuro-symbolic, cognition-first alternative to reconstruction-based planners, establishing a new direction: planning as active understanding, not passive reaction.

翻译：本文挑战了端到端自动驾驶领域一个盛行的认识论假设：高性能规划必须依赖高保真的世界重建。受认知科学启发，我们提出了心智贝叶斯因果世界模型（MBCWM），并将其实例化为一种新颖的认知计算架构——Tokenized Intent World Model（TIWM）。其核心哲学观点认为，智能并非源于像素级的客观保真度，而是产生于智能体内部意图世界与物理现实之间的认知一致性。通过将冯·于克斯屈尔的感知世界理论、神经集群假说以及三重因果模型（整合符号演绎、概率归纳与力动力学）综合成一个端到端的具身规划系统，我们在nuPlan基准测试中验证了该范式的可行性。开环验证的实验结果证实，我们的信念-意图协同演化机制能有效提升规划性能。尤为关键的是，在闭环仿真中，系统展现出类人的涌现认知行为，包括地图可供性理解、自由探索与自我恢复策略。我们将认知一致性确认为核心学习机制：在长期训练过程中，信念（状态理解）与意图（未来预测）通过隐式计算回放自发形成自组织平衡，实现了内部表征与物理世界可供性之间的语义对齐。TIWM为基于重建的规划器提供了一种神经符号、认知优先的替代方案，开创了新的研究方向：规划即主动理解，而非被动反应。