The proliferation of large-scale distributed systems, such as satellite constellations and high-performance computing clusters, demands robust communication primitives that maintain coordination under unreliable links. The torus topology, with its inherent rotational and reflection symmetries, is a prevalent architecture in these domains. However, conventional routing schemes suffer from substantial packet loss during control-plane synchronization after link failures. This paper introduces a symmetry-driven asynchronous forwarding mechanism that leverages the torus's geometric properties to achieve reliable packet delivery without control-plane coordination. We model packet flow using a topological potential gradient and demonstrate that symmetry-breaking failures naturally induce a reverse flow, which we harness for fault circumvention. We propose two local forwarding strategies, Reverse Flow with Counter-facing Priority (RF-CF) and Lateral-facing Priority (RF-LF), that guarantee reachability to the destination via forward-flow phase transition points, without protocol modifications or additional in-packet overhead. Through percolation analysis and packet-level simulations on a 16 x 16 torus, we show that our mechanism reduces packet loss by up to 17.5% under a 1% link failure rate, with the RF-LF strategy contributing to 28% of successfully delivered packets. This work establishes a foundational link between topological symmetry and communication resilience, providing a lightweight, protocol-agnostic substrate for enhancing distributed systems.
翻译:随着卫星星座和高性能计算集群等大规模分布式系统的普及,亟需在不可靠链路条件下保持协调的鲁棒通信原语。环面拓扑因其固有的旋转与反射对称性,已成为这些领域的主流架构。然而,传统路由方案在链路故障后的控制平面同步过程中存在严重的丢包问题。本文提出一种基于对称性的异步转发机制,利用环面的几何特性实现无需控制平面协调的可靠数据包传输。我们采用拓扑势梯度对数据包流进行建模,并证明对称性破缺故障会自然诱发反向流,我们利用该反向流实现故障规避。我们提出了两种本地转发策略:反向流-对向优先(RF-CF)与反向流-侧向优先(RF-LF),这两种策略能通过前向流相变点保证数据包可达目的地,且无需修改协议或增加包内开销。通过在16×16环面网络上进行渗流分析与数据包级仿真,我们证明该机制在1%链路故障率下可降低高达17.5%的丢包率,其中RF-LF策略贡献了28%的成功传输数据包。本研究建立了拓扑对称性与通信韧性之间的基础联系,为增强分布式系统提供了一种轻量级、协议无关的底层支撑。