As satellite networks evolve to support increasingly diverse services and artificial general intelligence (AGI), large language models (LLMs) are emerging as a critical foundation for future space systems. However, deploying LLMs on satellites is hindered by stringent constraints on onboard memory, computation, and energy. In this context, the mixture-of-experts (MoE) architecture emerges as a promising solution, leveraging sparse expert activation to enable scalable model inference. By harnessing the architectural advantages of MoE, this article provides a comprehensive overview of SpaceMoE, a new paradigm for distributed MoE inference in satellite networks. We first review recent industrial progress and emerging standardization trends that motivate the evolution toward space AGI systems. Then, we introduce the fundamentals and architectural evolution of SpaceMoE. Subsequently, we discuss three fundamental design problems in SpaceMoE, namely expert placement, expert selection, and hidden-state transmission and routing, highlighting how satellite-specific factors such as dynamic topology, battery degradation, and thermal limits fundamentally reshape their solutions. Finally, we outline promising research directions for realizing scalable, efficient, and sustainable on-orbit MoE inference in future satellite networks.
翻译:随着卫星网络不断演进以支持日益多样化的服务与通用人工智能(AGI),大语言模型(LLM)正成为未来空间系统的关键基础。然而,在卫星上部署LLM受限于星载内存、计算能力和能源的严格约束。在此背景下,混合专家(MoE)架构作为一种有前景的解决方案应运而生,其通过稀疏专家激活实现可扩展的模型推理。本文利用MoE的架构优势,全面综述了SpaceMoE——一种面向卫星网络分布式MoE推理的新范式。我们首先回顾了近期工业进展与新兴的标准化趋势,这些趋势推动了空间AGI系统的演进。接着,介绍了SpaceMoE的基本原理与架构演进。随后,讨论了SpaceMoE中的三个基本设计问题,即专家放置、专家选择以及隐状态传输与路由,强调了动态拓扑、电池衰减和热约束等卫星特有因素如何从根本上重塑其解决方案。最后,展望了在未来卫星网络中实现可扩展、高效且可持续的在轨MoE推理的若干研究方向。