The development of Courses of Action (COAs) in military operations is traditionally a time-consuming and intricate process. Addressing this challenge, this study introduces COA-GPT, a novel algorithm employing Large Language Models (LLMs) for rapid and efficient generation of valid COAs. COA-GPT incorporates military doctrine and domain expertise to LLMs through in-context learning, allowing commanders to input mission information - in both text and image formats - and receive strategically aligned COAs for review and approval. Uniquely, COA-GPT not only accelerates COA development, producing initial COAs within seconds, but also facilitates real-time refinement based on commander feedback. This work evaluates COA-GPT in a military-relevant scenario within a militarized version of the StarCraft II game, comparing its performance against state-of-the-art reinforcement learning algorithms. Our results demonstrate COA-GPT's superiority in generating strategically sound COAs more swiftly, with added benefits of enhanced adaptability and alignment with commander intentions. COA-GPT's capability to rapidly adapt and update COAs during missions presents a transformative potential for military planning, particularly in addressing planning discrepancies and capitalizing on emergent windows of opportunities.
翻译:在军事行动中,行动方案(COA)的制定传统上是一个耗时且复杂的流程。针对这一挑战,本研究提出COA-GPT——一种利用大语言模型快速高效生成有效行动方案的新型算法。COA-GPT通过上下文学习将军事条令与领域专业知识融入大语言模型,使指挥官能够输入包含文本和图像格式的任务信息,并获得符合战略要求的行动方案以供审阅与批准。尤为独特的是,COA-GPT不仅能在数秒内生成初始行动方案以加速制定流程,还能基于指挥官的反馈进行实时优化。本研究在军事化版本的《星际争霸II》游戏中,于军事相关场景下对COA-GPT进行了评估,并将其性能与最先进的强化学习算法进行对比。结果表明,COA-GPT在更快速生成战略合理的行动方案方面具有显著优势,同时具备更强的适应性与指挥官意图一致性。COA-GPT在任务中快速调整和更新行动方案的能力,为军事规划带来了变革潜力,特别是在应对规划偏差和把握突发机遇窗口方面。