On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective

The pervasive uncertainty and dynamic nature of real-world environments present significant challenges for the widespread implementation of machine-driven Intelligent Decision-Making (IDM) systems. Consequently, IDM should possess the ability to continuously acquire new skills and effectively generalize across a broad range of applications. The advancement of Artificial General Intelligence (AGI) that transcends task and application boundaries is critical for enhancing IDM. Recent studies have extensively investigated the Transformer neural architecture as a foundational model for various tasks, including computer vision, natural language processing, and reinforcement learning. We propose that a Foundation Decision Model (FDM) can be developed by formulating diverse decision-making tasks as sequence decoding tasks using the Transformer architecture, offering a promising solution for expanding IDM applications in complex real-world situations. In this paper, we discuss the efficiency and generalization improvements offered by a foundation decision model for IDM and explore its potential applications in multi-agent game AI, production scheduling, and robotics tasks. Lastly, we present a case study demonstrating our FDM implementation, DigitalBrain (DB1) with 1.3 billion parameters, achieving human-level performance in 870 tasks, such as text generation, image captioning, video game playing, robotic control, and traveling salesman problems. As a foundation decision model, DB1 represents an initial step toward more autonomous and efficient real-world IDM applications.

翻译：真实世界环境中普遍存在的不确定性和动态性，给机器驱动的智能决策系统的广泛应用带来了重大挑战。因此，智能决策应具备持续获取新技能并有效泛化至广泛领域的能力。超越任务与应用边界的人工通用智能的进步，对于提升智能决策至关重要。近期研究广泛探索了Transformer神经架构作为计算机视觉、自然语言处理及强化学习等各类任务的基础模型。我们提出，通过将多样化决策任务形式化为基于Transformer架构的序列解码任务，可构建基础决策模型，这为扩展智能决策在复杂真实场景中的应用提供了有前景的解决方案。本文讨论了基础决策模型为智能决策带来的效率与泛化改进，并探索了其在多智能体游戏AI、生产调度及机器人任务中的潜在应用。最后，我们通过案例研究展示了所实现的基础决策模型——包含13亿参数的DigitalBrain（DB1），该模型在870项任务中达到人类水平性能，涵盖文本生成、图像描述、视频游戏操控、机器人控制及旅行商问题等。作为基础决策模型，DB1标志着向更自主、高效的真实世界智能决策应用迈出了初步一步。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/