Many Artificial Intelligence (AI) algorithms are inspired by physics and employ stochastic fluctuations. We connect these physics-inspired AI algorithms by unifying them under a single mathematical framework that we call Thermodynamic AI. Seemingly disparate algorithmic classes can be described by this framework, for example, (1) Generative diffusion models, (2) Bayesian neural networks, (3) Monte Carlo sampling and (4) Simulated annealing. Such Thermodynamic AI algorithms are currently run on digital hardware, ultimately limiting their scalability and overall potential. Stochastic fluctuations naturally occur in physical thermodynamic systems, and such fluctuations can be viewed as a computational resource. Hence, we propose a novel computing paradigm, where software and hardware become inseparable. Our algorithmic unification allows us to identify a single full-stack paradigm, involving Thermodynamic AI hardware, that could accelerate such algorithms. We contrast Thermodynamic AI hardware with quantum computing where noise is a roadblock rather than a resource. Thermodynamic AI hardware can be viewed as a novel form of computing, since it uses a novel fundamental building block. We identify stochastic bits (s-bits) and stochastic modes (s-modes) as the respective building blocks for discrete and continuous Thermodynamic AI hardware. In addition to these stochastic units, Thermodynamic AI hardware employs a Maxwell's demon device that guides the system to produce non-trivial states. We provide a few simple physical architectures for building these devices and we develop a formalism for programming the hardware via gate sequences. We hope to stimulate discussion around this new computing paradigm. Beyond acceleration, we believe it will impact the design of both hardware and algorithms, while also deepening our understanding of the connection between physics and intelligence.
翻译:许多人工智能(AI)算法源于物理学并利用随机涨落。我们通过将这些受物理启发的AI算法统一在一个数学框架下,称之为热力学人工智能。看似不同的算法类别均可由该框架描述,例如:(1)生成扩散模型、(2)贝叶斯神经网络、(3)蒙特卡洛采样和(4)模拟退火。此类热力学人工智能算法目前在数字硬件上运行,这从根本上限制了其可扩展性和整体潜力。随机涨落在物理热力学系统中自然发生,此类涨落可视为一种计算资源。因此,我们提出一种新型计算范式,其中软件与硬件密不可分。算法的统一使我们能够识别一个涉及热力学人工智能硬件的全栈范式,从而加速此类算法。我们将热力学人工智能硬件与量子计算进行对比——在量子计算中噪声是障碍而非资源。热力学人工智能硬件可被视为一种新型计算形式,因为它使用了新颖的基本构建模块。我们将随机比特和随机模式分别确定为离散与连续型热力学人工智能硬件的构建模块。除这些随机单元外,热力学人工智能硬件还采用一种麦克斯韦妖器件,用于引导系统产生非平凡状态。我们提供了构建此类器件的几种简单物理架构,并开发了一种通过门序列对硬件进行编程的形式化方法。我们期望激发围绕这一新型计算范式的讨论。除加速之外,我们相信它将影响硬件与算法的设计,同时深化我们对物理与智能之间联系的理解。