Can a micron sized sack of interacting molecules autonomously learn an internal model of a complex and fluctuating environment? We draw insights from control theory, machine learning theory, chemical reaction network theory, and statistical physics to develop a general architecture whereby a broad class of chemical systems can autonomously learn complex distributions. Our construction takes the form of a chemical implementation of machine learning's optimization workhorse: gradient descent on the relative entropy cost function. We show how this method can be applied to optimize any detailed balanced chemical reaction network and that the construction is capable of using hidden units to learn complex distributions. This result is then recast as a form of integral feedback control. Finally, due to our use of an explicit physical model of learning, we are able to derive thermodynamic costs and trade-offs associated to this process.
翻译:一个微米大小的分子相互作用团块能否自主地学习一个复杂且波动环境的内部模型?我们从控制理论、机器学习理论、化学反应网络理论和统计物理学中汲取洞见,开发出一种通用架构,使广泛的化学系统能够自主地学习复杂分布。我们的构造采用了机器学习优化核心方法——相对熵代价函数上的梯度下降的化学实现形式。我们展示了如何将此方法应用于优化任意详细平衡的化学反应网络,并且该构造能够利用隐含单元学习复杂分布。随后,这一结果被重新诠释为一种积分反馈控制形式。最后,由于我们使用了明确的学习物理模型,得以推导出与此过程相关的热力学代价与权衡。