熵最优路径：迈向谦逊人工智能 (An entropy-optimal path to humble AI)

Progress of AI has led to a creation of very successful, but by no means humble models and tools, especially regarding (i) the huge and further exploding costs and resources they demand, and (ii) the over-confidence of these tools with the answers they provide. Here we introduce a novel mathematical framework for a non-equilibrium entropy-optimizing reformulation of Boltzmann machines based on the exact law of total probability. It results in the highly-performant, but much cheaper, gradient-descent-free learning framework with mathematically-justified existence and uniqueness criteria, and answer confidence/reliability measures. Comparisons to state-of-the-art AI tools in terms of performance, cost and the model descriptor lengths on a set of synthetic problems with varying complexity reveal that the proposed method results in more performant and slim models, with the descriptor lengths being very close to the intrinsic complexity scaling bounds for the underlying problems. Applying this framework to historical climate data results in models with systematically higher prediction skills for the onsets of La Ni\~na and El Ni\~no climate phenomena, requiring just few years of climate data for training - a small fraction of what is necessary for contemporary climate prediction tools.

翻译：人工智能的发展催生了极为成功但远非谦逊的模型与工具，尤其体现在两方面：(i) 其所需成本与资源极为庞大且持续激增；(ii) 这些工具对其提供的答案表现出过度自信。本文基于全概率定律，提出一种非平衡熵优化的玻尔兹曼机重构数学框架。该框架产生了一种高性能、低成本的免梯度下降学习范式，具备数学可证明的存在性与唯一性判据，以及答案置信度/可靠性度量方法。在一系列复杂度各异的合成问题上，与前沿人工智能工具在性能、成本及模型描述长度方面的对比表明：所提方法能生成性能更优且更精简的模型，其描述长度非常接近底层问题的本征复杂度缩放边界。将该框架应用于历史气候数据，所得模型对拉尼娜与厄尔尼诺气候现象起始时刻的预测能力呈现系统性提升，且仅需数年气候数据即可完成训练——这仅是当代气候预测工具所需训练数据的极小部分。

相关内容

关注 7093

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日