Generative Artificial Intelligence Reproducibility and Consensus

We performed a billion locality sensitive hash comparisons between artificially generated data samples to answer a critical question: can we reproduce the results of generative AI models? Reproducibility is one of the pillars of scientific research, underpinning verifiability, benchmarking, trust, and transparency. Furthermore, we extend this research by verifying the "correctness" of generative AI output in a non-deterministic, trustless, decentralized network. We generate millions of data samples from a variety of open source diffusion and large language models, and we describe the procedures for, and trade-offs between, generating more versus less deterministic output. Additionally, we analyze the outputs to provide empirical evidence for different parameterizations of tolerance and error bounds for verification. Our results show that, with a majority vote between three independent verifiers, we can detect perceptual collisions between generated images with over 99.89% probability and with less than a 0.0267% chance of intra-class collision. For large language models (LLMs), we reach 100% consensus using greedy methods or n-way beam searches, as demonstrated on different LLMs. In the context of generative AI training, we pinpoint and minimize the major sources of stochasticity and present gossip and synchronization training techniques for verifiability. Thus, this work provides a practical, solid foundation for verification, reproducibility, and consensus in generative AI applications.
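To make the verification idea concrete, the following is a minimal sketch of tolerance-based hash comparison combined with a three-verifier majority vote. It is not the authors' implementation: the function names, the integer encoding of hashes, and the default tolerance of 2 bits are all assumptions chosen for illustration, and the abstract does not state which hash function or threshold was used.

```python
from __future__ import annotations


def hamming_distance(a: int, b: int) -> int:
    """Count the bits that differ between two integer-encoded hashes."""
    return bin(a ^ b).count("1")


def verifier_accepts(claimed: int, regenerated: int, tolerance_bits: int) -> bool:
    """One verifier's vote: accept if the hashes differ in at most
    `tolerance_bits` bits (hypothetical tolerance parameter)."""
    return hamming_distance(claimed, regenerated) <= tolerance_bits


def majority_vote(votes: list[bool]) -> bool:
    """True when strictly more than half of the verifiers accepted."""
    return sum(votes) > len(votes) / 2


def verify_output(claimed: int, regenerated_hashes: list[int],
                  tolerance_bits: int = 2) -> bool:
    """Each verifier regenerates the output independently, hashes it,
    and compares against the claimed hash; the majority decides."""
    votes = [verifier_accepts(claimed, h, tolerance_bits)
             for h in regenerated_hashes]
    return majority_vote(votes)


if __name__ == "__main__":
    claimed = 0xB0
    # Two verifiers land within tolerance, one does not: accepted.
    print(verify_output(claimed, [0xB0, 0xB1, 0x4F]))
    # Only one verifier lands within tolerance: rejected.
    print(verify_output(claimed, [0x4F, 0x4F, 0xB0]))
```

A larger tolerance makes the check more forgiving of benign non-determinism but raises the chance that two unrelated outputs are treated as matching, which is the trade-off behind the tolerance and error-bound parameterizations mentioned above.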

