Recent advancements in generative machine learning have enabled rapid progress in biological design tools (BDTs) such as protein structure and sequence prediction models. The unprecedented predictive accuracy and novel design capabilities of BDTs present new and significant dual-use risks. For example, their predictive accuracy allows biological agents, whether vaccines or pathogens, to be developed more quickly, while the design capabilities could be used to discover drugs or evade DNA screening techniques. Similar to other dual-use AI systems, BDTs present a wicked problem: how can regulators uphold public safety without stifling innovation? We highlight how current regulatory proposals that are primarily tailored toward large language models may be less effective for BDTs, which require fewer computational resources to train and are often developed in an open-source manner. We propose a range of measures to mitigate the risk that BDTs are misused, across the areas of responsible development, risk assessment, transparency, access management, cybersecurity, and investing in resilience. Implementing such measures will require close coordination between developers and governments.
翻译:生成式机器学习的最新进展推动了生物设计工具(BDTs)的快速发展,例如蛋白质结构和序列预测模型。BDTs前所未有的预测精度和新型设计能力带来了新的重大双重用途风险。例如,其预测精度使得生物制剂(无论是疫苗还是病原体)能够更快地开发,而设计能力则可能被用于发现药物或规避DNA筛查技术。与其他具有双重用途的人工智能系统类似,BDTs提出了一个棘手的问题:监管者如何在维护公共安全的同时不扼杀创新?我们强调,当前主要针对大型语言模型的监管提案可能对BDTs效果较差,因为后者训练所需的计算资源更少,且通常以开源方式开发。我们提出了一系列措施,以降低BDTs被滥用的风险,涵盖负责任开发、风险评估、透明度、访问管理、网络安全以及增强韧性等领域。实施这些措施需要开发者与政府之间的密切协调。