Constraining Generative Models for Engineering Design with Negative Data

Generative models have recently achieved remarkable success and widespread adoption in society, yet they often struggle to generate realistic and accurate outputs. This challenge extends beyond language and vision into fields like engineering design, where safety-critical engineering standards and non-negotiable physical laws tightly constrain what outputs are considered acceptable. In this work, we introduce a novel training method to guide a generative model toward constraint-satisfying outputs using `negative data' -- examples of what to avoid. Our negative-data generative model (NDGM) formulation easily outperforms classic models, generating 1/6 as many constraint-violating samples using 1/8 as much data in certain problems. It also consistently outperforms other baselines, achieving a balance between constraint satisfaction and distributional similarity that is unsurpassed by any other model in 12 of the 14 problems tested. This widespread superiority is rigorously demonstrated across numerous synthetic tests and real engineering problems, such as ship hull synthesis with hydrodynamic constraints and vehicle design with impact safety constraints. Our benchmarks showcase both the best-in-class performance of our new NDGM formulation and the overall dominance of NDGMs versus classic generative models. We publicly release the code and benchmarks at https://github.com/Lyleregenwetter/NDGMs.

翻译：生成模型近年来取得了显著成功并在社会各领域得到广泛应用，然而其生成结果往往难以同时满足真实性与精确性要求。这一挑战不仅存在于语言与视觉领域，更延伸至工程设计等专业领域——在工程设计中，安全关键性工程标准与不可违背的物理定律严格限定了可接受输出的范围。本研究提出一种创新的训练方法，通过引入"负数据"（即需要规避的示例）来引导生成模型产生满足约束条件的输出。我们提出的负数据生成模型（NDGM）框架在多项测试中显著优于经典模型：在某些问题中仅需1/8的数据量即可将约束违反样本数量降低至1/6。该模型在14个测试问题中的12个问题上持续超越其他基线方法，在约束满足度与分布相似性之间实现了当前最优的平衡。我们通过大量合成测试与真实工程问题（如考虑水动力约束的船体合成、满足碰撞安全约束的车辆设计）严谨验证了该方法的广泛优越性。基准测试既展示了新NDGM框架的顶尖性能，也证明了NDGM相较于经典生成模型的整体优势。相关代码与基准测试已公开发布于https://github.com/Lyleregenwetter/NDGMs。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日