Skip the Benchmark: Generating System-Level High-Level Synthesis Data using Generative Machine Learning

High-Level Synthesis (HLS) Design Space Exploration (DSE) is a widely accepted approach for efficiently exploring Pareto-optimal and optimal hardware solutions during the HLS process. Several HLS benchmarks and datasets are available for the research community to evaluate their methodologies. Unfortunately, these resources are limited and may not be sufficient for complex, multi-component system-level explorations. Generating new data using existing HLS benchmarks can be cumbersome, given the expertise and time required to effectively generate data for different HLS designs and directives. As a result, synthetic data has been used in prior work to evaluate system-level HLS DSE. However, the fidelity of the synthetic data to real data is often unclear, leading to uncertainty about the quality of system-level HLS DSE. This paper proposes a novel approach, called Vaegan, that employs generative machine learning to generate synthetic data that is robust enough to support complex system-level HLS DSE experiments that would be unattainable with only the currently available data. We explore and adapt a Variational Autoencoder (VAE) and Generative Adversarial Network (GAN) for this task and evaluate our approach using state-of-the-art datasets and metrics. We compare our approach to prior works and show that Vaegan effectively generates synthetic HLS data that closely mirrors the ground truth's distribution.
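As a minimal illustration of what "Pareto-optimal" means in the DSE setting described above, the sketch below filters a handful of design points down to the non-dominated set. It is not part of Vaegan: the two objectives (latency and area, both minimized), the `Design` type, the helper names `dominates` and `pareto_front`, and the numeric values are all assumptions chosen for the example.

```python
from typing import List, Tuple

# Hypothetical objective pair: (latency, area). Both are assumed to be minimized.
Design = Tuple[float, float]


def dominates(a: Design, b: Design) -> bool:
    """Return True if `a` is no worse than `b` in every objective
    and strictly better in at least one."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points: List[Design]) -> List[Design]:
    """Return the non-dominated points, preserving input order.

    Quadratic in the number of points, which is fine for a small sketch.
    """
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Made-up design points, for illustration only (not real HLS results).
designs = [
    (10.0, 500.0),
    (12.0, 300.0),
    (15.0, 320.0),  # dominated by (12.0, 300.0)
    (20.0, 250.0),
    (10.0, 600.0),  # dominated by (10.0, 500.0)
]

front = pareto_front(designs)
print(front)
```

In a system-level exploration, a filter of this kind would be applied to real or synthetic (latency, area) samples alike, which is why the fidelity of the synthetic samples to the real distribution matters for the resulting front.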

