FLAME Diffuser: Grounded Wildfire Image Synthesis using Mask Guided Diffusion

The rise of machine learning in recent years has brought benefits to various research fields such as wide fire detection. Nevertheless, small object detection and rare object detection remain a challenge. To address this problem, we present a dataset automata that can generate ground truth paired datasets using diffusion models. Specifically, we introduce a mask-guided diffusion framework that can fusion the wildfire into the existing images while the flame position and size can be precisely controlled. In advance, to fill the gap that the dataset of wildfire images in specific scenarios is missing, we vary the background of synthesized images by controlling both the text prompt and input image. Furthermore, to solve the color tint problem or the well-known domain shift issue, we apply the CLIP model to filter the generated massive dataset to preserve quality. Thus, our proposed framework can generate a massive dataset of that images are high-quality and ground truth-paired, which well addresses the needs of the annotated datasets in specific tasks.

翻译：近年来机器学习的兴起为野火检测等研究领域带来了诸多益处。然而，小目标检测和罕见目标检测仍然是挑战。为解决该问题，我们提出一种数据集生成框架，可利用扩散模型生成具有真实标注的配对数据集。具体而言，我们引入一种掩码引导扩散框架，能够将野火融合至现有图像中，同时精确控制火焰位置与尺寸。进一步地，为弥合特定场景野火图像数据集缺失的空白，我们通过控制文本提示与输入图像来改变合成图像的背景。此外，为解决色偏问题或已知的域偏移问题，我们应用CLIP模型对生成的大规模数据集进行筛选以保证质量。因此，我们所提出的框架能够生成高质量且具有真实标注配对的大规模数据集，充分满足特定任务对标注数据集的需求。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日