Supervised Virtual-to-Real Domain Adaptation for Object Detection Task using YOLO

Deep neural network shows excellent use in a lot of real-world tasks. One of the deep learning tasks is object detection. Well-annotated datasets will affect deep neural network accuracy. More data learned by deep neural networks will make the model more accurate. However, a well-annotated dataset is hard to find, especially in a specific domain. To overcome this, computer-generated data or virtual datasets are used. Researchers could generate many images with specific use cases also with its annotation. Research studies showed that virtual datasets could be used for object detection tasks. Nevertheless, with the usage of the virtual dataset, the model must adapt to real datasets, or the model must have domain adaptability features. We explored the domain adaptation inside the object detection model using a virtual dataset to overcome a few well-annotated datasets. We use VW-PPE dataset, using 5000 and 10000 virtual data and 220 real data. For model architecture, we used YOLOv4 using CSPDarknet53 as the backbone and PAN as the neck. The domain adaptation technique with fine-tuning only on backbone weight achieved a mean average precision of 74.457%.

翻译：深度神经网络在诸多实际任务中展现出卓越性能，其中目标检测是深度学习的重要应用领域之一。高质量标注数据集直接影响深度神经网络的精度，而模型学习的数据量越大，其准确性越高。然而，高质量标注数据集（尤其在特定领域）往往难以获取。为解决这一问题，计算机生成数据或虚拟数据集被广泛采用。研究者可针对特定场景生成大量图像及其对应标注。已有研究表明，虚拟数据集能够有效应用于目标检测任务。但使用虚拟数据集时，模型必须适应真实数据集，即具备域适应能力。本研究探索了基于虚拟数据集的目标检测域适应方法，以缓解高质量标注数据集匮乏的问题。我们采用VW-PPE数据集，使用5000张和10000张虚拟数据及220张真实数据进行实验。模型架构选用YOLOv4，以CSPDarknet53作为骨干网络，PAN作为颈部网络。通过仅在骨干网络权重上进行微调的域适应技术，模型平均精度均值达到74.457%。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日