尺度不变目标检测：基于统一全局-局部上下文的自适应卷积方法 (Scale-Invariant Object Detection by Adaptive Convolution with Unified Global-Local Context)

Dense features are important for detecting minute objects in images. Unfortunately, despite the remarkable efficacy of the CNN models in multi-scale object detection, CNN models often fail to detect smaller objects in images due to the loss of dense features during the pooling process. Atrous convolution addresses this issue by applying sparse kernels. However, sparse kernels often can lose the multi-scale detection efficacy of the CNN model. In this paper, we propose an object detection model using a Switchable (adaptive) Atrous Convolutional Network (SAC-Net) based on the efficientDet model. A fixed atrous rate limits the performance of the CNN models in the convolutional layers. To overcome this limitation, we introduce a switchable mechanism that allows for dynamically adjusting the atrous rate during the forward pass. The proposed SAC-Net encapsulates the benefits of both low-level and high-level features to achieve improved performance on multi-scale object detection tasks, without losing the dense features. Further, we apply a depth-wise switchable atrous rate to the proposed network, to improve the scale-invariant features. Finally, we apply global context on the proposed model. Our extensive experiments on benchmark datasets demonstrate that the proposed SAC-Net outperforms the state-of-the-art models by a significant margin in terms of accuracy.

翻译：密集特征对于检测图像中的微小目标至关重要。然而，尽管CNN模型在多尺度目标检测中表现出卓越效能，但由于池化过程中密集特征的丢失，CNN模型常无法有效检测图像中的较小目标。空洞卷积通过应用稀疏卷积核来解决此问题，但稀疏卷积核往往会导致CNN模型的多尺度检测能力下降。本文基于efficientDet模型，提出了一种采用可切换（自适应）空洞卷积网络（SAC-Net）的目标检测模型。固定空洞率限制了CNN模型在卷积层中的性能表现。为克服此限制，我们引入了一种可切换机制，可在前向传播过程中动态调整空洞率。所提出的SAC-Net融合了低层与高层特征的优势，在保持密集特征不丢失的前提下，提升了多尺度目标检测任务的性能。此外，我们在网络中应用深度可切换空洞率以增强尺度不变特征。最后，我们在模型中引入全局上下文信息。通过在基准数据集上的大量实验表明，所提出的SAC-Net在检测精度方面显著优于现有最先进模型。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日