DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs

This paper revives Densely Connected Convolutional Networks (DenseNets) and reveals the underrated effectiveness over predominant ResNet-style architectures. We believe DenseNets' potential was overlooked due to untouched training methods and traditional design elements not fully revealing their capabilities. Our pilot study shows dense connections through concatenation are strong, demonstrating that DenseNets can be revitalized to compete with modern architectures. We methodically refine suboptimal components - architectural adjustments, block redesign, and improved training recipes towards widening DenseNets and boosting memory efficiency while keeping concatenation shortcuts. Our models, employing simple architectural elements, ultimately surpass Swin Transformer, ConvNeXt, and DeiT-III - key architectures in the residual learning lineage. Furthermore, our models exhibit near state-of-the-art performance on ImageNet-1K, competing with the very recent models and downstream tasks, ADE20k semantic segmentation, and COCO object detection/instance segmentation. Finally, we provide empirical analyses that uncover the merits of the concatenation over additive shortcuts, steering a renewed preference towards DenseNet-style designs. Our code is available at https://github.com/naver-ai/rdnet.

翻译：本文重新审视了密集连接卷积网络（DenseNets），并揭示了其在主流ResNet类架构之上被低估的有效性。我们认为，由于未优化的训练方法和传统设计元素未能充分展现其潜力，DenseNets的优势长期被忽视。我们的初步研究表明，通过拼接实现的密集连接具有强大性能，证明DenseNets能够通过革新与当代架构竞争。我们系统性地改进了次优组件——包括结构调整、模块重新设计以及改进的训练方案，在保持拼接捷径的同时扩展DenseNets的宽度并提升内存效率。采用简洁架构元素的模型最终超越了残差学习谱系中的关键架构：Swin Transformer、ConvNeXt和DeiT-III。此外，我们的模型在ImageNet-1K上展现出接近最先进的性能，与最新模型及下游任务（包括ADE20k语义分割、COCO目标检测/实例分割）相竞争。最后，我们通过实证分析揭示了拼接机制相较于加性捷径的优势，从而引导学界重新关注DenseNet式设计。代码已开源：https://github.com/naver-ai/rdnet。

相关内容

DenseNet

关注 0

作为CVPR2017年的Best Paper, DenseNet脱离了加深网络层数(ResNet)和加宽网络结构(Inception)来提升网络性能的定式思维,从特征的角度考虑,通过特征重用和旁路(Bypass)设置,既大幅度减少了网络的参数量,又在一定程度上缓解了gradient vanishing问题的产生.结合信息流和特征复用的假设,DenseNet当之无愧成为2017年计算机视觉顶会的年度最佳论文.

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日