Scale-Aware Crowd Count Network with Annotation Error Correction

Traditional crowd counting networks lose information when feature maps are downsized through pooling layers, which makes counts of distant crowds inaccurate. Existing methods often assume that annotations are correct during training and disregard the impact of noisy annotations, especially in crowded scenes. Furthermore, a fixed Gaussian kernel fails to account for the way pixel distribution varies with camera distance. To overcome these challenges, we propose a Scale-Aware Crowd Counting Network (SACC-Net) that introduces a "scale-aware" architecture with error-correcting capabilities for noisy annotations. For the first time, we simultaneously model labeling errors (mean) and scale variations (variance) with spatially-varying Gaussian distributions to produce fine-grained heat maps for crowd counting. In addition, the proposed adaptive Gaussian kernel variance lets the model learn dynamically with a low-rank approximation, which improves convergence efficiency at comparable accuracy. SACC-Net is extensively evaluated on four public datasets: UCF-QNRF, UCF_CC_50, NWPU, and ShanghaiTech A-B. Experimental results demonstrate that SACC-Net outperforms all state-of-the-art methods, validating its effectiveness in achieving superior crowd counting accuracy.
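
As a rough illustration of the idea of spatially-varying Gaussians, the sketch below builds a density map in which each annotated head gets its own mean shift (standing in for annotation error) and its own standard deviation (standing in for scale). It is not the SACC-Net implementation: the function name density_map, the offsets and sigmas arrays, and the isotropic kernel are all assumptions made for the example, and in the paper these quantities are learned rather than supplied by hand.

```python
import numpy as np

def density_map(shape, points, offsets, sigmas):
    """Sum one normalized 2-D Gaussian per annotated head.

    shape   -- (height, width) of the output map
    points  -- (N, 2) annotated (row, col) positions
    offsets -- (N, 2) hypothetical corrections added to each annotation (shifts the mean)
    sigmas  -- (N,) hypothetical per-point standard deviations (models scale)
    """
    h, w = shape
    rows, cols = np.mgrid[0:h, 0:w]
    out = np.zeros(shape, dtype=np.float64)
    for (r, c), (dr, dc), s in zip(points, offsets, sigmas):
        # Isotropic Gaussian centred on the corrected annotation (assumed form).
        g = np.exp(-((rows - (r + dr)) ** 2 + (cols - (c + dc)) ** 2) / (2.0 * s ** 2))
        total = g.sum()
        if total > 0:
            # Normalize on the grid so each head contributes exactly 1 to the count.
            out += g / total
    return out

# Two heads: a near-exact annotation with a small kernel (far from the camera)
# and a shifted annotation with a larger kernel (close to the camera).
points = np.array([[20.0, 20.0], [40.0, 45.0]])
offsets = np.array([[0.0, 0.0], [1.5, -2.0]])
sigmas = np.array([2.0, 5.0])

dmap = density_map((64, 64), points, offsets, sigmas)
count = dmap.sum()  # integrating the density map recovers the head count
```

Because every kernel is normalized on the grid, the integral of the map equals the number of annotated heads regardless of the offsets and variances chosen, which is the property a density-based counter relies on.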

