Cross-Scene Crowd Counting via Deep Convolutional Neural Networks

Cross-scene crowd counting is a challenging task where no laborious data annotation is required for counting people in new target surveillance crowd scenes unseen in the training set. The performance of most existing crowd counting methods drops significantly when they are applied to an unseen scene. To address this problem, we propose a deep convolutional neural network (CNN) for crowd counting, and it is trained alternatively with two related learning objectives, crowd density and crowd count. This proposed switchable learning approach is able to obtain better local optimum for both objectives. To handle an unseen target crowd scene, we present a data-driven method to fine-tune the trained CNN model for the target scene. A new dataset including 108 crowd scenes with nearly 200,000 head annotations is introduced to better evaluate the accuracy of cross-scene crowd counting methods. Extensive experiments on the proposed and another two existing datasets demonstrate the effectiveness and reliability of our approach.

PDF Abstract
No code implementations yet. Submit your code now

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Crowd Counting UCF CC 50 Zhang et al. MAE 467.0 # 21

Results from Other Papers


Task Dataset Model Metric Name Metric Value Rank Source Paper Compare
Crowd Counting ShanghaiTech A Zhang et al. MAE 181.8 # 30
Crowd Counting ShanghaiTech B Zhang et al. MAE 32.0 # 27
Crowd Counting WorldExpo’10 Zhang et al. Average MAE 12.9 # 15

Methods


No methods listed for this paper. Add relevant methods here