Crowd Density Estimation by Using Attention Based Capsule Network and Multi-Column CNN

Kizrak, Merve; BOLAT, Bülent

doi:10.1109/access.2021.3081529

Kizrak M. A., BOLAT B.

IEEE ACCESS, vol.9, pp.75435-75445, 2021 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 9
Publication Date: 2021
DOI: 10.1109/access.2021.3081529
Journal Name: IEEE ACCESS
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, INSPEC, Directory of Open Access Journals
Pages: pp.75435-75445
Keywords: Feature extraction, Estimation, Task analysis, Adaptation models, Distortion, Predictive models, Analytical models, Capsule attention, crowd counting, density map, multi-column CNN, CONVOLUTIONAL NEURAL-NETWORK, COUNTING PEOPLE, TRACKING, LOCALIZATION, RECOGNITION, MODEL
Yıldız Technical University Affiliated: Yes

Abstract

We propose a strategy for estimating the number of people in a crowd, one of the aims of crowd analysis, from static images or video frames. Early studies on crowd analysis relied on pixel-based and regression-based methods, whereas recent studies use Convolutional Neural Network (CNN) based models. However, it is still difficult to extract spatial information such as position, orientation, posture, and angular value for crowd estimation from a density map. This study uses capsule networks with the routing-by-agreement algorithm as an attention module. The proposed approach combines CNN and capsule network-based attention modules in a two-column deep neural network architecture. We compare the proposed approach with other state-of-the-art methods on five well-known datasets: UCF-QNRF, UCF_CC_50, UCSD, ShanghaiTech Part A, and WorldExpo'10.
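
The abstract names routing by agreement as the mechanism behind the attention module but does not describe it. As a rough illustration only, the following NumPy sketch shows the generic routing-by-agreement loop as it is commonly described for capsule networks. It is not the authors' implementation: the function names (`squash`, `route_by_agreement`), the tensor shapes, and the choice of three iterations are all assumptions made for this example.

```python
import numpy as np

def squash(s, axis=-1, eps=1e-9):
    """Shrink each vector's length into [0, 1) while keeping its direction."""
    sq_norm = np.sum(s ** 2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def softmax(x, axis):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def route_by_agreement(u_hat, num_iters=3):
    """Generic routing by agreement (illustrative, not the paper's code).

    u_hat: predictions from input capsules for each output capsule,
           shape (num_in, num_out, dim_out).
    Returns the output capsules (num_out, dim_out) and the coupling
    coefficients (num_in, num_out) used to compute them.
    """
    num_in, num_out, _ = u_hat.shape
    b = np.zeros((num_in, num_out))                # routing logits
    for _ in range(num_iters):
        c = softmax(b, axis=1)                     # each input spreads weight over outputs
        s = np.einsum("ij,ijd->jd", c, u_hat)      # weighted sum of predictions
        v = squash(s)                              # output capsule vectors
        b = b + np.einsum("ijd,jd->ij", u_hat, v)  # reward predictions that agree with v
    return v, c

# Example with random predictions (hypothetical sizes: 6 inputs, 3 outputs, 4 dims).
rng = np.random.default_rng(0)
v, c = route_by_agreement(rng.normal(size=(6, 3, 4)))
```

The coupling coefficients `c` act like attention weights: an input capsule whose prediction agrees with an output capsule contributes more to it on the next iteration.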