Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

Junjie Ma; Yaping Dai; Zhiyang Jia; Fuchun Sun; Yap Peng Tan; Jun Liu

doi:10.1016/j.patcog.2023.109585

Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

Junjie Ma, Yaping Dai, Zhiyang Jia, Fuchun Sun^*, Yap Peng Tan, Jun Liu

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

7 Citations (Scopus)

Abstract

Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.

Original language	English
Article number	109585
Journal	Pattern Recognition
Volume	141
DOIs	https://doi.org/10.1016/j.patcog.2023.109585
Publication status	Published - Sept 2023

Keywords

Crowd counting
Density estimation
Foreground enhancement
Multi-Pathway zooming

Access to Document

10.1016/j.patcog.2023.109585

Cite this

@article{b9576c70b4e0484cb3142d7668bd150e,

title = "Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement",

abstract = "Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.",

keywords = "Crowd counting, Density estimation, Foreground enhancement, Multi-Pathway zooming",

author = "Junjie Ma and Yaping Dai and Zhiyang Jia and Fuchun Sun and Tan, {Yap Peng} and Jun Liu",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Ltd",

year = "2023",

month = sep,

doi = "10.1016/j.patcog.2023.109585",

language = "English",

volume = "141",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

AU - Ma, Junjie

AU - Dai, Yaping

AU - Jia, Zhiyang

AU - Sun, Fuchun

AU - Tan, Yap Peng

AU - Liu, Jun

PY - 2023/9

Y1 - 2023/9

N2 - Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.

AB - Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.

KW - Crowd counting

KW - Density estimation

KW - Foreground enhancement

KW - Multi-Pathway zooming

UR - http://www.scopus.com/inward/record.url?scp=85152433578&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2023.109585

DO - 10.1016/j.patcog.2023.109585

M3 - Article

AN - SCOPUS:85152433578

SN - 0031-3203

VL - 141

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 109585

ER -

Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this