Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

Junjie Ma; Yaping Dai; Zhiyang Jia; Fuchun Sun; Yap Peng Tan; Jun Liu

doi:10.1016/j.patcog.2023.109585

Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

Junjie Ma, Yaping Dai, Zhiyang Jia, Fuchun Sun^*, Yap Peng Tan, Jun Liu

^*此作品的通讯作者

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

13 引用（Scopus）

摘要

Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.

源语言	英语
文章编号	109585
期刊	Pattern Recognition
卷	141
DOI	https://doi.org/10.1016/j.patcog.2023.109585
出版状态	已出版 - 9月 2023

访问文件

10.1016/j.patcog.2023.109585

其它文件与链接

链接到 Scopus 的出版物

引用此

Ma, J., Dai, Y., Jia, Z., Sun, F., Tan, Y. P., & Liu, J. (2023). Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement. Pattern Recognition, 141, 文章 109585. https://doi.org/10.1016/j.patcog.2023.109585

@article{b9576c70b4e0484cb3142d7668bd150e,

title = "Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement",

abstract = "Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.",

keywords = "Crowd counting, Density estimation, Foreground enhancement, Multi-Pathway zooming",

author = "Junjie Ma and Yaping Dai and Zhiyang Jia and Fuchun Sun and Tan, {Yap Peng} and Jun Liu",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Ltd",

year = "2023",

month = sep,

doi = "10.1016/j.patcog.2023.109585",

language = "English",

volume = "141",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

AU - Ma, Junjie

AU - Dai, Yaping

AU - Jia, Zhiyang

AU - Sun, Fuchun

AU - Tan, Yap Peng

AU - Liu, Jun

PY - 2023/9

Y1 - 2023/9

N2 - Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.

AB - Crowd counting is a challenging task due to many challenges such as scale variations and noisy background. To handle these challenges, we propose a novel framework named Multi-Pathway Zooming Network (MZNet) in this paper. The proposed framework recursively optimizes multi-scale features using multiple zooming pathways and progressively enhances the foreground information to improve crowd counting performance. Each zooming pathway comprises two zooming directions, zooming in and zooming out. Convolutional features at different resolutions are propagated to optimize the context information at each specific level. By sequentially integrating and interacting multi-observation information, the optimized features are powerful in handling the scale variation issue, and thus the crowd counting performance can be enhanced. To address the noisy background in many scenarios, we also introduce a new scheme to enhance the foreground information by incorporating a masked input image into the network, which is formed by a mask that element-wise multiplies with the original image. Finally, the context information, incorporated with an output density map, is recursively finetuned in our network to boost the counting performance. Extensive experiments evaluated on challenging benchmark datasets show competitive performances for both crowded and sparse scenarios.

KW - Crowd counting

KW - Density estimation

KW - Foreground enhancement

KW - Multi-Pathway zooming

UR - http://www.scopus.com/inward/record.url?scp=85152433578&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2023.109585

DO - 10.1016/j.patcog.2023.109585

M3 - Article

AN - SCOPUS:85152433578

SN - 0031-3203

VL - 141

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 109585

ER -

Crowd counting from single images using recursive multi-pathway zooming and foreground enhancement

摘要

访问文件

其它文件与链接

指纹

引用此