Context-aware Transformer Model for Crowd Localization

Yiming Gong, Kan Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 1
  • Captures
    • Readers: 2
see details

摘要

Because crowd density varies greatly in real scenes, detection-based methods are less reliable in crowded areas. Existing methods of applying detection-based transformer models to complete crowd localization are also subject to the same constraints. Moreover, there are many small targets in the scene of dense crowds, which is even more obvious. To address this issue, our model employs context-aware module to extract information that fuses different scales, thereby addressing the potential rapid scale change, and uses transformer to build an end-to-end crowd localization model. Extensive experiments show that our model adaptively learns contextual information for crowd localization, significantly outperforming previous more advanced models.

源语言英语
主期刊名2022 3rd International Conference on Computer Vision, Image and Deep Learning and International Conference on Computer Engineering and Applications, CVIDL and ICCEA 2022
出版商Institute of Electrical and Electronics Engineers Inc.
199-202
页数4
ISBN(电子版)9781665459112
DOI
出版状态已出版 - 2022
活动3rd International Conference on Computer Vision, Image and Deep Learning and International Conference on Computer Engineering and Applications, CVIDL and ICCEA 2022 - Virtual, Changchun, 中国
期限: 20 5月 202222 5月 2022

出版系列

姓名2022 3rd International Conference on Computer Vision, Image and Deep Learning and International Conference on Computer Engineering and Applications, CVIDL and ICCEA 2022

会议

会议3rd International Conference on Computer Vision, Image and Deep Learning and International Conference on Computer Engineering and Applications, CVIDL and ICCEA 2022
国家/地区中国
Virtual, Changchun
时期20/05/2222/05/22

指纹

探究 'Context-aware Transformer Model for Crowd Localization' 的科研主题。它们共同构成独一无二的指纹。

引用此

Gong, Y., & Li, K. (2022). Context-aware Transformer Model for Crowd Localization. 在 2022 3rd International Conference on Computer Vision, Image and Deep Learning and International Conference on Computer Engineering and Applications, CVIDL and ICCEA 2022 (页码 199-202). (2022 3rd International Conference on Computer Vision, Image and Deep Learning and International Conference on Computer Engineering and Applications, CVIDL and ICCEA 2022). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CVIDLICCEA56201.2022.9824361