Optimizing object classification with single-pixel imaging and transformer networks

Yin Cheng, Yusen Liao, Xin Sun, Jun Ke*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traditional computer vision tasks rely on high-quality images for classification, but single-pixel imaging (SPI) uses compressive sensing with a bucket detector to simplify hardware and reduce costs. However, factors like atmospheric turbulence can degrade SPI image quality, lowering classification accuracy. To address this, a block-based single-pixel image-free transformer classification framework is proposed. It uses a learnable binary sampling matrix for flexible sampling during SPI and divides the measurements into blocks, which are processed by a transformer network. Each block is treated as a sequence element, allowing the transformer to capture spatial and contextual relationships. Experiments show strong robustness to atmospheric turbulence, achieving 87% classification accuracy even at a 1% sampling rate, making this method suitable for challenging environments.

Original languageEnglish
Title of host publicationFourth International Computational Imaging Conference, CITA 2024
EditorsXiaopeng Shao, Xiaopeng Shao
PublisherSPIE
ISBN (Electronic)9781510688834
DOIs
Publication statusPublished - 2025
Event4th International Computational Imaging Conference, CITA 2024 - Xiamen, China
Duration: 20 Sept 202422 Sept 2024

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13542
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference4th International Computational Imaging Conference, CITA 2024
Country/TerritoryChina
CityXiamen
Period20/09/2422/09/24

Keywords

  • Single-pixel imaging
  • ViT
  • atmospheric turbulence
  • classification

Fingerprint

Dive into the research topics of 'Optimizing object classification with single-pixel imaging and transformer networks'. Together they form a unique fingerprint.

Cite this