跳到主要导航 跳到搜索 跳到主要内容

Measurements-to-Tokens: A pure Transformer for image-free recognition at arbitrary sampling ratios

  • Jia Shuai Mi
  • , Hu Jiang
  • , Yu Xiao Wei
  • , Wen Bin Geng
  • , Wen Biao Xu
  • , Hui Juan Zhang
  • , Yuan Jin Yu*
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • Beijing Institute of Petrochemical Technology

科研成果: 期刊稿件文章同行评审

摘要

Derived from single-pixel imaging, image-free sensing enables efficient semantic interpretation directly from compressed measurements. However, existing deep learning methods are often constrained by fixed sampling ratios, and their architectural designs typically overlook the intrinsic physical modulation mechanisms of the sensing process. In this work, we introduce Measurements-to-Tokens (M2T), a unified framework leveraging a pure Transformer architecture for image-free recognition at arbitrary sampling ratios. Specifically, M2T adapts to varying sampling ratios by employing arbitrarily cropped long-range observation sequences. By treating the sensing process as a sequence modeling task, we explicitly integrate the physical correlation between intensity measurements and their corresponding modulation patterns into semantic tokens. This design enables the network to naturally process inputs of variable lengths, effectively decoupling the model architecture from specific sampling ratios. Extensive analysis demonstrates that M2T achieves state-of-the-art recognition accuracy and adapts to arbitrary sampling ratios. At a 1% sampling ratio, it reaches 96.51% average accuracy on MNIST and outperforms competing methods by 4.15 percentage points on average across two datasets, while remaining robust to noise.

源语言英语
文章编号115558
期刊Optics and Laser Technology
203
DOI
出版状态已出版 - 11月 2026
已对外发布

指纹

探究 'Measurements-to-Tokens: A pure Transformer for image-free recognition at arbitrary sampling ratios' 的科研主题。它们共同构成独一无二的指纹。

引用此