MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision

Zhimin Zhu; Jianguo Zhao; Tong Mu; Yuliang Yang; Mengyu Zhu

doi:10.1007/978-3-031-44210-0_36

MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision

Zhimin Zhu, Jianguo Zhao, Tong Mu, Yuliang Yang^*, Mengyu Zhu

^*此作品的通讯作者

医学技术学院

University of Science and Technology Beijing

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.

源语言	英语
主期刊名	Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings
编辑	Lazaros Iliadis, Antonios Papaleonidas, Plamen Angelov, Chrisina Jayne
出版商	Springer Science and Business Media Deutschland GmbH
页	446-456
页数	11
ISBN（印刷版）	9783031442094
DOI	https://doi.org/10.1007/978-3-031-44210-0_36
出版状态	已出版 - 2023
活动	32nd International Conference on Artificial Neural Networks, ICANN 2023 - Heraklion, 希腊期限: 26 9月 2023 → 29 9月 2023

出版系列

姓名	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
卷	14255 LNCS
ISSN（印刷版）	0302-9743
ISSN（电子版）	1611-3349

会议

会议	32nd International Conference on Artificial Neural Networks, ICANN 2023
国家/地区	希腊
市	Heraklion
时期	26/09/23 → 29/09/23

访问文件

10.1007/978-3-031-44210-0_36

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhu, Z., Zhao, J., Mu, T., Yang, Y., & Zhu, M. (2023). MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. 在 L. Iliadis, A. Papaleonidas, P. Angelov, & C. Jayne (编辑), Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings (页码 446-456). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 14255 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-44210-0_36

Zhu, Zhimin ; Zhao, Jianguo ; Mu, Tong 等. / MC-MLP : A Multiple Coordinate Frames MLP-Like Architecture for Vision. Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. 编辑 / Lazaros Iliadis ; Antonios Papaleonidas ; Plamen Angelov ; Chrisina Jayne. Springer Science and Business Media Deutschland GmbH, 2023. 页码 446-456 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{6d886c5920b548e28973aad8cd8b3959,

title = "MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision",

abstract = "In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.",

keywords = "All-MLP Architecture, DCT, Hadamard Transform, Multiple Coordinate Frames, Orthogonal Transform",

author = "Zhimin Zhu and Jianguo Zhao and Tong Mu and Yuliang Yang and Mengyu Zhu",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.; 32nd International Conference on Artificial Neural Networks, ICANN 2023 ; Conference date: 26-09-2023 Through 29-09-2023",

year = "2023",

doi = "10.1007/978-3-031-44210-0_36",

language = "English",

isbn = "9783031442094",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "446--456",

editor = "Lazaros Iliadis and Antonios Papaleonidas and Plamen Angelov and Chrisina Jayne",

booktitle = "Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings",

address = "Germany",

}

Zhu, Z, Zhao, J, Mu, T, Yang, Y & Zhu, M 2023, MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. 在 L Iliadis, A Papaleonidas, P Angelov & C Jayne (编辑), Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 卷 14255 LNCS, Springer Science and Business Media Deutschland GmbH, 页码 446-456, 32nd International Conference on Artificial Neural Networks, ICANN 2023, Heraklion, 希腊, 26/09/23. https://doi.org/10.1007/978-3-031-44210-0_36

MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. / Zhu, Zhimin; Zhao, Jianguo; Mu, Tong 等.
Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. 编辑 / Lazaros Iliadis; Antonios Papaleonidas; Plamen Angelov; Chrisina Jayne. Springer Science and Business Media Deutschland GmbH, 2023. 页码 446-456 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 14255 LNCS).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - MC-MLP

T2 - 32nd International Conference on Artificial Neural Networks, ICANN 2023

AU - Zhu, Zhimin

AU - Zhao, Jianguo

AU - Mu, Tong

AU - Yang, Yuliang

AU - Zhu, Mengyu

PY - 2023

Y1 - 2023

N2 - In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.

AB - In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.

KW - All-MLP Architecture

KW - DCT

KW - Hadamard Transform

KW - Multiple Coordinate Frames

KW - Orthogonal Transform

UR - http://www.scopus.com/inward/record.url?scp=85174605720&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-44210-0_36

DO - 10.1007/978-3-031-44210-0_36

M3 - Conference contribution

AN - SCOPUS:85174605720

SN - 9783031442094

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 446

EP - 456

BT - Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings

A2 - Iliadis, Lazaros

A2 - Papaleonidas, Antonios

A2 - Angelov, Plamen

A2 - Jayne, Chrisina

PB - Springer Science and Business Media Deutschland GmbH

Y2 - 26 September 2023 through 29 September 2023

ER -

Zhu Z, Zhao J, Mu T, Yang Y, Zhu M. MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. 在 Iliadis L, Papaleonidas A, Angelov P, Jayne C, 编辑, Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. Springer Science and Business Media Deutschland GmbH. 2023. 页码 446-456. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-44210-0_36

MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此