MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision

Zhimin Zhu; Jianguo Zhao; Tong Mu; Yuliang Yang; Mengyu Zhu

doi:10.1007/978-3-031-44210-0_36

Zhimin Zhu, Jianguo Zhao, Tong Mu, Yuliang Yang*, Mengyu Zhu

*Corresponding author for this work

School of Medical and Technology

University of Science and Technology Beijing

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.
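
The idea that an orthogonal transform amounts to a change of coordinate frame can be illustrated with a minimal sketch. It is not taken from the authors' code: the helper names (`dct_matrix`, `hadamard_matrix`, `apply`) and the sample feature vector are assumptions made for illustration, and the two transforms are chosen only because DCT and Hadamard Transform appear among the keywords. The sketch builds an orthonormal DCT-II matrix and a normalized Hadamard matrix, applies each to a feature vector, and checks that the vector's length is preserved and that the transpose maps it back.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II matrix of size n x n (illustrative helper)."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append(
            [scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n)) for i in range(n)]
        )
    return rows


def hadamard_matrix(n):
    """Normalized Sylvester-type Hadamard matrix; n must be a power of two."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = [[1.0]]
    while len(h) < n:
        # Doubling step: [[H, H], [H, -H]]
        h = [row + row for row in h] + [row + [-v for v in row] for row in h]
    s = 1.0 / math.sqrt(n)
    return [[s * v for v in row] for row in h]


def apply(q, x):
    """Matrix-vector product q @ x."""
    return [sum(a * b for a, b in zip(row, x)) for row in q]


def transpose(q):
    return [list(col) for col in zip(*q)]


def norm(x):
    return math.sqrt(sum(v * v for v in x))


# Hypothetical feature vector along one axis of a feature map.
features = [0.5, -1.0, 2.0, 0.25, 0.0, 1.5, -0.75, 3.0]

for name, q in (("DCT", dct_matrix(8)), ("Hadamard", hadamard_matrix(8))):
    y = apply(q, features)             # same information, new coordinate frame
    x_back = apply(transpose(q), y)    # the transpose of an orthogonal matrix is its inverse
    print(name, round(norm(features), 6), round(norm(y), 6))
```

Each printed line shows the vector norm before and after the transform; the two values should agree up to floating-point rounding, which is the sense in which an orthogonal transform only re-expresses the features in another frame. A fully-connected layer applied after such a transform therefore sees the same information through a different basis.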

Original language	English
Title of host publication	Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings
Editors	Lazaros Iliadis, Antonios Papaleonidas, Plamen Angelov, Chrisina Jayne
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	446-456
Number of pages	11
ISBN (Print)	9783031442094
DOIs	https://doi.org/10.1007/978-3-031-44210-0_36
Publication status	Published - 2023
Event	32nd International Conference on Artificial Neural Networks, ICANN 2023 - Heraklion, Greece
Duration	26 Sept 2023 → 29 Sept 2023

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	14255 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	32nd International Conference on Artificial Neural Networks, ICANN 2023
Country/Territory	Greece
City	Heraklion
Period	26/09/23 → 29/09/23

Keywords

All-MLP Architecture
DCT
Hadamard Transform
Multiple Coordinate Frames
Orthogonal Transform

Cite this

Zhu, Z., Zhao, J., Mu, T., Yang, Y., & Zhu, M. (2023). MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. In L. Iliadis, A. Papaleonidas, P. Angelov, & C. Jayne (Eds.), Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings (pp. 446-456). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14255 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-44210-0_36

Zhu, Zhimin ; Zhao, Jianguo ; Mu, Tong et al. / MC-MLP : A Multiple Coordinate Frames MLP-Like Architecture for Vision. Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. editor / Lazaros Iliadis ; Antonios Papaleonidas ; Plamen Angelov ; Chrisina Jayne. Springer Science and Business Media Deutschland GmbH, 2023. pp. 446-456 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{6d886c5920b548e28973aad8cd8b3959,

title = "{MC-MLP}: A Multiple Coordinate Frames {MLP}-Like Architecture for Vision",

abstract = "In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.",

keywords = "All-MLP Architecture, DCT, Hadamard Transform, Multiple Coordinate Frames, Orthogonal Transform",

author = "Zhimin Zhu and Jianguo Zhao and Tong Mu and Yuliang Yang and Mengyu Zhu",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.; 32nd International Conference on Artificial Neural Networks, ICANN 2023 ; Conference date: 26-09-2023 Through 29-09-2023",

year = "2023",

doi = "10.1007/978-3-031-44210-0_36",

language = "English",

isbn = "9783031442094",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

volume = "14255",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "446--456",

editor = "Lazaros Iliadis and Antonios Papaleonidas and Plamen Angelov and Chrisina Jayne",

booktitle = "Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings",

address = "Germany",

}

Zhu, Z, Zhao, J, Mu, T, Yang, Y & Zhu, M 2023, MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. in L Iliadis, A Papaleonidas, P Angelov & C Jayne (eds), Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 14255 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 446-456, 32nd International Conference on Artificial Neural Networks, ICANN 2023, Heraklion, Greece, 26/09/23. https://doi.org/10.1007/978-3-031-44210-0_36

MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. / Zhu, Zhimin; Zhao, Jianguo; Mu, Tong et al.
Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. ed. / Lazaros Iliadis; Antonios Papaleonidas; Plamen Angelov; Chrisina Jayne. Springer Science and Business Media Deutschland GmbH, 2023. p. 446-456 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14255 LNCS).

TY - GEN

T1 - MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision

T2 - 32nd International Conference on Artificial Neural Networks, ICANN 2023

AU - Zhu, Zhimin

AU - Zhao, Jianguo

AU - Mu, Tong

AU - Yang, Yuliang

AU - Zhu, Mengyu

PY - 2023

Y1 - 2023

N2 - In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.

AB - In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers. In MC-MLP, we propose that the same semantic information has varying levels of difficulty in learning, depending on the coordinate frame of features. To address this, we perform an orthogonal transform on the feature information, equivalent to changing the coordinate frame of features. Through this design, MC-MLP is equipped with multi-coordinate frame receptive fields and the ability to learn information across different coordinate frames. Experiments demonstrate that MC-MLP outperforms most MLPs in image classification tasks, achieving better performance at the same parameter level. The code will be available at: https://github.com/ZZM11/MC-MLP.

KW - All-MLP Architecture

KW - DCT

KW - Hadamard Transform

KW - Multiple Coordinate Frames

KW - Orthogonal Transform

UR - http://www.scopus.com/inward/record.url?scp=85174605720&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-44210-0_36

DO - 10.1007/978-3-031-44210-0_36

M3 - Conference contribution

AN - SCOPUS:85174605720

SN - 9783031442094

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 446

EP - 456

BT - Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings

A2 - Iliadis, Lazaros

A2 - Papaleonidas, Antonios

A2 - Angelov, Plamen

A2 - Jayne, Chrisina

PB - Springer Science and Business Media Deutschland GmbH

Y2 - 26 September 2023 through 29 September 2023

ER -

Zhu Z, Zhao J, Mu T, Yang Y, Zhu M. MC-MLP: A Multiple Coordinate Frames MLP-Like Architecture for Vision. In Iliadis L, Papaleonidas A, Angelov P, Jayne C, editors, Artificial Neural Networks and Machine Learning – ICANN 2023 - 32nd International Conference on Artificial Neural Networks, Proceedings. Springer Science and Business Media Deutschland GmbH. 2023. p. 446-456. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-44210-0_36
