Rethinking Few-Shot Medical Segmentation: A Vector Quantization View

Shiqi Huang; Tingfa Xu; Ning Shen; Feng Mu; Jianan Li

doi:10.1109/CVPR52729.2023.00300

Rethinking Few-Shot Medical Segmentation: A Vector Quantization View

Shiqi Huang, Tingfa Xu^*, Ning Shen, Feng Mu, Jianan Li

^*此作品的通讯作者

光电学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

12 引用（Scopus）

摘要

The existing few-shot medical segmentation networks share the same practice that the more prototypes, the better performance. This phenomenon can be theoretically interpreted in Vector Quantization (VQ) view: the more prototypes, the more clusters are separated from pixel-wise feature points distributed over the full space. However, as we further think about few-shot segmentation with this perspective, it is found that the clusterization of feature points and the adaptation to unseen tasks have not received enough attention. Motivated by the observation, we propose a learning VQ mechanism consisting of grid-format VQ (GFVQ), self-organized VQ (SOVQ) and residual oriented VQ (ROVQ). To be specific, GFVQ generates the prototype matrix by averaging square grids over the spatial extent, which uniformly quantizes the local details; SOVQ adaptively assigns the feature points to different local classes and creates a new representation space where the learnable local prototypes are updated with a global view; ROVQ introduces residual information to fine-tune the aforementioned learned local prototypes without retraining, which benefits the generalization performance for the irrelevance to the training task. We empirically show that our VQ framework yields the state-of-the-art performance over abdomen, cardiac and prostate MRI datasets and expect this work will provoke a rethink of the current few-shot medical segmentation model design. Our code will soon be publicly available.

源语言	英语
主期刊名	Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
出版商	IEEE Computer Society
页	3072-3081
页数	10
ISBN（电子版）	9798350301298
DOI	https://doi.org/10.1109/CVPR52729.2023.00300
出版状态	已出版 - 2023
活动	2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Vancouver, 加拿大期限: 18 6月 2023 → 22 6月 2023

出版系列

姓名	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
卷	2023-June
ISSN（印刷版）	1063-6919

会议

会议	2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
国家/地区	加拿大
市	Vancouver
时期	18/06/23 → 22/06/23

访问文件

10.1109/CVPR52729.2023.00300

其它文件与链接

链接到 Scopus 的出版物

引用此

Huang, S., Xu, T., Shen, N., Mu, F., & Li, J. (2023). Rethinking Few-Shot Medical Segmentation: A Vector Quantization View. 在 Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 (页码 3072-3081). (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2023-June). IEEE Computer Society. https://doi.org/10.1109/CVPR52729.2023.00300

@inproceedings{e35a645ac8c2427ba5f6cf005aef5033,

title = "Rethinking Few-Shot Medical Segmentation: A Vector Quantization View",

abstract = "The existing few-shot medical segmentation networks share the same practice that the more prototypes, the better performance. This phenomenon can be theoretically interpreted in Vector Quantization (VQ) view: the more prototypes, the more clusters are separated from pixel-wise feature points distributed over the full space. However, as we further think about few-shot segmentation with this perspective, it is found that the clusterization of feature points and the adaptation to unseen tasks have not received enough attention. Motivated by the observation, we propose a learning VQ mechanism consisting of grid-format VQ (GFVQ), self-organized VQ (SOVQ) and residual oriented VQ (ROVQ). To be specific, GFVQ generates the prototype matrix by averaging square grids over the spatial extent, which uniformly quantizes the local details; SOVQ adaptively assigns the feature points to different local classes and creates a new representation space where the learnable local prototypes are updated with a global view; ROVQ introduces residual information to fine-tune the aforementioned learned local prototypes without retraining, which benefits the generalization performance for the irrelevance to the training task. We empirically show that our VQ framework yields the state-of-the-art performance over abdomen, cardiac and prostate MRI datasets and expect this work will provoke a rethink of the current few-shot medical segmentation model design. Our code will soon be publicly available.",

keywords = "Medical and biological vision, cell microscopy",

author = "Shiqi Huang and Tingfa Xu and Ning Shen and Feng Mu and Jianan Li",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 ; Conference date: 18-06-2023 Through 22-06-2023",

year = "2023",

doi = "10.1109/CVPR52729.2023.00300",

language = "English",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "3072--3081",

booktitle = "Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023",

address = "United States",

}

Huang, S, Xu, T, Shen, N, Mu, F & Li, J 2023, Rethinking Few-Shot Medical Segmentation: A Vector Quantization View. 在 Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 卷 2023-June, IEEE Computer Society, 页码 3072-3081, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, 加拿大, 18/06/23. https://doi.org/10.1109/CVPR52729.2023.00300

Rethinking Few-Shot Medical Segmentation: A Vector Quantization View. / Huang, Shiqi; Xu, Tingfa; Shen, Ning 等.
Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023. IEEE Computer Society, 2023. 页码 3072-3081 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2023-June).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Rethinking Few-Shot Medical Segmentation

T2 - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023

AU - Huang, Shiqi

AU - Xu, Tingfa

AU - Shen, Ning

AU - Mu, Feng

AU - Li, Jianan

PY - 2023

Y1 - 2023

N2 - The existing few-shot medical segmentation networks share the same practice that the more prototypes, the better performance. This phenomenon can be theoretically interpreted in Vector Quantization (VQ) view: the more prototypes, the more clusters are separated from pixel-wise feature points distributed over the full space. However, as we further think about few-shot segmentation with this perspective, it is found that the clusterization of feature points and the adaptation to unseen tasks have not received enough attention. Motivated by the observation, we propose a learning VQ mechanism consisting of grid-format VQ (GFVQ), self-organized VQ (SOVQ) and residual oriented VQ (ROVQ). To be specific, GFVQ generates the prototype matrix by averaging square grids over the spatial extent, which uniformly quantizes the local details; SOVQ adaptively assigns the feature points to different local classes and creates a new representation space where the learnable local prototypes are updated with a global view; ROVQ introduces residual information to fine-tune the aforementioned learned local prototypes without retraining, which benefits the generalization performance for the irrelevance to the training task. We empirically show that our VQ framework yields the state-of-the-art performance over abdomen, cardiac and prostate MRI datasets and expect this work will provoke a rethink of the current few-shot medical segmentation model design. Our code will soon be publicly available.

AB - The existing few-shot medical segmentation networks share the same practice that the more prototypes, the better performance. This phenomenon can be theoretically interpreted in Vector Quantization (VQ) view: the more prototypes, the more clusters are separated from pixel-wise feature points distributed over the full space. However, as we further think about few-shot segmentation with this perspective, it is found that the clusterization of feature points and the adaptation to unseen tasks have not received enough attention. Motivated by the observation, we propose a learning VQ mechanism consisting of grid-format VQ (GFVQ), self-organized VQ (SOVQ) and residual oriented VQ (ROVQ). To be specific, GFVQ generates the prototype matrix by averaging square grids over the spatial extent, which uniformly quantizes the local details; SOVQ adaptively assigns the feature points to different local classes and creates a new representation space where the learnable local prototypes are updated with a global view; ROVQ introduces residual information to fine-tune the aforementioned learned local prototypes without retraining, which benefits the generalization performance for the irrelevance to the training task. We empirically show that our VQ framework yields the state-of-the-art performance over abdomen, cardiac and prostate MRI datasets and expect this work will provoke a rethink of the current few-shot medical segmentation model design. Our code will soon be publicly available.

KW - Medical and biological vision

KW - cell microscopy

UR - http://www.scopus.com/inward/record.url?scp=85173956228&partnerID=8YFLogxK

U2 - 10.1109/CVPR52729.2023.00300

DO - 10.1109/CVPR52729.2023.00300

M3 - Conference contribution

AN - SCOPUS:85173956228

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 3072

EP - 3081

BT - Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023

PB - IEEE Computer Society

Y2 - 18 June 2023 through 22 June 2023

ER -

Huang S, Xu T, Shen N, Mu F, Li J. Rethinking Few-Shot Medical Segmentation: A Vector Quantization View. 在 Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023. IEEE Computer Society. 2023. 页码 3072-3081. (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.1109/CVPR52729.2023.00300

Rethinking Few-Shot Medical Segmentation: A Vector Quantization View

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此