Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks

Enze Zhang; Boheng Zhang; Shaohan Hu; Fa Zhang; Xiaohua Wan

doi:10.1007/978-3-030-26763-6_43

Enze Zhang, Boheng Zhang, Shaohan Hu, Fa Zhang^*, Xiaohua Wan

^* Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

Proteins contribute significantly to most functions within cells and are essential to the physiological activities of every living organism. Microscopy imaging is used to observe and identify proteins in different kinds of cells, and the resulting analyses are critical to biomedical studies. However, with the development of high-throughput microscopy imaging, protein microscopy images are being generated faster than ever, making it harder for experts to identify them manually. To better mine and understand the protein information in these huge volumes of images, methods that identify mixed-patterned proteins within various cells automatically and accurately are urgently needed. In this paper, we design novel and effective data preparation and preprocessing methods for high-throughput microscopy protein datasets. We propose an ACP layer and “buffering” layers, and use them to design customized architectures for several typical CNN classifiers with new inputs and head parts. These modifications make the models more adaptive and accurate for our task. We train the models with more effective and efficient optimization strategies that we design, e.g., cycle learning with learning rate scheduling. In addition, greedy selection of thresholds and ensembling of multi-sized models in the post-processing stage are proposed to further improve prediction accuracy. Our experimental results on the Human Protein Atlas datasets demonstrate that the proposed methods perform excellently in mixed-patterned protein classification, exceeding the state-of-the-art architecture GapNet-PL by 0.02 to 0.03 in F1 score. Overall, the work demonstrates the usefulness of our methods for identifying proteins in high-throughput microscopy images.
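
The abstract names greedy selection of thresholds as a post-processing step but does not describe the procedure. As an illustration only, the following minimal sketch shows one common way such a step can be done for multi-label (mixed-pattern) predictions: thresholds are tuned one class at a time, and a candidate is kept whenever it raises the macro F1 score on held-out data. The function names (`f1_binary`, `macro_f1`, `greedy_thresholds`), the candidate grid, and the starting value of 0.5 are assumptions made for this sketch, not details taken from the paper.

```python
from typing import List, Sequence


def f1_binary(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 score for one class; returns 0.0 when there are no positives at all."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def macro_f1(labels: Sequence[Sequence[int]],
             probs: Sequence[Sequence[float]],
             thresholds: Sequence[float]) -> float:
    """Unweighted mean of per-class F1 after thresholding the probabilities."""
    scores = []
    for c, thr in enumerate(thresholds):
        y_true = [row[c] for row in labels]
        y_pred = [int(row[c] >= thr) for row in probs]
        scores.append(f1_binary(y_true, y_pred))
    return sum(scores) / len(scores)


def greedy_thresholds(labels: Sequence[Sequence[int]],
                      probs: Sequence[Sequence[float]],
                      candidates: Sequence[float],
                      start: float = 0.5) -> List[float]:
    """Tune one threshold per class, in class order (hypothetical procedure).

    For each class, every candidate is tried while the other thresholds stay
    fixed; the candidate is kept only if it strictly improves macro F1.
    """
    thresholds = [start] * len(labels[0])
    for c in range(len(thresholds)):
        best_t = thresholds[c]
        best_score = macro_f1(labels, probs, thresholds)
        for t in candidates:
            trial = thresholds[:c] + [t] + thresholds[c + 1:]
            score = macro_f1(labels, probs, trial)
            if score > best_score:
                best_t, best_score = t, score
        thresholds[c] = best_t
    return thresholds
```

In practice the thresholds would be selected on a validation split and then applied unchanged to test predictions, since tuning them on the test set would inflate the reported F1 score.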

Original language	English
Title of host publication	Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings
Editors	De-Shuang Huang, Vitoantonio Bevilacqua, Prashan Premaratne
Publisher	Springer Verlag
Pages	448-459
Number of pages	12
ISBN (Print)	9783030267629
DOI	https://doi.org/10.1007/978-3-030-26763-6_43
Publication status	Published - 2019
Externally published	Yes
Event	15th International Conference on Intelligent Computing, ICIC 2019 - Nanchang, China. Duration: 3 Aug 2019 → 6 Aug 2019

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11643 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	15th International Conference on Intelligent Computing, ICIC 2019
Country/Territory	China
City	Nanchang
Period	3/08/19 → 6/08/19

Cite this

Zhang, E., Zhang, B., Hu, S., Zhang, F., & Wan, X. (2019). Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks. In D.-S. Huang, V. Bevilacqua, & P. Premaratne (Eds.), Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings (pp. 448-459). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11643 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-030-26763-6_43

Zhang, Enze ; Zhang, Boheng ; Hu, Shaohan et al. / Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks. Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings. ed. / De-Shuang Huang ; Vitoantonio Bevilacqua ; Prashan Premaratne. Springer Verlag, 2019. pp. 448-459 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{8eae1ec1535642038ae42f3ec36d1838,

title = "Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks",

keywords = "Deep learning, High-throughput microscopy images, Mixed patterns of proteins, Protein classification",

author = "Enze Zhang and Boheng Zhang and Shaohan Hu and Fa Zhang and Xiaohua Wan",

note = "Publisher Copyright: {\textcopyright} 2019, Springer Nature Switzerland AG.; 15th International Conference on Intelligent Computing, ICIC 2019 ; Conference date: 03-08-2019 Through 06-08-2019",

year = "2019",

doi = "10.1007/978-3-030-26763-6_43",

language = "English",

isbn = "9783030267629",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "448--459",

editor = "De-Shuang Huang and Vitoantonio Bevilacqua and Prashan Premaratne",

booktitle = "Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings",

address = "Germany",

}

Zhang, E, Zhang, B, Hu, S, Zhang, F & Wan, X 2019, Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks. in D-S Huang, V Bevilacqua & P Premaratne (eds), Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11643 LNCS, Springer Verlag, pp. 448-459, 15th International Conference on Intelligent Computing, ICIC 2019, Nanchang, China, 3/08/19. https://doi.org/10.1007/978-3-030-26763-6_43

Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks. / Zhang, Enze; Zhang, Boheng; Hu, Shaohan et al.
Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings. ed. / De-Shuang Huang; Vitoantonio Bevilacqua; Prashan Premaratne. Springer Verlag, 2019. pp. 448-459 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11643 LNCS).

TY - GEN

T1 - Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks

AU - Zhang, Enze

AU - Zhang, Boheng

AU - Hu, Shaohan

AU - Zhang, Fa

AU - Wan, Xiaohua

PY - 2019

Y1 - 2019

KW - Deep learning

KW - High-throughput microscopy images

KW - Mixed patterns of proteins

KW - Protein classification

UR - http://www.scopus.com/inward/record.url?scp=85070703026&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-26763-6_43

DO - 10.1007/978-3-030-26763-6_43

M3 - Conference contribution

AN - SCOPUS:85070703026

SN - 9783030267629

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 448

EP - 459

BT - Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings

A2 - Huang, De-Shuang

A2 - Bevilacqua, Vitoantonio

A2 - Premaratne, Prashan

PB - Springer Verlag

T2 - 15th International Conference on Intelligent Computing, ICIC 2019

Y2 - 3 August 2019 through 6 August 2019

ER -

Zhang E, Zhang B, Hu S, Zhang F, Wan X. Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks. In Huang DS, Bevilacqua V, Premaratne P, editors, Intelligent Computing Theories and Application - 15th International Conference, ICIC 2019, Proceedings. Springer Verlag. 2019. p. 448-459. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-26763-6_43
