Deep learning based data augmentation for large-scale mineral image recognition and classification

Yang Liu, Xueyi Wang, Zelin Zhang, Fang Deng*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.

源语言英语
文章编号108411
期刊Minerals Engineering
204
DOI
出版状态已出版 - 12月 2023

指纹

探究 'Deep learning based data augmentation for large-scale mineral image recognition and classification' 的科研主题。它们共同构成独一无二的指纹。

引用此