Deep learning based data augmentation for large-scale mineral image recognition and classification

Yang Liu; Xueyi Wang; Zelin Zhang; Fang Deng

doi:10.1016/j.mineng.2023.108411

Deep learning based data augmentation for large-scale mineral image recognition and classification

Yang Liu, Xueyi Wang, Zelin Zhang, Fang Deng^*

^*此作品的通讯作者

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.

源语言	英语
文章编号	108411
期刊	Minerals Engineering
卷	204
DOI	https://doi.org/10.1016/j.mineng.2023.108411
出版状态	已出版 - 12月 2023

访问文件

10.1016/j.mineng.2023.108411

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{3fb6be7ae28f45c7821552884ab1fe22,

title = "Deep learning based data augmentation for large-scale mineral image recognition and classification",

abstract = "Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.",

keywords = "Data augmentation, Generative adversarial networks, Large-scale image classification, Ore sorting",

author = "Yang Liu and Xueyi Wang and Zelin Zhang and Fang Deng",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Ltd",

year = "2023",

month = dec,

doi = "10.1016/j.mineng.2023.108411",

language = "English",

volume = "204",

journal = "Minerals Engineering",

issn = "0892-6875",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Deep learning based data augmentation for large-scale mineral image recognition and classification

AU - Liu, Yang

AU - Wang, Xueyi

AU - Zhang, Zelin

AU - Deng, Fang

PY - 2023/12

Y1 - 2023/12

N2 - Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.

AB - Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.

KW - Data augmentation

KW - Generative adversarial networks

KW - Large-scale image classification

KW - Ore sorting

UR - http://www.scopus.com/inward/record.url?scp=85173228765&partnerID=8YFLogxK

U2 - 10.1016/j.mineng.2023.108411

DO - 10.1016/j.mineng.2023.108411

M3 - Article

AN - SCOPUS:85173228765

SN - 0892-6875

VL - 204

JO - Minerals Engineering

JF - Minerals Engineering

M1 - 108411

ER -

Deep learning based data augmentation for large-scale mineral image recognition and classification

摘要

访问文件

其它文件与链接

指纹

引用此