Deep learning based data augmentation for large-scale mineral image recognition and classification

Yang Liu; Xueyi Wang; Zelin Zhang; Fang Deng

doi:10.1016/j.mineng.2023.108411

Deep learning based data augmentation for large-scale mineral image recognition and classification

Yang Liu, Xueyi Wang, Zelin Zhang, Fang Deng^*

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.

Original language	English
Article number	108411
Journal	Minerals Engineering
Volume	204
DOIs	https://doi.org/10.1016/j.mineng.2023.108411
Publication status	Published - Dec 2023

Keywords

Data augmentation
Generative adversarial networks
Large-scale image classification
Ore sorting

Access to Document

10.1016/j.mineng.2023.108411

Cite this

@article{3fb6be7ae28f45c7821552884ab1fe22,

title = "Deep learning based data augmentation for large-scale mineral image recognition and classification",

abstract = "Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.",

keywords = "Data augmentation, Generative adversarial networks, Large-scale image classification, Ore sorting",

author = "Yang Liu and Xueyi Wang and Zelin Zhang and Fang Deng",

note = "Publisher Copyright: {\textcopyright} 2023 Elsevier Ltd",

year = "2023",

month = dec,

doi = "10.1016/j.mineng.2023.108411",

language = "English",

volume = "204",

journal = "Minerals Engineering",

issn = "0892-6875",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Deep learning based data augmentation for large-scale mineral image recognition and classification

AU - Liu, Yang

AU - Wang, Xueyi

AU - Zhang, Zelin

AU - Deng, Fang

PY - 2023/12

Y1 - 2023/12

N2 - Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.

AB - Vision-based mineral image recognition and classification is a proven solution for autonomous unmanned ore sorting. Although accurate identification can be achieved by training models offline using large-scale datasets, the lack of sufficient labeled images still limits the accessibility and exploration of high-performance deep learning models. To address the above issues, referring to the generative adversarial networks, three different deep learning-based mineral image data augmentation models are proposed in this work. The experimental results show that the proposed models can generate mineral images with high fidelity and have high similarity to the ground truth in terms of texture, color and shape. Compared with classic data augmentation methods, proposed ones can better optimize downstream sorting tasks: the accuracy of ResNet101, ResNet50, InceptionV3 and VGG19 is improved by 18.52%, 9.94%, 4.39% and 2.39%, respectively. Finally, this work also presents a monolithic three-stage system workflow for large-scale mineral image recognition and classification.

KW - Data augmentation

KW - Generative adversarial networks

KW - Large-scale image classification

KW - Ore sorting

UR - http://www.scopus.com/inward/record.url?scp=85173228765&partnerID=8YFLogxK

U2 - 10.1016/j.mineng.2023.108411

DO - 10.1016/j.mineng.2023.108411

M3 - Article

AN - SCOPUS:85173228765

SN - 0892-6875

VL - 204

JO - Minerals Engineering

JF - Minerals Engineering

M1 - 108411

ER -

Deep learning based data augmentation for large-scale mineral image recognition and classification

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this