Study on Unbalanced Binary Classification with Unknown Misclassification Costs

J. Gao, L. Gong, J. Y. Wang, Z. C. Mo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

With the rapid development of big data and machine learning technologies, many fields have begun to use related algorithms and methods. Classification algorithms have been widely used in the fields of financial risk identification, fault diagnosis, medical diagnosis, etc. However, the datasets are often unbalanced in these cases and the original methods fail to classify instances correctly. Many methods such as over-sampling, under-sampling and ensemble methods were raised to improve the classifier's performance, but which one to choose for a certain dataset still remains a problem. Therefore, this paper aims at a experimental conclusion on which kind of method can perform best on unbalanced classification problems generally. In detail, we evaluated the performances of 13 kinds of methods for unbalanced classification on several unbalanced datasets which have different amounts of instances and different ratios of positive instances, and finally came to a conclusion.

Original languageEnglish
Title of host publication2018 IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2018
PublisherIEEE Computer Society
Pages1538-1542
Number of pages5
ISBN (Electronic)9781538667866
DOIs
Publication statusPublished - 2 Jul 2018
Event2018 IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2018 - Bangkok, Thailand
Duration: 16 Dec 201819 Dec 2018

Publication series

NameIEEE International Conference on Industrial Engineering and Engineering Management
Volume2019-December
ISSN (Print)2157-3611
ISSN (Electronic)2157-362X

Conference

Conference2018 IEEE International Conference on Industrial Engineering and Engineering Management, IEEM 2018
Country/TerritoryThailand
CityBangkok
Period16/12/1819/12/18

Keywords

  • Binary Classification
  • unbalanced Data

Fingerprint

Dive into the research topics of 'Study on Unbalanced Binary Classification with Unknown Misclassification Costs'. Together they form a unique fingerprint.

Cite this