Distribution based ensemble for class imbalance learning

Ghulam Mustafa, Zhendong Niu, Abdallah Yousif, John Tarus

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)

Abstract

MultiBoost ensemble has been well acknowledged as an effective learning algorithm which able to reduce both bias and variance in error and has high generalization performance. However, to deal with the class imbalanced learning, the Multi- Boost shall be amended. In this paper, a new hybrid machine learning method called Distribution based MultiBoost (DBMB) for class imbalanced problems is proposed, which combines Distribution based balanced sampling with the MultiBoost algo- rithm to achieve better minority class performance. It minimizes the within class and between class imbalance by learning and sampling different distributions (Gaussian and Poisson) and reduces bias and variance in error by employing the MultiBoost ensemble. Therefore, DBMB could output the final strong learner that is more proficient ensemble of weak base learners for imbalanced data sets. We prove that the G-mean, F1 measure and AUC of the DBMB is significantly superior to others. The experimental verification has shown that the proposed DBMB outperforms other state-of-the-art algorithms on many real world class imbalanced problems. Furthermore, our proposed method is scalable as compare to other boosting methods.

Original languageEnglish
Title of host publication5th International Conference on Innovative Computing Technology, INTECH 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5-10
Number of pages6
ISBN (Electronic)9781467375504
DOIs
Publication statusPublished - 30 Jul 2015
Event5th International Conference on Innovative Computing Technology, INTECH 2015 - Galicia, Spain
Duration: 20 May 201522 May 2015

Publication series

Name5th International Conference on Innovative Computing Technology, INTECH 2015

Conference

Conference5th International Conference on Innovative Computing Technology, INTECH 2015
Country/TerritorySpain
CityGalicia
Period20/05/1522/05/15

Keywords

  • Class imbalance learning
  • MultiBoost
  • distribution based resampling
  • ensemble learning

Fingerprint

Dive into the research topics of 'Distribution based ensemble for class imbalance learning'. Together they form a unique fingerprint.

Cite this