Exploiting the categorical reliability difference for binary classification

Lei Sun; Kar Ann Toh; Badong Chen; Zhiping Lin

doi:10.1016/j.jfranklin.2017.11.024

Exploiting the categorical reliability difference for binary classification

Lei Sun, Kar Ann Toh^*, Badong Chen, Zhiping Lin

^*Corresponding author for this work

School of Integrated Circuits and Electronics

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

In binary pattern classification, the reliabilities of statistics obtained from the samples of the two categories are generally different. When the statistics are used for modeling a classifier, such reliability difference could impact the generalization performance. We formulate a disparity index to show the statistical disparity based on the generalized eigenvalue decomposition of the categorical moment matrices. It is shown that this disparity index can effectively indicate the reliability difference between the two categories. The obtained reliability difference is subsequently utilized to adjust the regularization term of a classifier for effective learning generalization. Our experiments based on 10 real-world benchmark data sets validate the effectiveness of the proposed method.

Original language	English
Pages (from-to)	2022-2040
Number of pages	19
Journal	Journal of the Franklin Institute
Volume	355
Issue number	4
DOIs	https://doi.org/10.1016/j.jfranklin.2017.11.024
Publication status	Published - Mar 2018

Access to Document

10.1016/j.jfranklin.2017.11.024

Cite this

@article{87be471517754ebfbb973eed4a7c7fc9,

title = "Exploiting the categorical reliability difference for binary classification",

abstract = "In binary pattern classification, the reliabilities of statistics obtained from the samples of the two categories are generally different. When the statistics are used for modeling a classifier, such reliability difference could impact the generalization performance. We formulate a disparity index to show the statistical disparity based on the generalized eigenvalue decomposition of the categorical moment matrices. It is shown that this disparity index can effectively indicate the reliability difference between the two categories. The obtained reliability difference is subsequently utilized to adjust the regularization term of a classifier for effective learning generalization. Our experiments based on 10 real-world benchmark data sets validate the effectiveness of the proposed method.",

author = "Lei Sun and Toh, {Kar Ann} and Badong Chen and Zhiping Lin",

note = "Publisher Copyright: {\textcopyright} 2017 The Franklin Institute",

year = "2018",

month = mar,

doi = "10.1016/j.jfranklin.2017.11.024",

language = "English",

volume = "355",

pages = "2022--2040",

journal = "Journal of the Franklin Institute",

issn = "0016-0032",

publisher = "Elsevier Ltd.",

number = "4",

}

TY - JOUR

T1 - Exploiting the categorical reliability difference for binary classification

AU - Sun, Lei

AU - Toh, Kar Ann

AU - Chen, Badong

AU - Lin, Zhiping

PY - 2018/3

Y1 - 2018/3

N2 - In binary pattern classification, the reliabilities of statistics obtained from the samples of the two categories are generally different. When the statistics are used for modeling a classifier, such reliability difference could impact the generalization performance. We formulate a disparity index to show the statistical disparity based on the generalized eigenvalue decomposition of the categorical moment matrices. It is shown that this disparity index can effectively indicate the reliability difference between the two categories. The obtained reliability difference is subsequently utilized to adjust the regularization term of a classifier for effective learning generalization. Our experiments based on 10 real-world benchmark data sets validate the effectiveness of the proposed method.

AB - In binary pattern classification, the reliabilities of statistics obtained from the samples of the two categories are generally different. When the statistics are used for modeling a classifier, such reliability difference could impact the generalization performance. We formulate a disparity index to show the statistical disparity based on the generalized eigenvalue decomposition of the categorical moment matrices. It is shown that this disparity index can effectively indicate the reliability difference between the two categories. The obtained reliability difference is subsequently utilized to adjust the regularization term of a classifier for effective learning generalization. Our experiments based on 10 real-world benchmark data sets validate the effectiveness of the proposed method.

UR - http://www.scopus.com/inward/record.url?scp=85038849332&partnerID=8YFLogxK

U2 - 10.1016/j.jfranklin.2017.11.024

DO - 10.1016/j.jfranklin.2017.11.024

M3 - Article

AN - SCOPUS:85038849332

SN - 0016-0032

VL - 355

SP - 2022

EP - 2040

JO - Journal of the Franklin Institute

JF - Journal of the Franklin Institute

IS - 4

ER -

Exploiting the categorical reliability difference for binary classification

Abstract

Access to Document

Other files and links

Fingerprint

Cite this