TY - JOUR
T1 - An online Bayesian approach to change-point detection for categorical data
AU - Fan, Yiwei
AU - Lu, Xiaoling
N1 - Publisher Copyright:
© 2020 Elsevier B.V.
PY - 2020/5/21
Y1 - 2020/5/21
N2 - Change-point detection for categorical data has wide applications in many fields. Existing methods either are distribution-free, not utilizing categorical information sufficiently, or have limited performance when there exists “rare events” (events that occur with low frequency). In this paper, we propose a Bayesian change-point detection model for categorical data based on Dirichlet-multinomial mixtures. Because of the introduction of prior information, our method performs well for the existence of “rare events”. An online parameter estimation procedure and an online detection strategy are then designed to adapt to data streams. Monte Carlo simulations discuss the power of the proposed method and show advantages compared with existing algorithms. Applications in biomedical research, document analysis, health news case study and location monitoring indicate practical values of our method.
AB - Change-point detection for categorical data has wide applications in many fields. Existing methods either are distribution-free, not utilizing categorical information sufficiently, or have limited performance when there exists “rare events” (events that occur with low frequency). In this paper, we propose a Bayesian change-point detection model for categorical data based on Dirichlet-multinomial mixtures. Because of the introduction of prior information, our method performs well for the existence of “rare events”. An online parameter estimation procedure and an online detection strategy are then designed to adapt to data streams. Monte Carlo simulations discuss the power of the proposed method and show advantages compared with existing algorithms. Applications in biomedical research, document analysis, health news case study and location monitoring indicate practical values of our method.
KW - Bayes factor
KW - Change-point detection
KW - Dirichlet-multinomial mixtures
KW - Online strategy
UR - http://www.scopus.com/inward/record.url?scp=85082818425&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2020.105792
DO - 10.1016/j.knosys.2020.105792
M3 - Article
AN - SCOPUS:85082818425
SN - 0950-7051
VL - 196
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 105792
ER -