TY - JOUR
T1 - Developing mortality surveillance systems using Google trend
T2 - A pilot study
AU - Yeh, Fu Chun
AU - Yeh, Chien Hung
N1 - Publisher Copyright:
© 2019 Elsevier B.V.
PY - 2019/8/1
Y1 - 2019/8/1
N2 - In this paper, the mortality model for the developed country, which the United State possesses the largest economy in the world and thus serves as an ideal representation, is investigated. Early surveillance of the causes of death is critical which can allow the preparation of preventive steps against critical disease such as dengue fever. Studies reported that some search queries, especially those diseases related terms on Google Trends are essential. To this end, we include either main cause of death or the extended or the more general terminologies from Google Trends to decode the mortality related terms using the Wiener Cascade Model. Using time series and Wavelet scalogram of search terms, the patterns of search queries are categorized into different levels of periodicity. The results include (1)the decoding trend, (2)the features importance, and (3)the accuracy of the decoding patterns. Three scenarios regard predictors include the use of (1)all 19 features, (2)the top ten most periodic predictors, or (3)the ten predictors with the highest weighting. All search queries spans from December 2013–December 2018. The results show that search terms with both higher weight and annual periodic pattern contribute more in forecasting the word “die”; however, only predictors with higher weight are valuable to forecast the word “death”.
AB - In this paper, the mortality model for the developed country, which the United State possesses the largest economy in the world and thus serves as an ideal representation, is investigated. Early surveillance of the causes of death is critical which can allow the preparation of preventive steps against critical disease such as dengue fever. Studies reported that some search queries, especially those diseases related terms on Google Trends are essential. To this end, we include either main cause of death or the extended or the more general terminologies from Google Trends to decode the mortality related terms using the Wiener Cascade Model. Using time series and Wavelet scalogram of search terms, the patterns of search queries are categorized into different levels of periodicity. The results include (1)the decoding trend, (2)the features importance, and (3)the accuracy of the decoding patterns. Three scenarios regard predictors include the use of (1)all 19 features, (2)the top ten most periodic predictors, or (3)the ten predictors with the highest weighting. All search queries spans from December 2013–December 2018. The results show that search terms with both higher weight and annual periodic pattern contribute more in forecasting the word “die”; however, only predictors with higher weight are valuable to forecast the word “death”.
KW - Big data
KW - Cause of death
KW - Decode
KW - Google trends
KW - Mortality surveillance
KW - Wiener Cascade Model
UR - http://www.scopus.com/inward/record.url?scp=85065151513&partnerID=8YFLogxK
U2 - 10.1016/j.physa.2019.121125
DO - 10.1016/j.physa.2019.121125
M3 - Article
AN - SCOPUS:85065151513
SN - 0378-4371
VL - 527
JO - Physica A: Statistical Mechanics and its Applications
JF - Physica A: Statistical Mechanics and its Applications
M1 - 121125
ER -