TY - GEN
T1 - A study of age gaps between online friends
AU - Liao, Lizi
AU - Jiang, Jing
AU - Lim, Ee Peng
AU - Huang, Heyan
PY - 2014
Y1 - 2014
N2 - User attribute extraction on social media has gain considerable attention, while existing methods are mostly supervised which suffer great difficulty in insufficient gold standard data. In this paper, we validate a strong hypothesis based on homophily and adapt it to ensure the certainty of user attribute we extracted via weakly supervised propagation. Homophily, the theory which states that people who are similar tend to become friends, has been well studied in the setting of online social networks. When we focus on age attribute, based on this theory, online friends tend to have similar age. In this work, we take a step further and study the hypothesis that the age gap between online friends become even smaller in a larger friendship clique. We empirically validate our hypothesis using two real social network data sets. We further design a propagation-based algorithm to predict online users' age, leveraging the clique-based hypothesis. We find that our algorithm can outperform several baselines. We believe that this method could work as a way to enrich sparse data and the hypothesis we validated would shed light on exploring the proximity of other user attributes such as education as well.
AB - User attribute extraction on social media has gain considerable attention, while existing methods are mostly supervised which suffer great difficulty in insufficient gold standard data. In this paper, we validate a strong hypothesis based on homophily and adapt it to ensure the certainty of user attribute we extracted via weakly supervised propagation. Homophily, the theory which states that people who are similar tend to become friends, has been well studied in the setting of online social networks. When we focus on age attribute, based on this theory, online friends tend to have similar age. In this work, we take a step further and study the hypothesis that the age gap between online friends become even smaller in a larger friendship clique. We empirically validate our hypothesis using two real social network data sets. We further design a propagation-based algorithm to predict online users' age, leveraging the clique-based hypothesis. We find that our algorithm can outperform several baselines. We believe that this method could work as a way to enrich sparse data and the hypothesis we validated would shed light on exploring the proximity of other user attributes such as education as well.
KW - age prediction
KW - homophily
KW - social network analysis
UR - http://www.scopus.com/inward/record.url?scp=84907417891&partnerID=8YFLogxK
U2 - 10.1145/2631775.2631800
DO - 10.1145/2631775.2631800
M3 - Conference contribution
AN - SCOPUS:84907417891
SN - 9781450329545
T3 - HT 2014 - Proceedings of the 25th ACM Conference on Hypertext and Social Media
SP - 98
EP - 106
BT - HT 2014 - Proceedings of the 25th ACM Conference on Hypertext and Social Media
PB - Association for Computing Machinery
T2 - 25th ACM Conference on Hypertext and Social Media, HT 2014
Y2 - 1 September 2014 through 4 September 2014
ER -