TY - GEN
T1 - Crowd saliency prediction with optimal feature combinations
AU - Ma, Kun
AU - Gao, Guangyu
AU - Ding, Gangyi
AU - Liu, Chi Harold
AU - Liu, Erwu
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2016/11/21
Y1 - 2016/11/21
N2 - Crowd saliency prediction refers to predicting where people look in a crowd scene. Humans have a remarkable ability to rapidly direct their gaze to visual information of interest when viewing a scene. Research efforts to date have focused on which types of features are representative of crowd saliency and which type of learning model is robust for crowd saliency prediction. In this paper, we propose a Random Forest (RF) based crowd saliency prediction approach with optimal feature combination, i.e., the Feature Combination Selection for Crowd Saliency (FCSCS) framework. More specifically, we first define two representative crowd saliency features: FaceSizeDiff and FacePoseDiff. Next, we adopt the RF algorithm to construct our saliency learning model. Then, we evaluate the performance of crowd saliency prediction classifiers with different feature combinations (fifteen combinations in our experiments). The selected features include low-level features (i.e., color, intensity, orientation), four existing crowd features (i.e., face size, face density, frontal face, profile face), and two newly defined features (i.e., FaceSizeDiff and FacePoseDiff). Finally, we obtain the optimal feature combination that is most suitable for crowd saliency prediction. We conduct extensive experiments and empirical evaluation to demonstrate the satisfactory performance of our approach.
AB - Crowd saliency prediction refers to predicting where people look in a crowd scene. Humans have a remarkable ability to rapidly direct their gaze to visual information of interest when viewing a scene. Research efforts to date have focused on which types of features are representative of crowd saliency and which type of learning model is robust for crowd saliency prediction. In this paper, we propose a Random Forest (RF) based crowd saliency prediction approach with optimal feature combination, i.e., the Feature Combination Selection for Crowd Saliency (FCSCS) framework. More specifically, we first define two representative crowd saliency features: FaceSizeDiff and FacePoseDiff. Next, we adopt the RF algorithm to construct our saliency learning model. Then, we evaluate the performance of crowd saliency prediction classifiers with different feature combinations (fifteen combinations in our experiments). The selected features include low-level features (i.e., color, intensity, orientation), four existing crowd features (i.e., face size, face density, frontal face, profile face), and two newly defined features (i.e., FaceSizeDiff and FacePoseDiff). Finally, we obtain the optimal feature combination that is most suitable for crowd saliency prediction. We conduct extensive experiments and empirical evaluation to demonstrate the satisfactory performance of our approach.
UR - http://www.scopus.com/inward/record.url?scp=85006741531&partnerID=8YFLogxK
U2 - 10.1109/WCSP.2016.7752552
DO - 10.1109/WCSP.2016.7752552
M3 - Conference contribution
AN - SCOPUS:85006741531
T3 - 2016 8th International Conference on Wireless Communications and Signal Processing, WCSP 2016
BT - 2016 8th International Conference on Wireless Communications and Signal Processing, WCSP 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th International Conference on Wireless Communications and Signal Processing, WCSP 2016
Y2 - 13 October 2016 through 15 October 2016
ER -