Cross-View Action Recognition Over Heterogeneous Feature Spaces

Xinxiao Wu; Han Wang; Cuiwei Liu; Yunde Jia

doi:10.1109/TIP.2015.2445293

School of Computer Science

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

17 citations (Scopus)

Abstract

In cross-view action recognition, what you see in one view differs from what you recognize in another view, since the data distribution and even the feature space can change from one view to another. In this paper, we address the problem of transferring action models learned in one view (the source view) to a different view (the target view), where action instances from the two views are represented by heterogeneous features. A novel learning method, called heterogeneous transfer discriminant-analysis of canonical correlations (HTDCC), is proposed to discover a discriminative common feature space that links the source view and the target view so that knowledge can be transferred between them. Two projection matrices are learned to map data from the source view and the target view, respectively, into a common feature space by simultaneously minimizing the canonical correlations of interclass training data, maximizing the canonical correlations of intraclass training data, and reducing the data distribution mismatch between the source and target views in the common feature space. In our method, the source view and the target view neither share any common features nor have any corresponding action instances. Moreover, our HTDCC method can handle the case where only a few, or even no, labeled samples are available in the target view, and it can be easily extended to multiple source views. We additionally propose a weighting learning framework for multiple-source-view adaptation that effectively leverages action knowledge learned from multiple source views for the recognition task in the target view. Under this framework, different source views are assigned different weights according to their relevance to the target view; each weight represents how much the corresponding source view contributes to the target view.
Extensive experiments on the IXMAS data set demonstrate the effectiveness of HTDCC in learning the common feature space for heterogeneous cross-view action recognition. In addition, the weighting learning framework achieves promising results in automatically adapting knowledge transferred from multiple source views to the target view.
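
The abstract builds on canonical correlations between sets of training data. As a minimal sketch of that quantity only (not the HTDCC method, whose projection matrices and objective are not given in this record), the canonical correlations between two linear subspaces can be computed as the singular values of the product of their orthonormal bases. The function name `canonical_correlations` and the assumption that both inputs already live in the same feature space with full column rank are illustrative choices, not taken from the paper.

```python
import numpy as np


def canonical_correlations(X1, X2):
    """Cosines of the principal angles between the column spaces of X1 and X2.

    Assumes (hypothetically) that X1 and X2 are d-by-k matrices with full
    column rank whose columns lie in the same d-dimensional feature space.
    """
    # Orthonormal bases for the two subspaces (reduced QR).
    Q1, _ = np.linalg.qr(X1)
    Q2, _ = np.linalg.qr(X2)
    # Singular values of Q1^T Q2 are the canonical correlations.
    s = np.linalg.svd(Q1.T @ Q2, compute_uv=False)
    # Guard against tiny numerical overshoot outside [0, 1].
    return np.clip(s, 0.0, 1.0)


if __name__ == "__main__":
    basis = np.eye(4)
    same = canonical_correlations(basis[:, :2], basis[:, :2])
    disjoint = canonical_correlations(basis[:, :2], basis[:, 2:])
    print(same, disjoint)
```

Values near 1 indicate nearly aligned subspaces and values near 0 nearly orthogonal ones, which is the sense in which the abstract speaks of maximizing intraclass and minimizing interclass canonical correlations.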

Original language	English
Article number	7122882
Pages (from-to)	4096-4108
Number of pages	13
Journal	IEEE Transactions on Image Processing
Volume	24
Issue number	11
DOI	https://doi.org/10.1109/TIP.2015.2445293
Publication status	Published - 1 Nov 2015

Cite this

Wu, X., Wang, H., Liu, C., & Jia, Y. (2015). Cross-View Action Recognition Over Heterogeneous Feature Spaces. IEEE Transactions on Image Processing, 24(11), 4096-4108. Article 7122882. https://doi.org/10.1109/TIP.2015.2445293

@article{231e94272ce64f208ce8f36bed6b8abf,

title = "Cross-View Action Recognition Over Heterogeneous Feature Spaces",

abstract = "In cross-view action recognition, what you see in one view differs from what you recognize in another view, since the data distribution and even the feature space can change from one view to another. In this paper, we address the problem of transferring action models learned in one view (the source view) to a different view (the target view), where action instances from the two views are represented by heterogeneous features. A novel learning method, called heterogeneous transfer discriminant-analysis of canonical correlations (HTDCC), is proposed to discover a discriminative common feature space that links the source view and the target view so that knowledge can be transferred between them. Two projection matrices are learned to map data from the source view and the target view, respectively, into a common feature space by simultaneously minimizing the canonical correlations of interclass training data, maximizing the canonical correlations of intraclass training data, and reducing the data distribution mismatch between the source and target views in the common feature space. In our method, the source view and the target view neither share any common features nor have any corresponding action instances. Moreover, our HTDCC method can handle the case where only a few, or even no, labeled samples are available in the target view, and it can be easily extended to multiple source views. We additionally propose a weighting learning framework for multiple-source-view adaptation that effectively leverages action knowledge learned from multiple source views for the recognition task in the target view. Under this framework, different source views are assigned different weights according to their relevance to the target view; each weight represents how much the corresponding source view contributes to the target view. Extensive experiments on the IXMAS data set demonstrate the effectiveness of HTDCC in learning the common feature space for heterogeneous cross-view action recognition. In addition, the weighting learning framework achieves promising results in automatically adapting knowledge transferred from multiple source views to the target view.",

keywords = "Cross-view action recognition, heterogeneous features, multiple views adaptation, transfer learning",

author = "Xinxiao Wu and Han Wang and Cuiwei Liu and Yunde Jia",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.",

year = "2015",

month = nov,

day = "1",

doi = "10.1109/TIP.2015.2445293",

language = "English",

volume = "24",

pages = "4096--4108",

journal = "IEEE Transactions on Image Processing",

issn = "1057-7149",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "11",

}

TY - JOUR

T1 - Cross-View Action Recognition Over Heterogeneous Feature Spaces

AU - Wu, Xinxiao

AU - Wang, Han

AU - Liu, Cuiwei

AU - Jia, Yunde

PY - 2015/11/1

Y1 - 2015/11/1

AB - In cross-view action recognition, what you see in one view differs from what you recognize in another view, since the data distribution and even the feature space can change from one view to another. In this paper, we address the problem of transferring action models learned in one view (the source view) to a different view (the target view), where action instances from the two views are represented by heterogeneous features. A novel learning method, called heterogeneous transfer discriminant-analysis of canonical correlations (HTDCC), is proposed to discover a discriminative common feature space that links the source view and the target view so that knowledge can be transferred between them. Two projection matrices are learned to map data from the source view and the target view, respectively, into a common feature space by simultaneously minimizing the canonical correlations of interclass training data, maximizing the canonical correlations of intraclass training data, and reducing the data distribution mismatch between the source and target views in the common feature space. In our method, the source view and the target view neither share any common features nor have any corresponding action instances. Moreover, our HTDCC method can handle the case where only a few, or even no, labeled samples are available in the target view, and it can be easily extended to multiple source views. We additionally propose a weighting learning framework for multiple-source-view adaptation that effectively leverages action knowledge learned from multiple source views for the recognition task in the target view. Under this framework, different source views are assigned different weights according to their relevance to the target view; each weight represents how much the corresponding source view contributes to the target view. Extensive experiments on the IXMAS data set demonstrate the effectiveness of HTDCC in learning the common feature space for heterogeneous cross-view action recognition. In addition, the weighting learning framework achieves promising results in automatically adapting knowledge transferred from multiple source views to the target view.

KW - Cross-view action recognition

KW - heterogeneous features

KW - multiple views adaptation

KW - transfer learning

UR - http://www.scopus.com/inward/record.url?scp=84939538606&partnerID=8YFLogxK

U2 - 10.1109/TIP.2015.2445293

DO - 10.1109/TIP.2015.2445293

M3 - Article

AN - SCOPUS:84939538606

SN - 1057-7149

VL - 24

SP - 4096

EP - 4108

JO - IEEE Transactions on Image Processing

JF - IEEE Transactions on Image Processing

IS - 11

M1 - 7122882

ER -
