Human activity recognition with a multibranch network based on CNN and LSTM

Ruixin Yuan; Yanmei Zhang; Lizhe Wang; Shengyun Li

doi:10.1117/12.3023366

Human activity recognition with a multibranch network based on CNN and LSTM

Ruixin Yuan, Yanmei Zhang^*, Lizhe Wang, Shengyun Li

^*此作品的通讯作者

集成电路与电子学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

With the widespread use of wearable devices, human activity recognition (HAR) holds immense potential in health monitoring, smart environment. Notably, temporal sensory sequences collected from the wearable devices can provide accurate reflections of the daily activities. Nonetheless, existing CNN-based and LSTM-based methods have predominantly concentrated on feature extraction from univariate sequences, overlooking the implicit frequency information. Therefore, we firstly employed the Short Time Fourier Transform (STFT) in HAR tasks, extracting inherent frequency feature. Concurrently, we introduced a multi-branch network that combines CNN and LSTM. The CNN component captures spatial information of different dimensions. The LSTM, on the other hand, comprises two parts, one focused on temporal relationships within a single channel and the other concerned about channel relationships at a specific time point. In addition, recognizing the limitations in the available datasets, particularly the insufficient coverage of daily activities, we collected our custom dataset, encompassing eight distinct daily activity categories. Finally, we evaluated our proposed model and benchmark models. The results demonstrate that our network exhibits superior generalization across different datasets, archieving accuracy of 91.70%, 95.79%, 87.81% on the PAMAP2, UCI HAR and our own dataset respectively.

源语言	英语
主期刊名	Third International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023
编辑	Atsushi Inoue
出版商	SPIE
ISBN（电子版）	9781510672925
DOI	https://doi.org/10.1117/12.3023366
出版状态	已出版 - 2024
活动	3rd International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023 - Zhengzhou, 中国期限: 17 11月 2023 → 19 11月 2023

出版系列

姓名	Proceedings of SPIE - The International Society for Optical Engineering
卷	12987
ISSN（印刷版）	0277-786X
ISSN（电子版）	1996-756X

会议

会议	3rd International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023
国家/地区	中国
市	Zhengzhou
时期	17/11/23 → 19/11/23

联合国可持续发展目标

此成果有助于实现下列可持续发展目标：

访问文件

10.1117/12.3023366

其它文件与链接

链接到 Scopus 的出版物

引用此

Yuan, R., Zhang, Y., Wang, L., & Li, S. (2024). Human activity recognition with a multibranch network based on CNN and LSTM. 在 A. Inoue (编辑), Third International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023 文章 1298704 (Proceedings of SPIE - The International Society for Optical Engineering; 卷 12987). SPIE. https://doi.org/10.1117/12.3023366

@inproceedings{12b3ece0684f45ccafc6757063a7edb4,

title = "Human activity recognition with a multibranch network based on CNN and LSTM",

abstract = "With the widespread use of wearable devices, human activity recognition (HAR) holds immense potential in health monitoring, smart environment. Notably, temporal sensory sequences collected from the wearable devices can provide accurate reflections of the daily activities. Nonetheless, existing CNN-based and LSTM-based methods have predominantly concentrated on feature extraction from univariate sequences, overlooking the implicit frequency information. Therefore, we firstly employed the Short Time Fourier Transform (STFT) in HAR tasks, extracting inherent frequency feature. Concurrently, we introduced a multi-branch network that combines CNN and LSTM. The CNN component captures spatial information of different dimensions. The LSTM, on the other hand, comprises two parts, one focused on temporal relationships within a single channel and the other concerned about channel relationships at a specific time point. In addition, recognizing the limitations in the available datasets, particularly the insufficient coverage of daily activities, we collected our custom dataset, encompassing eight distinct daily activity categories. Finally, we evaluated our proposed model and benchmark models. The results demonstrate that our network exhibits superior generalization across different datasets, archieving accuracy of 91.70%, 95.79%, 87.81% on the PAMAP2, UCI HAR and our own dataset respectively.",

keywords = "cnn, deep learning, human activity recognition, lstm, multi-branch",

author = "Ruixin Yuan and Yanmei Zhang and Lizhe Wang and Shengyun Li",

note = "Publisher Copyright: {\textcopyright} 2024 SPIE.; 3rd International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023 ; Conference date: 17-11-2023 Through 19-11-2023",

year = "2024",

doi = "10.1117/12.3023366",

language = "English",

series = "Proceedings of SPIE - The International Society for Optical Engineering",

publisher = "SPIE",

editor = "Atsushi Inoue",

booktitle = "Third International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023",

address = "United States",

}

Yuan, R, Zhang, Y, Wang, L & Li, S 2024, Human activity recognition with a multibranch network based on CNN and LSTM. 在 A Inoue (编辑), Third International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023., 1298704, Proceedings of SPIE - The International Society for Optical Engineering, 卷 12987, SPIE, 3rd International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023, Zhengzhou, 中国, 17/11/23. https://doi.org/10.1117/12.3023366

Human activity recognition with a multibranch network based on CNN and LSTM. / Yuan, Ruixin; Zhang, Yanmei; Wang, Lizhe 等.
Third International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023. 编辑 / Atsushi Inoue. SPIE, 2024. 1298704 (Proceedings of SPIE - The International Society for Optical Engineering; 卷 12987).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Human activity recognition with a multibranch network based on CNN and LSTM

AU - Yuan, Ruixin

AU - Zhang, Yanmei

AU - Wang, Lizhe

AU - Li, Shengyun

PY - 2024

Y1 - 2024

N2 - With the widespread use of wearable devices, human activity recognition (HAR) holds immense potential in health monitoring, smart environment. Notably, temporal sensory sequences collected from the wearable devices can provide accurate reflections of the daily activities. Nonetheless, existing CNN-based and LSTM-based methods have predominantly concentrated on feature extraction from univariate sequences, overlooking the implicit frequency information. Therefore, we firstly employed the Short Time Fourier Transform (STFT) in HAR tasks, extracting inherent frequency feature. Concurrently, we introduced a multi-branch network that combines CNN and LSTM. The CNN component captures spatial information of different dimensions. The LSTM, on the other hand, comprises two parts, one focused on temporal relationships within a single channel and the other concerned about channel relationships at a specific time point. In addition, recognizing the limitations in the available datasets, particularly the insufficient coverage of daily activities, we collected our custom dataset, encompassing eight distinct daily activity categories. Finally, we evaluated our proposed model and benchmark models. The results demonstrate that our network exhibits superior generalization across different datasets, archieving accuracy of 91.70%, 95.79%, 87.81% on the PAMAP2, UCI HAR and our own dataset respectively.

AB - With the widespread use of wearable devices, human activity recognition (HAR) holds immense potential in health monitoring, smart environment. Notably, temporal sensory sequences collected from the wearable devices can provide accurate reflections of the daily activities. Nonetheless, existing CNN-based and LSTM-based methods have predominantly concentrated on feature extraction from univariate sequences, overlooking the implicit frequency information. Therefore, we firstly employed the Short Time Fourier Transform (STFT) in HAR tasks, extracting inherent frequency feature. Concurrently, we introduced a multi-branch network that combines CNN and LSTM. The CNN component captures spatial information of different dimensions. The LSTM, on the other hand, comprises two parts, one focused on temporal relationships within a single channel and the other concerned about channel relationships at a specific time point. In addition, recognizing the limitations in the available datasets, particularly the insufficient coverage of daily activities, we collected our custom dataset, encompassing eight distinct daily activity categories. Finally, we evaluated our proposed model and benchmark models. The results demonstrate that our network exhibits superior generalization across different datasets, archieving accuracy of 91.70%, 95.79%, 87.81% on the PAMAP2, UCI HAR and our own dataset respectively.

KW - cnn

KW - deep learning

KW - human activity recognition

KW - lstm

KW - multi-branch

UR - http://www.scopus.com/inward/record.url?scp=85191228993&partnerID=8YFLogxK

U2 - 10.1117/12.3023366

DO - 10.1117/12.3023366

M3 - Conference contribution

AN - SCOPUS:85191228993

T3 - Proceedings of SPIE - The International Society for Optical Engineering

BT - Third International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023

A2 - Inoue, Atsushi

PB - SPIE

T2 - 3rd International Conference on Computer Technology, Information Engineering, and Electron Materials, CTIEEM 2023

Y2 - 17 November 2023 through 19 November 2023

ER -

Human activity recognition with a multibranch network based on CNN and LSTM

摘要

出版系列

会议

联合国可持续发展目标

访问文件

其它文件与链接

指纹

引用此