TY - GEN
T1 - A Student behavior Recognition Algorithm Based on Improved MobileNetV2
AU - Zhou, Zheng
AU - Hirota, Kaoru
AU - Dai, Yaping
AU - Mersha, Bemnet Wondimagegnehu
AU - Dai, Wei
AU - Lin, Yumin
N1 - Publisher Copyright:
© 2025 SPIE.
PY - 2025
Y1 - 2025
N2 - Student behavior recognition is crucial for detecting learning activities within the classroom. Addressing the high complexity, poor real-time performance, and low accuracy of existing recognition models, we develop a student behavior classification dataset and propose an algorithm for student behavior recognition based on improved MobileNetV2. The algorithm diminishes model complexity and accelerates inference speed by reducing the width factor and improves model accuracy by improving classifier of the model and embedding coordinate attention block. The ablation experiment demonstrates the algorithm’s superior performance, achieving an average accuracy of 92.45% on the dataset, a 2.64% improvement over the initial model. The parameter is also significantly reduced to 0.56M, marking a 75% decrease. Computational requirements are lowered to 1.67 GFLOPs, representing a 68% reduction. Furthermore, the inference speed is enhanced to 244 FPS, a 106.7% increase compared to the initial model. Comparative experiment indicates that our algorithm outperforms other lightweight neural networks, including EfficientNet, DenseNet, ShuffleNetV2, and MobileNetV3, in terms of model complexity, inference speed, and accuracy.
AB - Student behavior recognition is crucial for detecting learning activities within the classroom. Addressing the high complexity, poor real-time performance, and low accuracy of existing recognition models, we develop a student behavior classification dataset and propose an algorithm for student behavior recognition based on improved MobileNetV2. The algorithm diminishes model complexity and accelerates inference speed by reducing the width factor and improves model accuracy by improving classifier of the model and embedding coordinate attention block. The ablation experiment demonstrates the algorithm’s superior performance, achieving an average accuracy of 92.45% on the dataset, a 2.64% improvement over the initial model. The parameter is also significantly reduced to 0.56M, marking a 75% decrease. Computational requirements are lowered to 1.67 GFLOPs, representing a 68% reduction. Furthermore, the inference speed is enhanced to 244 FPS, a 106.7% increase compared to the initial model. Comparative experiment indicates that our algorithm outperforms other lightweight neural networks, including EfficientNet, DenseNet, ShuffleNetV2, and MobileNetV3, in terms of model complexity, inference speed, and accuracy.
KW - behavior recognition
KW - Lightweight
KW - MobileNetV2
UR - http://www.scopus.com/inward/record.url?scp=105000830726&partnerID=8YFLogxK
U2 - 10.1117/12.3059118
DO - 10.1117/12.3059118
M3 - Conference contribution
AN - SCOPUS:105000830726
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Eighth International Conference on Video and Image Processing, ICVIP 2024
A2 - Liang, Xuefeng
PB - SPIE
T2 - 8th International Conference on Video and Image Processing, ICVIP 2024
Y2 - 13 December 2024 through 15 December 2024
ER -