Research on Video Super-Resolution Technology Based on Multi-scale Spatiotemporal Information Aggregation

Xiao Luo*, Ang Li, Baoling Han

*此作品的通讯作者

科研成果: 书/报告/会议事项章节章节同行评审

摘要

As a new type of teaching tool, web course video breaks the limitations of traditional teaching methods, which has attracted widespread attention. However, due to the limited memory of the filming equipment, the web course video will be compressed and processed, resulting in a lower resolution of the video. In addition, when shooting video, it will be disturbed by factors such as lighting, character movement and blurred PPT projection, resulting in the final captured web course video being impaired in terms of brightness and clarity, which cannot meet the visual needs of users. Therefore, this paper uses video super-resolution reconstruction technology to predict and fill in the missing pixel information in low-resolution video frames, thereby obtaining high-resolution videos and improving user learning efficiency. First, in view of the problems such as occlusion and uneven illumination in online course videos, and the difficulty of the optical flow estimation network in accurately extracting the temporal dependencies in video frames, a Multi-scale Spatiotemporal Information Aggregation network was proposed. The network uses different sizes of 3D convolutions to not only accurately extract the temporal information between video frames at different time intervals, but also obtain the spatial information in the video frames, implicitly completing the alignment between video frames. Secondly, in view of the problem that conventional super-resolution reconstruction methods are difficult to reconstruct text areas in online course videos with high quality, a hybrid residual self-attention reconstruction network is proposed to construct a high-precision spatial self-attention module and a high-precision channel self-attention module. Significantly improves the reconstruction quality of text areas in online course videos. Experimental results show that this algorithm can achieve excellent results in the online course video super-resolution data set.

源语言英语
主期刊名Lecture Notes on Data Engineering and Communications Technologies
出版商Springer Science and Business Media Deutschland GmbH
165-174
页数10
DOI
出版状态已出版 - 2025

出版系列

姓名Lecture Notes on Data Engineering and Communications Technologies
218
ISSN(印刷版)2367-4512
ISSN(电子版)2367-4520

指纹

探究 'Research on Video Super-Resolution Technology Based on Multi-scale Spatiotemporal Information Aggregation' 的科研主题。它们共同构成独一无二的指纹。

引用此