TY - GEN
T1 - Preserving Global and Local Temporal Consistency for Arbitrary Video Style Transfer
AU - Wu, Xinxiao
AU - Chen, Jialu
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2020/10/12
Y1 - 2020/10/12
N2 - Video style transfer is a challenging task that requires not only stylizing video frames but also preserving temporal consistency among them. Many existing methods resort to optical flow to maintain temporal consistency in stylized videos. However, optical flow is sensitive to occlusions and rapid motions, and its estimation is quite slow, which makes it less practical in real-world applications. In this paper, we propose a novel fast method that explores both global and local temporal consistency for video style transfer without estimating optical flow. To preserve the temporal consistency of the entire video (i.e., global consistency), we use the structural similarity index instead of optical flow and propose a self-similarity loss to ensure temporal structural similarity between the stylized video and the source video. Furthermore, to enhance the coherence between adjacent frames (i.e., local consistency), a self-attention mechanism is designed to attend to the previous stylized frame when synthesizing the current frame.
KW - self-attention
KW - self-similarity
KW - video style transfer
UR - http://www.scopus.com/inward/record.url?scp=85106882811&partnerID=8YFLogxK
U2 - 10.1145/3394171.3413872
DO - 10.1145/3394171.3413872
M3 - Conference contribution
AN - SCOPUS:85106882811
T3 - MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia
SP - 1791
EP - 1799
BT - MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
T2 - 28th ACM International Conference on Multimedia, MM 2020
Y2 - 12 October 2020 through 16 October 2020
ER -