TY - GEN
T1 - Multi-view Consistency View Synthesis
AU - Wu, Xiaodi
AU - Zhang, Zhiqiang
AU - Yu, Wenxin
AU - Chen, Shiyu
AU - Gao, Yufei
AU - Chen, Peng
AU - Gong, Jun
N1 - Publisher Copyright:
© 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2024
Y1 - 2024
N2 - Novel view synthesis (NVS) aims to synthesize photo-realistic images of a scene from existing source images. The core objective is for the synthesized images to match the actual scene content as closely as possible. In recent years, various approaches have shifted the focus towards the visual quality of images in continuous space or time. However, current methods for static scenes treat the rendering of each image as an isolated process, neglecting the geometric consistency of the static scene. This often results in incoherent visual experiences, such as flicker or artifacts, in synthesized image sequences. To address this limitation, we propose Multi-View Consistency View Synthesis (MCVS). MCVS leverages long short-term memory (LSTM) and a self-attention mechanism to model the spatial correlation between synthesized images, thereby pushing them closer to the ground truth. MCVS not only enhances multi-view consistency but also improves the overall quality of the synthesized images. The proposed method is evaluated on the Tanks and Temples dataset and the FVS dataset. On average, its Learned Perceptual Image Patch Similarity (LPIPS) is better than that of state-of-the-art approaches by 0.14 to 0.16%, indicating the superiority of our approach.
AB - Novel view synthesis (NVS) aims to synthesize photo-realistic images of a scene from existing source images. The core objective is for the synthesized images to match the actual scene content as closely as possible. In recent years, various approaches have shifted the focus towards the visual quality of images in continuous space or time. However, current methods for static scenes treat the rendering of each image as an isolated process, neglecting the geometric consistency of the static scene. This often results in incoherent visual experiences, such as flicker or artifacts, in synthesized image sequences. To address this limitation, we propose Multi-View Consistency View Synthesis (MCVS). MCVS leverages long short-term memory (LSTM) and a self-attention mechanism to model the spatial correlation between synthesized images, thereby pushing them closer to the ground truth. MCVS not only enhances multi-view consistency but also improves the overall quality of the synthesized images. The proposed method is evaluated on the Tanks and Temples dataset and the FVS dataset. On average, its Learned Perceptual Image Patch Similarity (LPIPS) is better than that of state-of-the-art approaches by 0.14 to 0.16%, indicating the superiority of our approach.
KW - Deep Learning
KW - Long Short-Term Memory Mechanism
KW - Novel View Synthesis
UR - http://www.scopus.com/inward/record.url?scp=85178621328&partnerID=8YFLogxK
U2 - 10.1007/978-981-99-8148-9_25
DO - 10.1007/978-981-99-8148-9_25
M3 - Conference contribution
AN - SCOPUS:85178621328
SN - 9789819981472
T3 - Communications in Computer and Information Science
SP - 311
EP - 323
BT - Neural Information Processing - 30th International Conference, ICONIP 2023, Proceedings
A2 - Luo, Biao
A2 - Cheng, Long
A2 - Wu, Zheng-Guang
A2 - Li, Hongyi
A2 - Li, Chaojie
PB - Springer Science and Business Media Deutschland GmbH
T2 - 30th International Conference on Neural Information Processing, ICONIP 2023
Y2 - 20 November 2023 through 23 November 2023
ER -