TY - JOUR
T1 - ProbIBR
T2 - Fast Image-Based Rendering with Learned Probability-Guided Sampling
AU - Zhou, Yuemei
AU - Yu, Tao
AU - Zheng, Zerong
AU - Wu, Gaochang
AU - Zhao, Guihua
AU - Jiang, Wenbo
AU - Fu, Ying
AU - Liu, Yebin
N1 - Publisher Copyright:
IEEE
PY - 2024
Y1 - 2024
N2 - We present a general, fast, and practical solution for interpolating novel views of diverse real-world scenes given a sparse set of nearby views. Existing generic novel view synthesis methods rely on time-consuming scene geometry pre-computation or redundant sampling of the entire space for neural volumetric rendering, which limits overall efficiency. Instead, we incorporate learned MVS priors into the neural volume rendering pipeline and improve rendering efficiency by sampling fewer but more informative points under the guidance of depth probability distributions extracted from the learned MVS architecture. Based on this probability-guided sampling, we develop a neural volume rendering module that effectively integrates source view information with the learned scene structures. We further propose confidence-aware refinement to improve the rendering results in uncertain, occluded, and unreferenced regions. Moreover, we build a four-view camera system for holographic display and provide a real-time version of our framework for a free-viewpoint experience, in which novel view images at a spatial resolution of 512×512 are rendered at around 20 fps on a single RTX 3090 GPU. Experiments show that our method renders 15 to 40 times faster than state-of-the-art baselines, with strong generalization capacity and comparable high-quality novel view synthesis.
AB - We present a general, fast, and practical solution for interpolating novel views of diverse real-world scenes given a sparse set of nearby views. Existing generic novel view synthesis methods rely on time-consuming scene geometry pre-computation or redundant sampling of the entire space for neural volumetric rendering, which limits overall efficiency. Instead, we incorporate learned MVS priors into the neural volume rendering pipeline and improve rendering efficiency by sampling fewer but more informative points under the guidance of depth probability distributions extracted from the learned MVS architecture. Based on this probability-guided sampling, we develop a neural volume rendering module that effectively integrates source view information with the learned scene structures. We further propose confidence-aware refinement to improve the rendering results in uncertain, occluded, and unreferenced regions. Moreover, we build a four-view camera system for holographic display and provide a real-time version of our framework for a free-viewpoint experience, in which novel view images at a spatial resolution of 512×512 are rendered at around 20 fps on a single RTX 3090 GPU. Experiments show that our method renders 15 to 40 times faster than state-of-the-art baselines, with strong generalization capacity and comparable high-quality novel view synthesis.
KW - View synthesis
KW - Image-based rendering
KW - Volume rendering
UR - http://www.scopus.com/inward/record.url?scp=85187383664&partnerID=8YFLogxK
U2 - 10.1109/TVCG.2024.3372152
DO - 10.1109/TVCG.2024.3372152
M3 - Article
AN - SCOPUS:85187383664
SN - 1077-2626
SP - 1
EP - 15
JO - IEEE Transactions on Visualization and Computer Graphics
JF - IEEE Transactions on Visualization and Computer Graphics
ER -