TY - GEN
T1 - Voxel Grid Performer
T2 - Optical Metrology and Inspection for Industrial Applications IX 2022
AU - Fan, Zhongyi
AU - Liu, Ming
AU - Zhao, Yuejin
AU - Dong, Liquan
AU - Hui, Mei
AU - Kong, Lingqin
AU - Yang, Qikun
N1 - Publisher Copyright:
© 2022 SPIE.
PY - 2022
Y1 - 2022
N2 - Novel view synthesis is a long-standing problem. Despite the rapid development of neural radiance field (NeRF), in terms of rendering dynamic human body, NeRF still cannot achieve a good trade-off in precision and efficiency. In this paper, we aim at synthesizing a free-viewpoint video of an arbitrary human performers in an efficient way, only requiring a sparse number of camera views as inputs and skirting per-case fine-tuning. Recently, several works have addressed this problem by learning person-specific neural radiance fields to capture the appearance of a particular human. In parallel, some work proposed to use pixel-aligned features to generalize radiance fields to arbitrary new scenes and objects. Adopting these generalization aapprochs to human achieve reasonable rendering result. However, due to the difficulties of modeling the complex appearance of human and the dynamic sense, it is challenging to train nerf well in an efficient way. We find that the slow convergence of the human body reconstruction model is largely due to the nerf representation. In this work, we introduce a voxel grid based representation for human view synthesis, termed Voxel Grid Performer(VGP). Specifically, a sparse voxel grid is designed to represent the density and color in every sapce voxel, which enable better performance and less computation than conventional nerf optimization. We perform extensive experiments on both seen human performer and unseen human performer, demonstrating that our approach surpasses nerf-based methods on a wide variety of metrics. Code and data will be made available at https://github.com/fanzhongyi/vgp.
AB - Novel view synthesis is a long-standing problem. Despite the rapid development of neural radiance field (NeRF), in terms of rendering dynamic human body, NeRF still cannot achieve a good trade-off in precision and efficiency. In this paper, we aim at synthesizing a free-viewpoint video of an arbitrary human performers in an efficient way, only requiring a sparse number of camera views as inputs and skirting per-case fine-tuning. Recently, several works have addressed this problem by learning person-specific neural radiance fields to capture the appearance of a particular human. In parallel, some work proposed to use pixel-aligned features to generalize radiance fields to arbitrary new scenes and objects. Adopting these generalization aapprochs to human achieve reasonable rendering result. However, due to the difficulties of modeling the complex appearance of human and the dynamic sense, it is challenging to train nerf well in an efficient way. We find that the slow convergence of the human body reconstruction model is largely due to the nerf representation. In this work, we introduce a voxel grid based representation for human view synthesis, termed Voxel Grid Performer(VGP). Specifically, a sparse voxel grid is designed to represent the density and color in every sapce voxel, which enable better performance and less computation than conventional nerf optimization. We perform extensive experiments on both seen human performer and unseen human performer, demonstrating that our approach surpasses nerf-based methods on a wide variety of metrics. Code and data will be made available at https://github.com/fanzhongyi/vgp.
KW - 3D Deep Learning
KW - Human Reconstruction
KW - Image-based Rendering
KW - Neural Radiance Field
KW - Novel View Synthesis
KW - Voxel Grid
UR - http://www.scopus.com/inward/record.url?scp=85146662509&partnerID=8YFLogxK
U2 - 10.1117/12.2643856
DO - 10.1117/12.2643856
M3 - Conference contribution
AN - SCOPUS:85146662509
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Optical Metrology and Inspection for Industrial Applications IX
A2 - Han, Sen
A2 - Han, Sen
A2 - Ehret, Gerd
A2 - Chen, Benyong
PB - SPIE
Y2 - 5 December 2022 through 11 December 2022
ER -