Three-dimensional (3D) object reconstruction based on differentiable rendering (DR) is an active research topic in computer vision. DR-based methods minimize the dif- ference between the rendered and target images by optimizing both the shape and ap- pearance and realizing a high visual reproductivity. However, most approaches perform poorly for textureless objects because of the geometrical ambiguity, which means that multiple shapes can have the same rendered result in such objects. To overcome this problem, we introduce active sensing with structured light (SL) into multi-view 3D object reconstruction based on DR to learn the unknown geometry and appearance of arbitrary scenes and camera poses. More specifically, our framework leverages the correspon- dences between pixels in different views calculated by structured light as an additional constraint in the DR-based optimization of implicit surface, color representations, and camera poses. Because camera poses can be optimized simultaneously, our method re- alizes high reconstruction accuracy in the textureless region and reduces efforts for cam- era pose calibration, which is required for conventional SL-based methods. Experiment results on both synthetic and real data demonstrate that our system outperforms conven- tional DR- and SL-based methods in a high-quality surface reconstruction, particularly for challenging objects with textureless or shiny surfaces.
@inproceedings{li2022multi,
title={Multi-View Neural Surface Reconstruction with Structured Light},
author={Li, Chunyu and Hashimoto, Taisuke and Matsumoto, Eiichi and Kato, Hiroharu},
booktitle={The 33rd British Machine Vision Conference (BMVC)},
year={2022}
}