M3VSNET: Unsupervised Multi-Metric Multi-View Stereo Network

Baichuan Huang, Hongwei Yi, Can Huang, Yijia He, Jingbin Liu, Xiao Liu

Research output: Chapter in Book/Report/Conference proceedingPaper in conference proceedingpeer-review

Abstract

The present Multi-view stereo (MVS) methods with supervised learning-based networks have an impressive performance comparing with traditional MVS methods. However, the ground-truth depth maps for training are hard to be obtained and are within limited kinds of scenarios. In this paper, we propose a novel unsupervised multi-metric MVS network, named M 3 VSNet, for dense point cloud reconstruction without any supervision. To improve the robustness and completeness of point cloud reconstruction, we propose a novel multi-metric loss function that combines pixel-wise and feature-wise loss function to learn the inherent constraints from different perspectives of matching correspondences. Besides, we also incorporate the normal-depth consistency in the 3D point cloud format to improve the accuracy and continuity of the estimated depth maps. Experimental results show that M 3 VSNet establishes the state-of-the-arts unsupervised method and achieves better performance than previous supervised MVSNet on the DTU dataset and demonstrates the powerful generalization ability on the Tanks & Temples benchmark with effective improvement.
Original languageEnglish
Title of host publicationProceeding 2021 IEEE International Conference on Image Processing (ICIP)
PublisherIEEE - Institute of Electrical and Electronics Engineers Inc.
Pages3163-3167
ISBN (Electronic)978-1-6654-4115-5
DOIs
Publication statusPublished - 2021 Sept 19
Externally publishedYes
Event2021 IEEE International Conference on Image Processing (ICIP) - Anchorage, AK, USA
Duration: 2021 Sept 192021 Sept 22

Conference

Conference2021 IEEE International Conference on Image Processing (ICIP)
Period2021/09/192021/09/22

Subject classification (UKÄ)

  • Computer Vision and Robotics (Autonomous Systems)

Fingerprint

Dive into the research topics of 'M3VSNET: Unsupervised Multi-Metric Multi-View Stereo Network'. Together they form a unique fingerprint.

Cite this