skip to main content
10.1145/3568364.3568366acmotherconferencesArticle/Chapter ViewAbstractPublication PageswsseConference Proceedingsconference-collections
research-article

A Multi-view Matching Method Based on PatchmatchNet with Sparse Point Information

Published: 23 December 2022 Publication History

Abstract

The learning-based multi-view stereo (MVS) method has become a research hotspot in 3D reconstruction. Deep learning can extract more robust semantic features of images and better adapt to scenes of soft texture and non-diffuse reflection. However, current deep learning methods focus more on improving the quality of reconstruction, and we believe that reducing depth estimation time and GPU memory consumption is equally important. Therefore, this paper proposes S-PatchmatchNet with higher accuracy and faster efficiency. Firstly, in the initial stage of depth estimation, we use Colmap to obtain sparse points and generate initial depth information through triangulation and interpolation to replace the random initialization of PatchmatchNet, which reduces the time consumed in random depth and improves computational efficiency. Secondly, we design an effective data enhancement mechanism. Specifically, a mask 1/3 of the size of the image is used to randomly erase data on the image. By minimizing the error between the prediction result of enhanced data and the ground reality, the sample prediction is standardized, the robustness of the model is enhanced, and the accuracy of depth prediction is improved. To test the validity of our method, we conducted tests on the DTU dataset. Compared to PatchmatchNet, the efficiency and accuracy of our approach are improved to varying degrees. Meanwhile, we get competitive results on challenging Tanks and Temples datasets.

References

[1]
Aanæs, Henrik, Rasmus Ramsbøl Jensen, George Vogiatzis, Engin Tola & Anders Bjorholm Dahl (2016) Large-scale data for multiple-view stereopsis. International Journal of Computer Vision, 120, 153-168.
[2]
Campbell, Neill DF, George Vogiatzis, Carlos Hernández & Roberto Cipolla. 2008. Using multiple hypotheses to improve depth-maps for multi-view stereo. In European Conference on Computer Vision, 766-779. Springer.
[3]
Cheng, Shuo, Zexiang Xu, Shilin Zhu, Zhuwen Li, Li Erran Li, Ravi Ramamoorthi & Hao Su. 2020. Deep stereo using adaptive thin volume representation with uncertainty awareness. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2524-2534.
[4]
Furukawa, Yasutaka & Jean Ponce (2009) Accurate, dense, and robust multiview stereopsis. IEEE transactions on pattern analysis and machine intelligence, 32, 1362-1376.
[5]
Galliani, Silvano, Katrin Lasinger & Konrad Schindler. 2015. Massively parallel multiview stereopsis by surface normal diffusion. In Proceedings of the IEEE International Conference on Computer Vision, 873-881.
[6]
Gao, Zongyan & Lixin Han. 2020. Clothing image classification based on random erasing and residual network. In Journal of Physics: Conference Series, 012136. IOP Publishing.
[7]
Gu, Xiaodong, Zhiwen Fan, Siyu Zhu, Zuozhuo Dai, Feitong Tan & Ping Tan. 2020. Cascade cost volume for high-resolution multi-view stereo and stereo matching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2495-2504.
[8]
Ji, Mengqi, Juergen Gall, Haitian Zheng, Yebin Liu & Lu Fang. 2017. Surfacenet: An end-to-end 3d neural network for multiview stereopsis. In Proceedings of the IEEE International Conference on Computer Vision, 2307-2315.
[9]
Knapitsch, Arno, Jaesik Park, Qian-Yi Zhou & Vladlen Koltun (2017) Tanks and temples: Benchmarking large-scale scene reconstruction. ACM Transactions on Graphics (ToG), 36, 1-13.
[10]
Luo, Keyang, Tao Guan, Lili Ju, Haipeng Huang & Yawei Luo. 2019. P-mvsnet: Learning patch-wise matching confidence aggregation for multi-view stereo. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 10452-10461.
[11]
Wang, Fangjinhua, Silvano Galliani, Christoph Vogel, Pablo Speciale & Marc Pollefeys. 2021. Patchmatchnet: Learned multi-view patchmatch stereo. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 14194-14203.
[12]
Yao, Yao, Zixin Luo, Shiwei Li, Tian Fang & Long Quan. 2018. Mvsnet: Depth inference for unstructured multi-view stereo. In Proceedings of the European Conference on Computer Vision (ECCV), 767-783.
[13]
Yao, Yao, Zixin Luo, Shiwei Li, Tianwei Shen, Tian Fang & Long Quan. 2019. Recurrent mvsnet for high-resolution multi-view stereo depth inference. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5525-5534.
[14]
Yu, Zehao & Shenghua Gao. 2020. Fast-mvsnet: Sparse-to-dense multi-view stereo with learned propagation and gauss-newton refinement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1949-1958.
[15]
Zhong, Zhun, Liang Zheng, Guoliang Kang, Shaozi Li & Yi Yang. 2020. Random erasing data augmentation. In Proceedings of the AAAI conference on artificial intelligence, 13001-13008.

Cited By

View all
  • (2023)A multi-view 3D reconstruction method that integrates patchmatch and efficient channel attention cascades2023 5th International Conference on Frontiers Technology of Information and Computer (ICFTIC)10.1109/ICFTIC59930.2023.10456051(811-814)Online publication date: 17-Nov-2023

Index Terms

  1. A Multi-view Matching Method Based on PatchmatchNet with Sparse Point Information

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WSSE '22: Proceedings of the 4th World Symposium on Software Engineering
    September 2022
    187 pages
    ISBN:9781450396950
    DOI:10.1145/3568364
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 December 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. 3D reconstruction
    2. Data augmentation
    3. Depth estimation
    4. Initial depth
    5. Random erasing

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • National Natural Science Foundation of China
    • Fuxiaquan National Independent Innovation Demonstration Zone collaborative innovation platform project
    • Fujian science and technology plan projects
    • Fujian Province 2019 social sciences planning projects funding

    Conference

    WSSE 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)29
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 21 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)A multi-view 3D reconstruction method that integrates patchmatch and efficient channel attention cascades2023 5th International Conference on Frontiers Technology of Information and Computer (ICFTIC)10.1109/ICFTIC59930.2023.10456051(811-814)Online publication date: 17-Nov-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media