Virtual Rephotography: Novel View Prediction Error for 3D Reconstruction

Authors:

Michael Waechter,

Simon Fuhrmann,

Michael Goesele

ACM Transactions on Graphics (TOG), Volume 36, Issue 1

Article No.: 8, Pages 1 - 11

https://doi.org/10.1145/2999533

Published: 06 January 2017

Abstract

The ultimate goal of many image-based modeling systems is to render photo-realistic novel views of a scene without visible artifacts. Existing evaluation metrics and benchmarks focus mainly on the geometric accuracy of the reconstructed model, which is, however, a poor predictor of visual accuracy. Furthermore, geometric accuracy alone cannot be used to evaluate systems that either lack a geometric scene representation or use only coarse proxy geometry, such as light fields and most image-based rendering systems. We propose a unified evaluation approach based on novel view prediction error that can analyze the visual quality of any method that renders novel views from input images. One key advantage of this approach is that it does not require ground-truth geometry, which greatly simplifies the creation of test datasets and benchmarks. It also allows us to evaluate the quality of an unknown scene during the acquisition and reconstruction process, which is useful for acquisition planning. We evaluate our approach on a range of methods, including standard geometry-plus-texture pipelines as well as image-based rendering techniques; compare it to existing geometry-based benchmarks; demonstrate its utility for a range of use cases; and present a new virtual rephotography-based benchmark for image-based modeling and rendering systems.
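
The core idea of a novel view prediction error can be illustrated with a minimal sketch. It assumes that some photos are held out from reconstruction, that the method under test can render an image for each held-out camera, and that images are small grayscale grids. The function names, the `None` convention for unrendered pixels, and the mean-absolute-difference metric are illustrative assumptions, not the metrics or code used in the article.

```python
from statistics import mean


def prediction_error(rendered, photo):
    """Compare one rendered view against the held-out photo.

    Both arguments are equally sized 2D lists of grayscale values in [0, 1].
    A `None` entry in `rendered` marks a pixel the method could not produce
    (an assumed convention for this sketch).
    Returns (mean absolute error over rendered pixels, completeness).
    """
    diffs = []
    total = 0
    for row_r, row_p in zip(rendered, photo):
        for r, p in zip(row_r, row_p):
            total += 1
            if r is not None:
                diffs.append(abs(r - p))
    completeness = len(diffs) / total if total else 0.0
    error = mean(diffs) if diffs else float("nan")
    return error, completeness


def evaluate(render_view, held_out):
    """Average error and completeness over held-out (camera, photo) pairs.

    `render_view` is a hypothetical callable mapping a camera to an image;
    any method that renders novel views could be plugged in here.
    """
    results = [prediction_error(render_view(cam), photo)
               for cam, photo in held_out]
    return mean(e for e, _ in results), mean(c for _, c in results)
```

Because the comparison needs only held-out photos and a renderer, no ground-truth geometry enters the computation. Reporting completeness next to the error is a design choice of this sketch: it keeps a method from looking accurate simply by leaving difficult pixels unrendered.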

Supplementary Material

waechter.zip (250.45 MB): Supplemental movie and image files for "Virtual Rephotography: Novel View Prediction Error for 3D Reconstruction"

tog-24.mp4 (336.98 MB): MP4 file

Index Terms

Virtual Rephotography: Novel View Prediction Error for 3D Reconstruction
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Reconstruction
  2. Computer graphics
    1. Image manipulation
      1. Image-based rendering

Published In

ACM Transactions on Graphics Volume 36, Issue 1

February 2017

165 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/2996392

Editor:
Kavita Bala
Cornell University

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 January 2017

Accepted: 01 September 2016

Revised: 01 July 2016

Received: 01 November 2015

Published in TOG Volume 36, Issue 1

Qualifiers

Research-article
Research
Refereed

Funding Sources

Intel Visual Computing Institute (project RealityScan)
Microsoft Research
