skip to main content
10.1145/3343031.3351085acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

iDFusion: Globally Consistent Dense 3D Reconstruction from RGB-D and Inertial Measurements

Published: 15 October 2019 Publication History

Abstract

We present a practical fast, globally consistent and robust dense 3D reconstruction system, iDFusion, by exploring the joint benefit of both the visual (RGB-D) solution and inertial measurement unit (IMU). A global optimization considering all the previous states is adopted to maintain high localization accuracy and global consistency, yet its complexity of being linear to the number of all previous camera/IMU observations seriously impedes real-time implementation. We show that the global optimization can be solved efficiently at the complexity linear to the number of keyframes, and further realize a real-time dense 3D reconstruction system given the estimated camera states. Meanwhile, for the sake of robustness, we propose a novel loop-validity detector based on the estimated bias of the IMU state. By checking the consistency of camera movements, a false loop closure constraint introduces manifest inconsistency between the camera movements and IMU measurements. Experiments reveal that iDFusion owns superior reconstruction performance running in 25 fps on CPU computing of portable devices, under challenging yet practical scenarios including texture-less, motion blur, and repetitive contents.

References

[1]
Andrew J. Davison, “Futuremapping: The computational structure of spatial AI systems,” CoRR, vol. abs/1803.11288, 2018.
[2]
Angela Dai, Matthias Nießner, Michael Zollöfer, Shahram Izadi, and Christian Theobalt, “Bundlefusion: Real-time globally consistent 3d reconstruction using on-the-fly surface re-integration,” ACM Transactions on Graphics 2017 (TOG), 2017.
[3]
Sungjoon Choi, Qian-Yi Zhou, and Vladlen Koltun, “Robust reconstruction of indoor scenes,” in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on. IEEE, 2015, pp. 5556--5565.
[4]
Thomas Whelan, Renato F Salas-Moreno, Ben Glocker, Andrew J Davison, and Stefan Leutenegger, “Elasticfusion: Real-time dense slam and light source estimation,” The International Journal of Robotics Research, vol. 35, no. 14, pp. 1697--1716, 2016.
[5]
Lei Han and Lu Fang, “Flashfusion: Real-time globally consistent dense 3d reconstruction using cpu computing,” RSS, 2018.
[6]
Lei Han, Lan Xu, Dmytro Bobkov, Eckehard Steinbach, and Lu Fang, “Real-time global registration for globally consistent rgb-d slam,” IEEE Transactions on Robotics, 2019.
[7]
Christian Forster, Luca Carlone, Frank Dellaert, and Davide Scaramuzza, “Imu preintegration on manifold for efficient visual-inertial maximum-a-posteriori estimation,” Georgia Institute of Technology, 2015.
[8]
Lei Han and Lu Fang, “Mild: Multi-index hashing for appearance based loop closure detection,” in Multimedia and Expo (ICME), 2017 IEEE International Conference on. IEEE, 2017, pp. 139--144.
[9]
Qian-Yi Zhou, Jaesik Park, and Vladlen Koltun, “Fast global registration,” in European Conference on Computer Vision. Springer, 2016, pp. 766--782.
[10]
Brian Curless and Marc Levoy, “A volumetric method for building complex models from range images,” 1996.
[11]
M. Nießner, M. Zollhöfer, S. Izadi, and M. Stamminger, “Real-time 3d reconstruction at scale using voxel hashing,” ACM Transactions on Graphics (TOG), 2013.
[12]
Raul Mur-Artal and Juan D Tardós, “Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras,” IEEE Transactions on Robotics, vol. 33, no. 5, pp. 1255--1262, 2017.
[13]
Jonathan Ventura, Clemens Arth, Gerhard Reitmayr, and Dieter Schmalstieg, “Global localization from monocular slam on a mobile phone,” IEEE transactions on visualization and computer graphics, vol. 20, no. 4, pp. 531--539, 2014.
[14]
Sebastian Thrun, Wolfram Burgard, and Dieter Fox, Probabilistic robotics, MIT press, 2005.
[15]
Anastasios I Mourikis and Stergios I Roumeliotis, “A multi-state constraint kalman filter for vision-aided inertial navigation,” in Proceedings 2007 IEEE International Conference on Robotics and Automation. IEEE, 2007, pp. 3565--3572.
[16]
Mingyang Li, Byung Hyung Kim, and Anastasios I Mourikis, “Real-time motion tracking on a cellphone using inertial sensing and a rolling-shutter camera,” in 2013 IEEE International Conference on Robotics and Automation. IEEE, 2013, pp. 4712--4719.
[17]
Joel A. Hesch, Dimitrios G. Kottas, Sean L. Bowman, and Stergios I. Roumeliotis, “Observability-constrained vision-aided inertial navigation,” 2012.
[18]
Martin Brossard, Silvere Bonnabel, and Axel Barrau, “Invariant kalman filtering for visual inertial slam,” 2018 21st International Conference on Information Fusion (FUSION), pp. 2021--2028, 2018.
[19]
Tong Qin, Peiliang Li, and Shaojie Shen, “Vins-mono: A robust and versatile monocular visual-inertial state estimator,” IEEE Transactions on Robotics, vol. 34, no. 4, pp. 1004--1020, 2018.
[20]
Stefan Leutenegger, Simon Lynen, Michael Bosse, Roland Siegwart, and Paul Furgale, “Keyframe-based visual--inertial odometry using nonlinear optimization,” The International Journal of Robotics Research, vol. 34, no. 3, pp. 314--334, 2015.
[21]
R. Mur-Artal and J. D. Tardós, “Visual-inertial monocular slam with map reuse,” IEEE Robotics and Automation Letters, vol. 2, no. 2, pp. 796--803, April 2017.
[22]
Ming Hsiao, Eric Westman, and Michael Kaess, “Dense planar-inertial slam with structural constraints,” 2018 IEEE International Conference on Robotics and Automation (ICRA), pp. 6521--6528, 2018.
[23]
T. Laidlow, M. Bloesch, W. Li, and S. Leutenegger, “Dense rgb-d-inertial slam with map deformations,” in 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Sep. 2017, pp. 6741--6748.
[24]
Zunjie Zhu and Feng Xu, “Real-time indoor scene reconstruction with rgbd and inertia input,” 12 2018.
[25]
O. Kähler, V. Adrian Prisacariu, C. Yuheng Ren, X. Sun, P. Torr, and D. Murray, “Very high frame rate volumetric integration of depth images on mobile devices,” IEEE Transactions on Visualization and Computer Graphics, vol. 21, no. 11, pp. 1241--1250, Nov 2015.
[26]
Matthew Klingensmith, Ivan Dryanovski, Siddhartha Srinivasa, and Jizhong Xiao, “Chisel: Real time large scale 3d reconstruction onboard a mobile device using spatially hashed signed distance fields,” 07 2015.
[27]
C. Feng, Y. Taguchi, and V. R. Kamat, “Fast plane extraction in organized point clouds using agglomerative hierarchical clustering,” in 2014 IEEE International Conference on Robotics and Automation (ICRA), May 2014, pp. 6218--6225.
[28]
Jose Luis Blanco, “A tutorial on se(3) transformation parameterizations and on-manifold optimization,” 09 2010.
[29]
Lingni Ma, Christian Kerl, Jörg Stückler, and Daniel Cremers, “Cpa-slam: Consistent plane-model alignment for direct rgb-d slam,” in Robotics and Automation (ICRA), 2016 IEEE International Conference on. IEEE, 2016, pp. 1285--1291.
[30]
F. Endres, J. Hess, N. Engelhard, J. Sturm, D. Cremers, and W. Burgard, “An evaluation of the rgb-d slam system,” in 2012 IEEE International Conference on Robotics and Automation, May 2012, pp. 1691--1696.
[31]
Chen Wang, Minh-Chung Hoang, L. Xie, and Junsong Yuan, “Non-iterative rgb-d-inertial odometry,” 2017.
[32]
Chen Wang, Junsong Yuan, and Lihua Xie, “Non-iterative slam,” 2017 18th International Conference on Advanced Robotics (ICAR), pp. 83--90, 2017.
[33]
P. Furgale, J. Rehder, and R. Siegwart, “Unified temporal and spatial calibration for multi-sensor systems,” in 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, Nov 2013, pp. 1280--1286.
[34]
T. Qin, P. Li, and S. Shen, “Vins-mono: A robust and versatile monocular visual-inertial state estimator,” IEEE Transactions on Robotics, vol. 34, no. 4, pp. 1004--1020, Aug 2018.
[35]
J. Sturm, N. Engelhard, F. Endres, W. Burgard, and D. Cremers, “A benchmark for the evaluation of rgb-d slam systems,” in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, Oct 2012, pp. 573--580.
[36]
A. Handa, T. Whelan, J. McDonald, and A. J. Davison, “A benchmark for rgb-d visual odometry, 3d reconstruction and slam,” in 2014 IEEE International Conference on Robotics and Automation (ICRA), May 2014, pp. 1524--1531.
[37]
Jürgen Sturm, Nikolas Engelhard, Felix Endres, Wolfram Burgard, and Daniel Cremers, “A benchmark for the evaluation of rgb-d slam systems,” in Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on. IEEE, 2012, pp. 573--580.

Cited By

View all

Index Terms

  1. iDFusion: Globally Consistent Dense 3D Reconstruction from RGB-D and Inertial Measurements

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '19: Proceedings of the 27th ACM International Conference on Multimedia
    October 2019
    2794 pages
    ISBN:9781450368896
    DOI:10.1145/3343031
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 October 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. 3d reconstruction
    2. loop closure
    3. real-time slam
    4. visual-imu global optimization

    Qualifiers

    • Research-article

    Conference

    MM '19
    Sponsor:

    Acceptance Rates

    MM '19 Paper Acceptance Rate 252 of 936 submissions, 27%;
    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)20
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 26 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media