skip to main content
10.1145/3293353.3293434acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvgipConference Proceedingsconference-collections
research-article

Integrating Objects into Monocular SLAM: Line Based Category Specific Models

Published: 03 May 2020 Publication History

Abstract

We propose a novel Line based parameterization for category specific CAD models. The proposed parameterization associates 3D category-specific CAD model and object under consideration using a dictionary based RANSAC method that uses object Viewpoints as prior and edges detected in the respective intensity image of the scene. The association problem is posed as a classical Geometry problem rather than being dataset driven, thus saving the time and labour that one invests in annotating dataset to train Keypoint Network[25, 26] for different category objects. Besides eliminating the need of dataset preparation, the approach also speeds up the entire process as this method processes the image only once for all objects, thus eliminating the need of invoking the network for every object in an image across all images. A 3D-2D edge association module followed by a resection algorithm for lines is used to recover object poses. The formulation optimizes for shape and pose of the object, thus aiding in recovering object 3D structure more accurately. Finally, a Factor Graph formulation is used to combine object poses with camera odometry to formulate a SLAM problem.

References

[1]
Sameer Agarwal and Keir Mierle. 2015. Olhers,âĂIJCeres solver,âĂİ. (2015).
[2]
Koray Çelik and Arun K Somani. 2013. Monocular vision SLAM for indoor aerial vehicles. Journal of electrical and computer engineering 2013 (2013), 4--1573.
[3]
Angel X Chang, Thomas Funkhouser, Leonidas Guibas, Pat Hanrahan, Qixing Huang, Zimo Li, Silvio Savarese, Manolis Savva, Shuran Song, Hao Su, and others. 2015. Shapenet: An information-rich 3d model repository. arXiv preprint arXiv:1512.03012 (2015).
[4]
Younggun Cho and Ayoung Kim. 2017. Visibility enhancement for underwater visual SLAM based on underwater light scattering model. In Robotics and Automation (ICRA), 2017 IEEE International Conference on. IEEE, 710--717.
[5]
Siddharth Choudhary, Luca Carlone, Carlos Nieto, John Rogers, Zhen Liu, Henrik I Christensen, and Frank Dellaert. 2016. Multi Robot Object-Based SLAM. In International Symposium on Experimental Robotics. Springer, 729--741.
[6]
Marco Crocco, Cosimo Rubino, and Alessio Del Bue. 2016. Structure from motion with objects. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4141--4149.
[7]
F Dellaert and others. 2012. GTSAM. URL: https://borg.cc.gatech.edu (2012).
[8]
Frank Dellaert and others. 2012. Gtsam. URL: https://borg.cc.gatech.edu (2012).
[9]
Jakob Engel, Thomas Schöps, and Daniel Cremers. 2014. LSD-SLAM: Large-scale direct monocular SLAM. In European Conference on Computer Vision. Springer, 834--849.
[10]
Mark Everingham, Luc Van Gool, Christopher KI Williams, John Winn, and Andrew Zisserman. 2010. The pascal visual object classes (voc) challenge. International journal of computer vision 88, 2 (2010), 303--338.
[11]
Pedro F Felzenszwalb, Ross B Girshick, David McAllester, and Deva Ramanan. 2010. Object detection with discriminatively trained part-based models. IEEE transactions on pattern analysis and machine intelligence 32, 9 (2010), 1627--1645.
[12]
Christian Forster, Zichao Zhang, Michael Gassner, Manuel Werlberger, and Davide Scaramuzza. 2017. Svo: Semidirect visual odometry for monocular and multicamera systems. IEEE Transactions on Robotics 33, 2 (2017), 249--265.
[13]
Dorian Gálvez-López, Marta Salas, Juan D Tardós, and JMM Montiel. 2016. Realtime monocular object slam. Robotics and Autonomous Systems 75 (2016), 435--449.
[14]
Ian Jolliffe. 2011. Principal component analysis. In International encyclopedia of statistical science. Springer, 1094--1096.
[15]
Michael Kaess, Hordur Johannsson, Richard Roberts, Viorela Ila, John J Leonard, and Frank Dellaert. 2012. iSAM2: Incremental smoothing and mapping using the Bayes tree. The International Journal of Robotics Research 31, 2 (2012), 216--235.
[16]
Rainer Kümmerle, Giorgio Grisetti, Hauke Strasdat, Kurt Konolige, and Wolfram Burgard. 2011. g 2 o: A general framework for graph optimization. In Robotics and Automation (ICRA), 2011 IEEE International Conference on. IEEE, 3607--3613.
[17]
Abhijit Kundu, K Madhava Krishna, and CV Jawahar. 2011. Realtime multibody visual SLAM with a smoothly moving monocular camera. In Computer Vision (ICCV), 2011 IEEE International Conference on. IEEE, 2080--2087.
[18]
Henning Lategahn, Andreas Geiger, and Bernd Kitt. 2011. Visual SLAM for autonomous ground vehicles. In Robotics and Automation (ICRA), 2011 IEEE International Conference on. IEEE, 1732--1737.
[19]
Cindy Leung, Shoudong Huang, and Gamini Dissanayake. 2006. Active SLAM using model predictive control and attractor based exploration. In Intelligent Robots and Systems, 2006 IEEE/RSJ International Conference on. IEEE, 5026--5031.
[20]
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In European conference on computer vision. Springer, 21--37.
[21]
Beipeng Mu, Shih-Yuan Liu, Liam Paull, John Leonard, and Jonathan P How. 2016. SLAM with objects using a nonparametric pose graph. In Intelligent Robots and Systems (IROS), 2016 IEEE/RSJ International Conference on. IEEE, 4602--4609.
[22]
Montiel J. M. M. Mur-Artal, Raúl and Juan D. Tardós. 2015. ORB-SLAM: a Versatile and Accurate Monocular SLAM System. IEEE Transactions on Robotics 31, 5 (2015), 1147--1163.
[23]
Raul Mur-Artal, Jose Maria Martinez Montiel, and Juan D Tardos. 2015. ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE Transactions on Robotics 31, 5 (2015), 1147--1163.
[24]
J Krishna Murthy, GV Sai Krishna, Falak Chhaya, and K Madhava Krishna. 2017. Reconstructing vehicles from a single image: Shape priors for road scene understanding. In Robotics and Automation (ICRA), 2017 IEEE International Conference on. IEEE, 724--731.
[25]
Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked hourglass networks for human pose estimation. In European Conference on Computer Vision. Springer, 483--499.
[26]
Parv Parkhiya, Rishabh Khawad, J Krishna Murthy, Brojeshwar Bhowmick, and K Madhava Krishna. 2018. Constructing Category-Specific Models for Monocular Object-SLAM. arXiv preprint arXiv:1802.09292 (2018).
[27]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 779--788.
[28]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems. 91--99.
[29]
Renato F Salas-Moreno, Richard A Newcombe, Hauke Strasdat, Paul HJ Kelly, and Andrew J Davison. 2013. Slam++: Simultaneous localisation and mapping at the level of objects. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1352--1359.
[30]
Hao Su, Charles R Qi, Yangyan Li, and Leonidas J Guibas. 2015. Render for cnn: Viewpoint estimation in images using cnns trained with rendered 3d model views. In Proceedings of the IEEE International Conference on Computer Vision. 2686--2694.
[31]
Niko Sünderhauf, Feras Dayoub, Sean McMahon, Markus Eich, Ben Upcroft, and Michael Milford. 2015. SLAM-Quo Vadis? In support of object oriented and semantic SLAM. (2015).
[32]
Shubham Tulsiani, Abhishek Kar, Joao Carreira, and Jitendra Malik. 2017. Learning category-specific deformable 3d models for object reconstruction. IEEE transactions on pattern analysis and machine intelligence 39, 4 (2017), 719--731.
[33]
Rafael Grompone Von Gioi, Jérémie Jakubowicz, Jean-Michel Morel, and Gregory Randall. 2012. LSD: a line segment detector. Image Processing On Line 2 (2012), 35--55.
[34]
Shichao Yang and Sebastian Scherer. 2018. CubeSLAM: Monocular 3D Object Detection and SLAM without Prior Models. arXiv preprint arXiv:1806.00557 (2018).
[35]
Rui Zhu, Chaoyang Wang, Chen-Hsuan Lin, Ziyan Wang, and Simon Lucey. 2018. Object-centric photometric bundle adjustment with deep shape prior. In 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 894--902.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICVGIP '18: Proceedings of the 11th Indian Conference on Computer Vision, Graphics and Image Processing
December 2018
659 pages
ISBN:9781450366151
DOI:10.1145/3293353
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 May 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Edge Association
  2. Object SLAM
  3. Shape and pose Optimization

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

ICVGIP 2018

Acceptance Rates

Overall Acceptance Rate 95 of 286 submissions, 33%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)3
Reflects downloads up to 18 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media