skip to main content
10.1145/2632856.2632947acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicimcsConference Proceedingsconference-collections
research-article

Multiple Feature Fusion Based Hand-held Object Recognition with RGB-D data

Published: 10 July 2014 Publication History

Abstract

With the advance of computer technology and smart device, many technologies and applications have been developed to enhance the efficiency of human-computer interaction (HCI). For human, the hand is a natural and direct way in communication. Hand-held Object Recognition (HHOR), which is to predict the label for the object people hold in hand, can help machines in understanding the environment and people's intentions. However, it has not been well studied in the community. So, in this paper, we proposed a novel feature fusion based method for hand-held object recognition with RGB-D data. First, the skeleton information is used to initially locate the object and with depth map we extract object region in a region-growing manner. Then on the corresponding object point cloud, we use Multiple Kernel Learning (MKL) to fuse the shape feature with color feature to obtain the advantages of them. Specially, we collected a dataset, which contains 12800 video frames of 16 categories and each frame captures the visual image, depth map and user skeleton data. The experiment shows promising results in both segmentation and recognition.

References

[1]
Kanezaki, A., Suzuki, T., Harada, T., & Kuniyoshi, Y. (2011, May). Fast object detection for robots in a cluttered indoor environment using integral 3D feature table. ICRA, 2011 IEEE International Conference on (pp. 4026--4033). IEEE.
[2]
Marton, Z. C., Pangercic, D., Rusu, R. B., Holzbach, A., & Beetz, M. (2010, December). Hierarchical object geometric categorization and appearance classification for mobile manipulation. In Humanoid Robots (Humanoids), 2010 10th IEEE-RAS International Conference on (pp. 365--370). IEEE.
[3]
Knopp, J., Prasad, M., Willems, G., Timofte, R., & Van Gool, L. (2010). Hough transform and 3D SURF for robust three dimensional classification. In Computer Vision--ECCV 2010 (pp. 589--602). Springer Berlin Heidelberg.
[4]
Guo, Y., Sohel, F. A., Bennamoun, M., Wan, J., & Lu, M. (2013, February). RoPS: A local feature descriptor for 3D rigid objects based on rotational projection statistics. In ICCSPA, 2013 1st International Conference on (pp. 1--6). IEEE.
[5]
Kanezaki, A., Marton, Z. C., Pangercic, D., Harada, T., Kuniyoshi, Y., & Beetz, M. (2011, September). Voxelized shape and color histograms for RGB-D. IROS, Workshop on Active Semantic Perception and Object Search in the Real World, San Francisco, CA, USA.
[6]
do Nascimento, E. R., Oliveira, G. L., Vieira, A. W., & Campos, M. F. (2013). On the development of a robust, fast and lightweight keypoint descriptor. Neurocomputing, 120, 141--155.
[7]
Wohlkinger, W., & Vincze, M. (2011, December). Ensemble of shape functions for 3D object classification. In Robotics and Biomimetics (ROBIO), (pp. 2987--2992). IEEE
[8]
Ip, C. Y., Lapadat, D., Sieger, L., & Regli, W. C. (2002, June). Using shape distributions to compare solid models. In Proceedings of the seventh ACM symposium on Solid modeling and applications (pp. 273--280). ACM.
[9]
Sonnenburg, S., Rätsch, G., Schäfer, C., & Schöölkopf, B. (2006). Large scale multiple kernel learning. The Journal of Machine Learning Research, 7, 1531--1565

Cited By

View all

Index Terms

  1. Multiple Feature Fusion Based Hand-held Object Recognition with RGB-D data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICIMCS '14: Proceedings of International Conference on Internet Multimedia Computing and Service
    July 2014
    430 pages
    ISBN:9781450328104
    DOI:10.1145/2632856
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    In-Cooperation

    • NSF of China: National Natural Science Foundation of China
    • Beijing ACM SIGMM Chapter

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 July 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Hand-held object recognition
    2. Multiple Kernel Learning
    3. RGB-D
    4. multiple feature fusion
    5. region growing

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICIMCS '14

    Acceptance Rates

    Overall Acceptance Rate 163 of 456 submissions, 36%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 21 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media