short-paper

Deep CNN based pseudo-concept selection and modeling for generation of semantic multinomial representation of scene images

Authors:

Dileep Aroor Dinesh,

Veena ThenkanidiyoorAuthors Info & Claims

CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data

Pages 336 - 339

https://doi.org/10.1145/3152494.3167985

Published: 11 January 2018 Publication History

Abstract

Though recent convolutional neural network (CNN) based method for scene classification task show impressive results but lacks in capturing the complex semantic content of the scene images. To reduce the semantic gap a semantic multinomial (SMN) representation is introduced. SMN representation corresponds to a vector of posterior probabilities of concepts. The core part of SMN generation is building the concept model. For building the concept model, it is necessary to have ground truth (true) concept labels for every image in the database. In this research work, we propose novel deep CNN based SMN representation which exploits convolutional layer filters response as pseudo concepts to build the concept model in the absence of true concept labels. The effectiveness of the proposed approach is studied for scene classification tasks on standard datasets like MIT67 and SUN397.

References

[1]

Navneet Dalal and Bill Triggs. {n. d.}. Histograms of oriented gradients for human detection. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), Vol. 1. 886--893.

Digital Library

[2]

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In Proceedings of the IEEE conference on computer vision and pattern recognition. 248--255.

[3]

Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. 2008. LIBLINEAR: A Library for Large Linear Classification. Journal of Machine Learning Research 9 (2008), 1871--1874.

Digital Library

[4]

Shikha Gupta, A. D Dileep, and Thenkanidiyoor Veena. 2017. The Semantic Multinomial Representation of Images Obtained using Dynamic Kernel based Pseudo-concept SVMs. National Conference on Communication (2017).

[5]

Kai Kang and Xiaogang Wang. 2014. Fully convolutional neural networks for crowd segmentation. arXiv preprint arXiv:1411.4464 (2014).

[6]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105.

Digital Library

[7]

David G Lowe. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60, 2 (2004), 91--110.

Digital Library

[8]

Aude Oliva and Antonio Torralba. 2001. Modeling the shape of the scene: A holistic representation of the spatial envelope. International journal of computer vision 42, 3 (2001), 145--175.

Digital Library

[9]

Ariadna Quattoni and Antonio Torralba. 2009. Recognizing indoor scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition. 413--420.

[10]

Nikhil Rasiwasia, Pedro J Moreno, and Nuno Vasconcelos. 2007. Bridging the gap: Query by semantic example. Multimedia, IEEE Transactions on 9, 5 (2007), 923--938.

Digital Library

[11]

Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).

[12]

Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1--9.

[13]

Julia Vogel and Bernt Schiele. 2004. Natural scene retrieval based on a semantic modeling step. In International Conference on Image and Video Retrieval. Springer, 207--215.

[14]

Jianxiong Xiao, James Hays, Krista A Ehinger, Aude Oliva, and Antonio Torralba. 2010. Sun database: Large-scale scene recognition from abbey to zoo. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3485--3492.

[15]

Jason Yosinski, Jeff Clune, Anh Nguyen, Thomas Fuchs, and Hod Lipson. 2015. Understanding Neural Networks Through Deep Visualization. In Deep Learning Workshop, International Conference on Machine Learning (ICML).

[16]

Fang Zhao, Yongzhen Huang, Liang Wang, and Tieniu Tan. 2015. Deep semantic ranking based hashing for multi-label image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1556--1564.

[17]

Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba. 2017. Places: A 10 million Image Database for Scene Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (2017).

[18]

Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, and Aude Oliva. 2014. Learning deep features for scene recognition using places database. In Advances in neural information processing systems. 487--495.

Digital Library

Cited By

Gupta SDileep AThenkanidiyoor V(2021)Recognition of varying size scene images using semantic analysis of deep activation mapsMachine Vision and Applications10.1007/s00138-021-01168-832:2Online publication date: 1-Mar-2021
https://dl.acm.org/doi/10.1007/s00138-021-01168-8

Recommendations

Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition

In this work, we address the task of scene recognition from image data. A scene is a spatially correlated arrangement of various visual semantic contents also known as concepts, e.g., “chair,” “car,” “sky,” etc. Representation learning using visual ...
A deep CNN based transfer learning method for false positive reduction

A low false positive (FP) rate is of great importance for the use of a Computer Aided Detection (CAD) system to detect pulmonary nodules in thoracic Computed Tomography (CT). However, due to the variations of nodules in appear and size, it is still a ...
Medical image fusion using optimal feature selection methods based on second generation contourlet transform

As a novel of multi-resolution analysis tool, second generation contourlet transform SGCT provides flexible multiresolution, anisotropy, and directional expansion for medical imaging systems. In this paper, a novel fusion method for multimodal medical ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

CODS-COMAD '18: Proceedings of the ACM India Joint International Conference on Data Science and Management of Data

January 2018

379 pages

ISBN:9781450363419

DOI:10.1145/3152494

Conference Chair:
Sayan Ranu
IIT Delhi
,
General Chairs:
Niloy Ganguly
IIT Kharagpur
,
Raghu Ramakrishnan
Microsoft
,
Program Chairs:
Sunita Sarawagi
IIT Bombay
,
Shourya Roy
American Express Big Data Labs

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 January 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

CoDS-COMAD '18

CoDS-COMAD '18: The ACM India Joint International Conference on Data Science & Management of Data

January 11 - 13, 2018

Goa, India

Acceptance Rates

CODS-COMAD '18 Paper Acceptance Rate 50 of 150 submissions, 33%;

Overall Acceptance Rate 197 of 680 submissions, 29%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
73
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 06 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Gupta SDileep AThenkanidiyoor V(2021)Recognition of varying size scene images using semantic analysis of deep activation mapsMachine Vision and Applications10.1007/s00138-021-01168-832:2Online publication date: 1-Mar-2021
https://dl.acm.org/doi/10.1007/s00138-021-01168-8

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents