skip to main content
10.1145/1291233.1291297acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Feature selection using principal feature analysis

Published: 29 September 2007 Publication History

Abstract

Dimensionality reduction of a feature set is a common preprocessing step used for pattern recognition and classification applications. Principal Component Analysis (PCA) is one of the popular methods used, and can be shown to be optimal using different optimality criteria. However, it has the disadvantage that measurements from all the original features are used in the projection to the lower dimensional space. This paper proposes a novel method for dimensionality reduction of a feature set by choosing a subset of the original features that contains most of the essential information, using the same criteria as PCA. We call this method Principal Feature Analysis (PFA). The proposed method is successfully applied for choosing the principal features in face tracking and content-based image retrieval (CBIR) problems. Automated annotation of digital pictures has been a highly challenging problem for computer scientists since the invention of computers. The capability of annotating pictures by computers can lead to breakthroughs in a wide range of applications including Web image search, online picture-sharing communities, and scientific experiments. In our work, by advancing statistical modeling and optimization techniques, we can train computers about hundreds of semantic concepts using example pictures from each concept. The ALIPR (Automatic Linguistic Indexing of Pictures - Real Time) system of fully automatic and high speed annotation for online pictures has been constructed. Thousands of pictures from an Internet photo-sharing site, unrelated to the source of those pictures used in the training process, have been tested. The experimental results show that a single computer processor can suggest annotation terms in real-time and with good accuracy.

References

[1]
Lisboa, P. J. G., Mehri-Dehnavi, R. Sensitivity Methods for Variable Selection Using the MLP. International Workshop on Neural Networks for Identification, Control, Robotics and Signal/Image, 1996, 330--338.
[2]
Lin, T.-S., Meador, J. Statistical Feature Extraction and Selection for IC Test Pattern Analysis. Circuits and systems, vol 1., 1992, 391--394.
[3]
Hocking, R. R. Development in Linear Regression Methodology: 1959-1982. Technometrics, vol. 25, 1983, 219--249.
[4]
McCabe, G. P. Principal Variables. Technometrics, vol. 26, 1984, 127--134.
[5]
Jolliffe, I. T. Principal Component Analysis. Springer-Verlag, New-York, 1986.
[6]
Krzanowski, W. J. Selection of Variables to Preserve Multivariate Data Structure, Using Principal Component Analysis. Applied Statistics - Journal of the Royal Statistical Society Series C, vol. 36, 1987, 22--33.
[7]
Krzanowski, W. J. A Stopping Rule for structure - Preserving Variable Selection. Statistics and Computing, March vol. 6, 1996, 51--56.
[8]
Gower, J. C., Statistical Methods of Comparing Different Multivariate Analyses of the Same Data, Mathematics in the Archaeological and Historical Sciences, University Press, Edinburgh, 1971, 138--149.

Cited By

View all
  • (2024)Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective StudyJMIR Medical Informatics10.2196/5289612(e52896-e52896)Online publication date: 26-Jul-2024
  • (2024)An Intelligent Approach for Predicting Stock Market Movements in Emerging Markets Using Optimized Technical Indicators and Neural NetworksEconomics10.1515/econ-2022-007318:1Online publication date: 25-Mar-2024
  • (2024)PACT: Accurate Power Analysis and Carbon Emission Tracking for SustainabilityProceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design10.1145/3665314.3670809(1-6)Online publication date: 5-Aug-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '07: Proceedings of the 15th ACM international conference on Multimedia
September 2007
1115 pages
ISBN:9781595937025
DOI:10.1145/1291233
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 September 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. discriminant analysis
  2. feature extraction
  3. feature selection
  4. principal component analysis

Qualifiers

  • Article

Conference

MM07

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)181
  • Downloads (Last 6 weeks)19
Reflects downloads up to 27 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Unsupervised Feature Selection to Identify Important ICD-10 and ATC Codes for Machine Learning on a Cohort of Patients With Coronary Heart Disease: Retrospective StudyJMIR Medical Informatics10.2196/5289612(e52896-e52896)Online publication date: 26-Jul-2024
  • (2024)An Intelligent Approach for Predicting Stock Market Movements in Emerging Markets Using Optimized Technical Indicators and Neural NetworksEconomics10.1515/econ-2022-007318:1Online publication date: 25-Mar-2024
  • (2024)PACT: Accurate Power Analysis and Carbon Emission Tracking for SustainabilityProceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design10.1145/3665314.3670809(1-6)Online publication date: 5-Aug-2024
  • (2024)User gait biometrics in smart ambient applications through wearable accelerometer signals: an analysis of the influence of training setup on recognition accuracyJournal of Ambient Intelligence and Humanized Computing10.1007/s12652-024-04790-215:7(2967-2979)Online publication date: 15-Apr-2024
  • (2024)Feature Engineering for Music/Speech Detection in Costa Rica Radio BroadcastPattern Recognition10.1007/978-3-031-62836-8_9(84-95)Online publication date: 19-Jun-2024
  • (2023)Feature selection in the contrastive analysis settingProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3669009(66102-66126)Online publication date: 10-Dec-2023
  • (2023)Feature Selection and Damage Identification for Urban Railway Track Using Bayesian Globally Sparse Principal Component AnalysisSustainability10.3390/su1506539115:6(5391)Online publication date: 17-Mar-2023
  • (2023)Breast Cancer Prediction Using Fine Needle Aspiration Features and Upsampling with Supervised Machine LearningCancers10.3390/cancers1503068115:3(681)Online publication date: 22-Jan-2023
  • (2023)PSEV-BF Methodology for Object Recognition of Birds in Uncontrolled EnvironmentsAxioms10.3390/axioms1202019712:2(197)Online publication date: 13-Feb-2023
  • (2023)Challenges in Using mHealth Data From Smartphones and Wearable Devices to Predict Depression Symptom Severity: Retrospective AnalysisJournal of Medical Internet Research10.2196/4523325(e45233)Online publication date: 14-Aug-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media