skip to main content
10.1007/978-3-642-36669-7_83guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

An efficient active learning method based on random sampling and backward deletion

Published: 15 October 2012 Publication History

Abstract

Active learning aims to select data samples which would be the most informative to improve classification performance so that their class labels are obtained from an expert. Recently, an active learning method based on locally linear reconstruction(LLR) has been proposed and the performance of LLR was demonstrated well in the experiments comparing with other active learning methods. However, the time complexity of LLR is very high due to matrix operations required repeatedly for data selection. In this paper, we propose an efficient active learning method based on random sampling and backward deletion. We select a small subset of data samples by random sampling from the total data set, and a process of deleting the most redundant points in the subset is performed iteratively by searching for a pair of data samples having the smallest distance. The distance measure using a graph-based shortest path distance is utilized in order to consider the underlying data distribution. Experimental results demonstrate that the proposed method has very low time complexity, but the prediction power of data samples selected by our method outperforms that by LLR.

References

[1]
Settles, B.: Active learning literature survey: Computer sciences technical report 1648, University of Wisconsin-Madison (2009).
[2]
Freund, Y., Seung, H. S., Shamir, E., Tishby, N.: Selective sampling using the query by committee algorithm. Machine Learning 28(2-3), 138-168 (1997).
[3]
Tong, S., Keller, D.: Support vector machine active learning with applications to text classifications. Journal of Machine Learning Research 2, 45-66 (2002).
[4]
Cohn, D. A., Ghahramani, Z., Jordan, M. I.: Active learning with statistical models. Journal of Artificial Intelligence Research 4, 129-145 (1996).
[5]
Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using Gaussian Fields and Harmonic functions. In: Proceedings of ICML, pp. 912-919 (2003).
[6]
Atkinson, A., Donev, A., Tobias, R.: Optimum Experimental Designs, with SAS. Oxford Univ. Press (2007).
[7]
Asprey, S. P., Macchietto, S.: Designing robust optimal dynamic experimetns. J. Process Control 12(4), 545-556 (2002).
[8]
Hardin, R. H., Sloane, N. J. A.: A new approach to the construction of optimal designs: J. Statistical Planning and Inference 37(3), 339-369 (1993).
[9]
Zhang, L., Chen, C., Bu, J., Cai, D., He, X., Huang, T. S.: Active Learning Based on Locally Linear Reconstruction. IEEE Trans. Pattern Anal. Machine Int. 33(10), 2026-2038 (2011).
[10]
Groves, R. M., Fowler Jr., F. J.: Survey methodology. Wiley (2010).
[11]
Tan, P., Steinbach, M., Kumar, V.: Introduction to Data Mining. Addison Wesley, Boston (2006).
[12]
UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/
[13]
A Library for SVM, http://www.csie.ntu.edu.tw/~cjlin/libsvm/
[14]
Machine Learning Group at University of Waikato, http://www.cs.waikato.ac.nz/ml/weka/

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
IScIDE'12: Proceedings of the third Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering
October 2012
875 pages
ISBN:9783642366680
  • Editors:
  • Jian Yang,
  • Fang Fang,
  • Changyin Sun

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 15 October 2012

Author Tags

  1. active learning
  2. optimal experimental design
  3. random sampling

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media