skip to main content
research-article

General Bounds on the Number of Examples Needed for Learning Probabilistic Concepts

Published: 01 April 1996 Publication History

Abstract

Given a p-concept classC, we define two important functionsdC( ),d C( ) (related to the notion of -shattering). We prove a lower bound of ((dC( ) 1)/( 2)) on the number of examples required for learningCwith an (, )-good model of probability. We prove similar lower bounds for some other learning models like learning with -bounded absolute (or quadratic) difference or learning with a -good decision rule. For the class ND of nondecreasing p-concepts on the real domain,dND( )= (1/ ). It can be shown that the resulting lower bounds for learning ND (within the models in consideration) are tight to within a logarithmic factor. In order to get the “almost-matching” upper bounds, we introduce a new method for designing learning algorithms: dynamic partitioning of the domain by use of splitting trees. The paper also contains a discussion of the gaps between the general lower bounds and the corresponding general upper bounds. It can be shown that, under very mild conditions, these gaps are quite narrow.

References

[1]
N. Abe and M. K. Warmuth, On the computational complexity of approximating distributions by probabilistic automata, in "Proceedings, 3rd Annual Workshop on Computational Learning Theory," pp. 52-66, Kaufmann, Los Angeles, CA, 1990.
[2]
N. Alon, S. Ben-David, N. Cesa-Bianchi, and D. Haussler, Scale-sensitive dimensions, uniform convergence, and learnability, in "Proceedings, 34th Symposium on Foundations of Computer Science," pp. 292-302, IEEE Comput. Soc. Los Alamitos, CA, 1993.
[3]
D. Angluin and P. Laird, Learning from noisy examples, Mach. Learning 2 , No. 4 (1988), 343-370.
[4]
S. Ben-David, N. Cesa-Bianchi, and P. M. Long, Characterizations of learnability for classes of {0, ..., n }-valued functions, in "Proceedings, 5th Annual Workshop on Computational Learning Theory," pp. 333-418, Assoc. Comput. Mach., New York, 1992.
[5]
A. Blumer, A. Ehrenfeucht, D. Haussler, and M. K. Warmuth, Learnability and the Vapnik-Chervonenkis dimension, J. Assoc. Comput. Mach. 36 , No. 4 (1989), 929-965.
[6]
R. O. Duda and P. E. Hart, "Pattern Classification and Scene Analysis," Wiley-Interscience, New York, 1973.
[7]
A. Ehrenfeucht, D. Haussler, M. Kearns, and L. Valiant, A general lower bound on the number of examples needed for learning, Inform. and Comput. 82 , No. 3 (1989), 247-261.
[8]
W. Feller, "An Introduction to Probability Theory and its Applications," Vol. 1, 3rd ed., Wiley, New York, 1968.
[9]
D. Haussler, Decision theoretic generalizations of the PAC model for neural net and other learning applications, Inform. and Comput. 100 , No. 1 (1992), 78-150.
[10]
M. J. Kearns and R. E. Schapire, Efficient distribution-free learning of probabilistic concepts, in "Proceedings, 31th Annual Symposium on the Foundations of Computer Science," pp. 382-392; J. Comput. System Sci. , to appear.
[11]
P. Liard, "Learning from Good and Bad Data," Kluwer Academic, Boston, 1988.
[12]
B. K. Natarajan, On learning sets and functions, Mach. Learning 4 , No. 1 (1989), 67-97.
[13]
D. Pollard, "Convergence of Stochastic Processes," Springer-Verlag, New York/Berlin, 1984.
[14]
R. Rivest, Learning decision lists, Mach. Learning 2 , No. 3 (1987), 229-246.
[15]
L. G. Valiant, A theory of the learnable, Commun. ACM 27, No. 11 (1984), 1134-1142.
[16]
Yamanishi, A learning criterion for stochastic rules, Mach. Learning 9 (1992), 165-203.

Cited By

View all

Index Terms

  1. General Bounds on the Number of Examples Needed for Learning Probabilistic Concepts

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Journal of Computer and System Sciences
      Journal of Computer and System Sciences  Volume 52, Issue 2
      April 1996
      190 pages
      ISSN:0022-0000
      • Editor:
      • E. K. Blum
      Issue’s Table of Contents

      Publisher

      Academic Press, Inc.

      United States

      Publication History

      Published: 01 April 1996

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 14 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media