skip to main content
10.5555/646420.693659guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

An Efficient Single-Scan Algorithm for Mining Essential Jumping Emerging Patterns for Classification

Published: 06 May 2002 Publication History

Abstract

Emerging patterns (EPs), namely itemsets whose supports change significantly from one class to another, were recently proposed to capture multi-attribute contrasts between data classes, or trends over time. Previous studies show that EP/JEP(jumping emerging patterns) - based classifiers such as CAEP[2] and JEP-classifier[6] have good overall predictive accuracy. But they suffer from the huge number of mined EPs/JEPs, which makes the classifiers complex.In this study, we propose a special type of EP, essential jumping emerging patterns (eJEPs), which are believed to be high quality patterns with the most differentiating power and thus are sufficient for building accurate classifiers. Existing algorithms such as border-based algorithms and consEPMiner[7] can not directly mine such eJEPs. We present a new single-scan algorithm to effectively mine eJEPs of both data classes (both directions). Experimental results show that the classifier based exclusively on eJEPs, which uses much fewer JEPs than JEP-classifier, achieves the same or higher testing accuracy and is often also superior to other state-of-the-art classification systems such as C4.5a nd CBA.

References

[1]
G. Dong and J. Li. Efficient mining of emerging patterns: Discovering trends and differences. In Proc. of KDD'99 , pages 15-18, 1999.
[2]
G. Dong, X. Zhang, L. Wong, and J. Li. CAEP: Classification by aggregating emerging patterns. In Proc. of the 2nd Int'l Conf. on Discovery Science (DS'99) , Tokyo, Japan, Dec. 1999.
[3]
E. Keogh C. Blake and C.J. Merz. UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html, 1998.
[4]
R. Kohavi, G. John, R. Long, D. Manley, and K. Pfleger. MLC++: a machine learning library in C++. In Tools with artificial intelligence , pages 740-743, 1994.
[5]
J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In SIGMOD'00 , Dallas, TX, May 2000.
[6]
J. Li, G. Dong, and K. Ramamohanarao. JEP-Classifier: Classification by Aggregating Jumping Emerging Patterns. In Knowledge and Information Systems , Volume 3 Issue 2, 131-145, 2001.
[7]
X. Zhang, G. Dong, K. Ramamohanarao. Exploring Constraints to Efficiently Mine Emerging Patterns from Large High-dimensional Datasets. In Proc. 2000 ACM SIGKDD Conf. , pages 310-314, Boston, USA, Aug. 2000.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
PAKDD '02: Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
May 2002
566 pages
ISBN:3540437045

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 06 May 2002

Author Tags

  1. classification
  2. complexity
  3. data mining
  4. decision trees
  5. emerging patterns

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media