skip to main content
10.1007/978-3-319-12571-8_18guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Instant Exceptional Model Mining Using Weighted Controlled Pattern Sampling

Published: 07 March 2023 Publication History

Abstract

When plugged into instant interactive data analytics processes, pattern mining algorithms are required to produce small collections of high quality patterns in short amounts of time. In the case of Exceptional Model Mining (EMM), even heuristic approaches like beam search can fail to deliver this requirement, because in EMM each search step requires a relatively expensive model induction. In this work, we extend previous work on high performance controlled pattern sampling by introducing extra weighting functionality, to give more importance to certain data records in a dataset. We use the extended framework to quickly obtain patterns that are likely to show highly deviating models. Additionally, we combine this randomized approach with a heuristic pruning procedure that optimizes the pattern quality further. Experiments show that in contrast to traditional beam search, this combined method is able to find higher quality patterns using short time budgets.

References

[1]
Hasan, M.A., Zaki, M.J.: Output space sampling for graph patterns. In: Proc. VLDB Endow, pp. 730–741 (2009)
[2]
Bache, K., Lichman, M.: UCI machine learning repository (2013)
[3]
Blumenstock, A., Hipp, J., Kempe, S., Lanquillon, C., Wirth, R.: Interactivity closes the gap. In: Proc. ACM SIGKDD 2006 Workshop on Data Mining for Business Applications (2006)
[4]
Boley, M., Lucchese, C., Paurat, D., Gärtner, T.: Direct local pattern sampling by efficient two–step random procedures. In: Proc. ACM SIGKDD 2011 (2011)
[5]
Boley, M., Mampaey, M., Kang, B., Tokmakov, P., Wrobel, S.: One click mining: Interactive local pattern discovery through implicit preference and performance learning. In: Proc. ACM SIGKDD 2013 Workshop IDEA, pp. 27–35. ACM (2013)
[6]
Boley, M., Moens, S., Gärtner, T.: Linear space direct pattern sampling using coupling from the past. In: Proc. ACM SIGKDD 2012, pp. 69–77. ACM (2012)
[7]
Chaoji, V., Hasan, M.A., Salem, S., Besson, J., Zaki, M.J.: Origami: A novel and effective approach for mining representative orthogonal graph patterns. In: Stat. Anal. Data Min., pp. 67–84 (2008)
[8]
Duivesteijn, W.: Exceptional model mining. PhD thesis, Leiden Institute of Advanced Computer Science (LIACS), Faculty of Science, Leiden University (2013)
[9]
Dzyuba V. and van Leeuwen M. Tucker A., Höppner F., Siebes A., and Swift S. Interactive discovery of interesting subgroup sets Advances in Intelligent Data Analysis XII 2013 Heidelberg Springer 150-161
[10]
Goethals, B., Moens, S., Vreeken, J.: Mime: a framework for interactive visual pattern mining. In: Proc. ACM SIGKDD 2011, pp. 757–760. ACM (2011)
[11]
Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., and Witten I.H. The weka data mining software: An update SIGKDD Explor. Newsl. 2009 11 1 10-18
[12]
Herrera, F., Carmona, C.J., González, P., del Jesus, M.J.: An overview on subgroup discovery: Foundations and applications. Knowl. Inf. Syst., 495–525 (2011)
[13]
Moens, S., Goethals, B.: Randomly sampling maximal itemsets. In: Proc. ACM SIGKDD 2013 Workshop IDEA, pp. 79–86 (2013)
[14]
Škrabal R., Šimůnek M., Vojíř S., Hazucha A., Marek T., Chudán D., and Kliegr T. Flach P.A., De Bie T., and Cristianini N. Association Rule Mining Following the Web Search Paradigm Machine Learning and Knowledge Discovery in Databases 2012 Heidelberg Springer 808-811

Cited By

View all

Index Terms

  1. Instant Exceptional Model Mining Using Weighted Controlled Pattern Sampling
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Guide Proceedings
      Advances in Intelligent Data Analysis XIII: 13th International Symposium, IDA 2014, Leuven, Belgium, October 30 – November 1, 2014. Proceedings
      Oct 2014
      45 pages
      ISBN:978-3-319-12570-1
      DOI:10.1007/978-3-319-12571-8

      Publisher

      Springer-Verlag

      Berlin, Heidelberg

      Publication History

      Published: 07 March 2023

      Author Tags

      1. Controlled Pattern Sampling
      2. Subgroup Discovery
      3. Exceptional Model Mining

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media