skip to main content
10.1145/952532.952629acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

ADMiRe: an algebraic approach to system performance analysis using data mining techniques

Published: 09 March 2003 Publication History

Abstract

System performance analysis is a very difficult problem. Traditional tools rely on manual operations to analyze data. Consequently, determining which system resources to examine is often a lengthy process, where many problems are elusive, even when using data mining tools. We address this problem by introducing the Analyzer for Data Mining Results (ADMiRe) technique as a natural and flexible means to further interpret data mining outcome. In our scheme, regression analysis is first applied to performance data to discover correlations between parameters. Regression rules are defined to represent this output in a format suitable for ADMiRe. ADMiRe expressions are then used to manipulate these sets of rules, revealing information about combined, common and different features of varying configurations. This knowledge would be unavailable if regression output were considered in isolation. ADMiRe was tested with performance data collected from a TPC-C (Transaction Processing Performance Council) test on an Oracle database system, under various configurations, to demonstrate the effectiveness of our technique.

References

[1]
Allen J. F., "Maitaining Knowledge about Temporal Intervals", CACM26 (11) 1983, 832--843.
[2]
Rakesh Agrawal, S. Gosh, T. Imielinski, B. Iyer, Arun Swami, "An Interval Classifier for Databases Mining Applications", Proceedings of the 1992 Very Large Databases Conference.
[3]
Peter M. Chen and David A. Patterson, "A New Approach to I/O Performance Evaluation: Self-scaling I/O Benchmarks, Predicted I/O Performance", Pages 1--12 Proceedings of the 1993 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems, May 10 - 14, 1993, Santa Clara, CA USA.
[4]
Will Leland, Murad Taqqu, Walter Willinger, and Daniel Wilson, "On the Self-Similar Nature of Ethernet Traffic (Extended Version)", IEEE/ACM Transactions on Networking, Vol. 2, No. 1, pp. 1--15, February 1994. (From an earlier version of the paper in SIGCOMM 93, Sept. 1993.)
[5]
Rakesh Agrawal, Ramakrishnan Srikant, "Fast Algorithms for Mining Association Rules", Proceedings of the 20th VLDB Conference.
[6]
Maurice Houtsma, Arun Swami, "Set-Oriented Mining for Association Rules in Relational Databases", 11th International Conference on Data Engineering.
[7]
Rakesh Agrawal and Ramakrishnan Srikant, "Mining Quantitative Association Rules in Large Relational Tables", Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data.
[8]
Leo Breiman, Jerome H. Friedman, Richard A. Olshen, Charles J. Stone, "Classification And Regression Trees", Chapman & Hall, 1998.
[9]
Steven D. Gribble, Gurmeet Singh Manku, Drew Roselli, and Eric A. Brewer, Timothy J. Gibson and Ethan L. Miller, "Self-Similarity in File Systems", SIGMETRICS '98 Madison, WI, USA.
[10]
Ashok Joshi, William Bridge, Juan Loaiza, Tirthankar Lahiri, "Checkpointing in Oracle", Proceedings of the 1998 VLDB. New York, 1998.
[11]
Walter Willinger, Vern Paxson, and Murad Taqqu, "Self-similarity and Heavy Tails: Structural Modeling of Network Traffic. In A Practical Guide to Heavy Tails: Statistical Techniques and Applications", Adler, R., Feldman, R., and Taqqu, M. S., editors, Birkhauser, 1998.
[12]
C. H. Cheng, A. W. Fu, and Y. Zhang. "Entropy-based Subspace Clustering for Mining Numerical Data", In SIGKDD, pages 84--93, 1999.
[13]
Oracle, "Oracle 8i Tuning, Release 8.1.5", Part No. A67775-01, Oracle Corporation.
[14]
Oracle, "Oracle Enterprise Manager Database Tuning with the Oracle Tuning Pack", Release 9.0.1, Part Number A86647-01, Oracle Corporation.
[15]
Ken Gottry, "Successful Solaris Tuning", SysAdmin, the Journal for UNIX Systems and Administrators, 2001.
[16]
Trevor Hastie, Robert Tibshirani, and Jerome Friedman, "The Elements of Statistical Learning, Data Mining, Inference, and Prediction". Springer 2001.
[17]
Tirthankar Lahiri, Amit Ganesh, Ron Weiss, and Ashok Joshi, "Fast-Start: Quick Fault Recovery in Oracle", Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, 2001.
[18]
Transaction Processing Performance Council, "TPC Benchmark C Standard Specification Revision 5.0", http://www.tpc.org/tpcc/spec/tpcc_current.pdf.
[19]
Haixun Wang, Wei Wang, Jiong Yang, Philip S. Yu, "Clustering by Pattern Similarity in Large Data Sets", Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '03: Proceedings of the 2003 ACM symposium on Applied computing
March 2003
1268 pages
ISBN:1581136242
DOI:10.1145/952532
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 March 2003

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data mining
  2. regression
  3. scalability

Qualifiers

  • Article

Conference

SAC03
Sponsor:
SAC03: ACM Symposium on Applied Computing
March 9 - 12, 2003
Florida, Melbourne

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Upcoming Conference

SAC '25
The 40th ACM/SIGAPP Symposium on Applied Computing
March 31 - April 4, 2025
Catania , Italy

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media