skip to main content
article

Multidimensional Business Benchmarking Analysis on Data Warehouses

Published: 01 January 2017 Publication History

Abstract

Benchmarking analysis has been used extensively in industry for business analytics. Surprisingly, how to conduct benchmarking analysis efficiently over large data sets remains a technical problem untouched. In this paper, the authors formulate benchmark queries in the context of data warehousing and business intelligence, and develop a series of algorithms to answer benchmark queries efficiently. Their methods employ several interesting ideas and the state-of-the-art data cube computation techniques to reduce the number of aggregate cells that need to be computed and indexed. An empirical study using the TPC-H data sets and the Weather data set demonstrates the efficiency and scalability of their methods.

References

[1]
Ajibefun, I. 2008, July. An evaluation of parametric and non-parametric methods of technical efficiency measurement: Application to small scale food crop production in Nigeria. Journal of agriculture and social sciences, 43, 95-100.
[2]
Beyer, K., & Ramakrishnan, R. 1999. Bottom-up computation of sparse and iceberg cube. Proceedings of the 1999 ACM SIGMOD International Conference on Management of DataSIGMOD '99, pp. 359-370. New York, NY, USA: ACM. 10.1145/304182.304214
[3]
Ceci, M., Cuzzocrea, A., & Malerba, D. 2015, June. Effectively and efficiently supporting roll-up and drill-down OLAP operations over continuous dimensions via hierarchical clustering. Journal of Intelligent Information Systems, 443, 309-333.
[4]
Chen, C., Yan, X., Zhu, F., Han, J., & Yu, P. S. 2009, October. Graph OLAP: A multi-dimensional framework for graph data analysis. Knowledge and Information Systems, 211, 41-63.
[5]
Chen, Y., Dehne, F., Eavis, T., & Rau-Chaplin, A. 2008, April. Pnp: Sequential, external memory, and parallel iceberg cube computation. Distributed and Parallel Databases, 232, 99-126.
[6]
Dong, G., Han, J., Lam, J. M. W., Pei, J. & Wang, K. 2001. Mining multi-dimensional constrained gradients in data cubes. Proceedings of the 27th International Conference on Very Large Data BasesVLDB '01 pp. 321-330. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
[7]
Fang, M., Shivakumar, N., Garcia-Molina, H., Motwani, R., & Ullman, J. D. 1998. Computing iceberg queries efficiently. Proceedings of the 24rd International Conference on Very Large Data BasesVLDB '98 pp. 299-310. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
[8]
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., & Pirahesh, H. et al . 1997, January. Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. Data Mining and Knowledge Discovery, 11, 29-53.
[9]
Han, J., Pei, J., Dong, G., & Wang, K. 2001. Efficient computation of iceberg cubes with complex measures. Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, SIGMOD '01. New York, NY, USA: ACM. 10.1145/375663.375664
[10]
He, Z., Wong, P., Kao, B., Lo, E., & Cheng, R. 2013. Fast evaluation of iceberg pattern-based aggregate queries. Proceedings of the 22Nd ACM International Conference on Conference on Information & Knowledge ManagementCIKM '13 pp. 2219-2224. New York, NY, USA: ACM. 10.1145/2505515.2505726
[11]
Herrera, S., & Pang, G. 2005, June. Efficiency of public spending in developing countries: an efficiency frontier approach. The World Bank.
[12]
S. Holder, B. Veronese, P. Metcalfe, F. Mini, S. Carter, & Basalisco, B. 2006. Cost benchmarking of air navigation service providers: A stochastic frontier analysis Technical report. EUROCONTROL.
[13]
Imieliński, T., Khachiyan, L., & Abdulghani, A. 2002, July. Cubegrades: Generalizing association rules. Data Mining and Knowledge Discovery, 63, 219-257.
[14]
Lakshmanan, L. V. S., Pei, J., & Han, J. 2002. Quotient cube: How to summarize the semantics of a data cube. Proceedings of the 28th International Conference on Very Large Data BasesVLDB '02 pp. 778-789. 10.1016/B978-155860869-6/50074-3
[15]
Lo, E., Kao, B., Ho, W.-S., Lee, S. D., Chui, C. K., & Cheung, D. W. 2008. OLAP on sequence data. Proceedings of the 2008 ACM SIGMOD International Conference on Management of DataSIGMOD '08 pp. 649-660. New York, NY, USA: ACM. 10.1145/1376616.1376682
[16]
Ng, R. T., Wagner, A., & Yin, Y. 2001. Iceberg-cube computation with pc clusters. Proceedings of the 2001 ACM SIGMOD International Conference on Management of DataSIGMOD '01 pp. 25-36. New York, NY, USA: ACM. 10.1145/375663.375666
[17]
Nguyen, K.-N. T., Cerf, L., Plantevit, M., & Boulicaut, J.-F. 2013, January. Discovering descriptive rules in relational dynamic graphs. Intelligent Data Analysis, 171, 49-69.
[18]
Sarawagi, S., Agrawal, R., & Megiddo, N. 1998. Discovery-driven exploration of OLAP data cubes. Proceedings of the 6th International Conference on Extending Database Technology: Advances in Database TechnologyEDBT '98 pp. 168-182. London, UK: Springer.
[19]
Wang, J., Han, J., & Pei, J. 2006, June. Closed constrained gradient mining in retail databases. IEEE Transactions on Knowledge and Data Engineering, 186, 764-769.
[20]
Zairi, M. 1996. Effective Benchmarking. Springer Netherlands.
  1. Multidimensional Business Benchmarking Analysis on Data Warehouses

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image International Journal of Data Warehousing and Mining
      International Journal of Data Warehousing and Mining  Volume 13, Issue 1
      January 2017
      93 pages
      ISSN:1548-3924
      EISSN:1548-3932
      Issue’s Table of Contents

      Publisher

      IGI Global

      United States

      Publication History

      Published: 01 January 2017

      Author Tags

      1. Benchmark Queries
      2. Business Intelligence
      3. Data Cubes
      4. Data Warehouse

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 0
        Total Downloads
      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 06 Jan 2025

      Other Metrics

      Citations

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media