skip to main content
10.1145/1346281.1346288acmconferencesArticle/Chapter ViewAbstractPublication PagesasplosConference Proceedingsconference-collections
research-article

Efficiency trends and limits from comprehensive microarchitectural adaptivity

Published: 01 March 2008 Publication History

Abstract

Increasing demand for power-efficient, high-performance computing requires tuning applications and/or the underlying hardware to improve the mapping between workload heterogeneity and computational resources. To assess the potential benefits of hardware tuning, we propose a framework that leverages synergistic interactions between recent advances in (a) sampling, (b) predictive modeling, and (c) optimization heuristics. This framework enables qualitatively new capabilities in analyzing the performance and power characteristics of adaptive microarchitectures. For the first time, we are able to simultaneously consider high temporal and comprehensive spatial adaptivity. In particular, we optimize efficiency for many, short adaptive intervals and identify the best configuration of 15 parameters, which define a space of 240B point.
With frequent sub-application reconfiguration and a fully reconfigurable hardware substrate, adaptive microarchitectures achieve bips3/w efficiency gains of up to 5.3x (median 2.4x) relative to their static counterparts already optimized for a given application. This 5.3x efficiency gain is derived from a 1.6x performance gain and 0.8x power reduction. Although several applications achieve a significant fraction of their potential efficiency with as few as three adaptive parameters, the three most significant parameters differ across applications. These differences motivate a hardware substrate capable of comprehensive adaptivity to meet these diverse application requirements.

Supplementary Material

JPG File (1346288.jpg)
index.html (index.html)
Slides from the presentation
ZIP File (p36-lee-slides.zip)
Supplemental material for Efficiency trends and limits from comprehensive microarchitectural adaptivity
Audio only (1346288.mp3)
Video (1346288.mp4)

References

[1]
D. Albonesi. Dynamic IPC/clock rate optimization. In International Symposium on Computer Architecture, June 1998.
[2]
D. Albonesi, R. Balasubramonian, S. Dropsho, S. Dwarkadas, E. Friedman, M. Huang, V. Kursun, G. Magklis, M. Scott, G. Semezaro, P. Bose, A. Buyuktosunoglu, P. Cook, and S. Schuster. Dynamically tuning processor resources with adaptive processing. IEEE Computer, December 2003.
[3]
D. Bader, Y. Li, T. Li, and V. Sachdeva. Bioperf: A benchmark suite to evaluate high-performance computer architecture on bioinformatics applications. In IEEE International Symposium on Workload Characterization, October 2005.
[4]
R. Balasubramonian, D. Albonesi, A. Buyuktosunoglu, and S. Dwarkadas. Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures. In International Symposium on Microarchitecture, December 2000.
[5]
D. Brooks, P. Bose, V. Srinivasan, M. Gschwind, P.G. Emma, and M.G. Rosenfield. New methodology for early-stage, microarchitecture-level power-performance analysis of microprocessors. IBM Journal of Research and Development, 47(5/6), Oct/Nov 2003.
[6]
D. Brooks and et. al. Power-aware microarchitecture: Design and modeling challenges for next-generation microprocessors. IEEE Micro, 20(6):26--44, Nov/Dec 2000.
[7]
A. Dhodapkar and J. Smith. Managing multi-configuration hardware via dynamic working set analysis. In International Symposium on Computer Architecture, June 2002.
[8]
A. Efthymiou and J. Garside. Adaptive pipeline structures for speculation control. In International Symposium on Asynchronous Circuits and Systems, May 2003.
[9]
E. Ipek, S.A. McKee, B. de Supinski, M. Schulz, and R. Caruana. Efficiently exploring architectural design spaces via predictive modeling. In Architectural support for programming languages and operating systems, October 2006.
[10]
S. Eyerman, L. Eeckhout, and K.D. Bosschere. Efficient design space exploration of high performance embedded out-of-order processors. In Design, Automation, and Test in Europe, March 2006.
[11]
D. Folegnani and A. Gonzalez. Energy-effective issue logic. In International Symposium on Computer Architecture, June 2001.
[12]
G. Givens and J. Hoeting. Computational Statistics. Wiley, 2005.
[13]
S. Gochman, R. Ronen, and et al. The intel pentium m processor: Micorarchitecture and performance. Intel Technology Journal, 7(2), May 2003.
[14]
F. Harrell. Regression modeling strategies. Springer, 2001.
[15]
J. Henning. Spec cpu2000: Measuring cpu performance in the new millenium. IEEE Computer, July 2000.
[16]
M. Huang, J. Renau, and J. Torrellas. Positional adaptation of processors: Application to energy reduction. In International Symposium on Computer Architecture, June 2003.
[17]
C. Hughes, J. Srinivasan, and S. Adve. Saving energy with architectural and frequency adaptations for multimedia applications. In International Symposium on Microarchitecture, December 2001.
[18]
E. Ipek, M. Kirman, N. Kirman, and J. Martinez. Core fusion: Accommodating software diversity in chip multiprocessors. In International Symposium on Computer Architecture, June 2007.
[19]
V. Iyengar, L. Trevillyan, and P. Bose. Representative traces for processor models with infinite cache. In International Symposium on High Performance Computer Architecture, February 1996.
[20]
P. Joseph, K. Vaswani, and M.J. Thazhuthaveetil. A predictive performance model for superscalar processors. In International Symposium on Microarchitecture, December 2006.
[21]
B. Lee and D. Brooks. Accurate and efficient regression modeling for microarchitectural performance and power prediction. In International Conference on Architectural Support for Programming Languages and Operating Systems, October 2006.
[22]
B. Lee and D. Brooks. Illustrative design space studies with microarchitectural regression models. In International Symposium on High-Performance Computer Architecture, February 2007.
[23]
X. Liang and D. Brooks. Mitigating the impact of process variations on cpu register file and execution units. In International Symposium on Microarchitecture, December 2006.
[24]
K. Mai, T. Paaske, N. Jayasena, R. Ho, W. Dally, and M. Horowitz. Smart memories: A modular reconfigurable architecture. In International Symposium on Computer Architecture, June 2000.
[25]
M. Moudgill, J. Wellman, and J. Moreno. Environment for PowerPC microarchitecture exploration. IEEE Micro, 19(3):9--14, May/June 1999.
[26]
A. Phansalkar, A. Joshi, L. Eeckhout, and L. John. Measuring program similarity: experiments with SPEC CPU benchmark suites. In International Symposium on Performance Analysis of Systems and Software, March 2005.
[27]
D. Ponomarev, G. Kucuk, and K. Ghose. Reducing power requirements of instruction scheduling through dynamic allocation of multiple datapath resources. In International Symposium on Microarchitecture, December 2001.
[28]
R Development Team. R Language Definition.
[29]
T. Sherwood, E. Perelman, G. Hamerly, and B. Calder. Automatically characterizing large scale program behavior. In International Conference on Architectural Support for Programming Languages and Operating Systems, October 2002.
[30]
P. Shivakumar and N. Jouppi. An integrated cache timing, power, and area model. In Technical Report 2001/2, Compaq Computer Corporation, August 2001.
[31]
A. Tiwari, S. Sarangi, and J. Torrellas. ReCycle: Pipeline adaptation to tolerate process variation. In International Symposium on Computer Architecture, June 2007.
[32]
S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta. The SPLASH-2 programs: Characterization and methodological considerations. In International Symposium on Computer Architecture, June 1995.
[33]
R.E. Wunderlich, T.F. Wenisch, B. Falsafi, and J.C. Hoe. SMARTS: Accelerating microarchitecture simulation via rigorous statistical sampling. In International Symposium on Computer Architecture, June 2003.
[34]
V. Zyuban. Inherently lower-power high-performance superscalar architectures. In Ph.D. Thesis, University of Notre Dame, March 2000.
[35]
V. Zyuban, D. Brooks, V. Srinivasan, M. Gschwind, P. Bose, P. Strenski, and P. Emma. Integrated analysis of power and performance for pipelined microprocessors. IEEE Transactions on Computers, Aug 2004.

Cited By

View all

Index Terms

  1. Efficiency trends and limits from comprehensive microarchitectural adaptivity

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ASPLOS XIII: Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
      March 2008
      352 pages
      ISBN:9781595939586
      DOI:10.1145/1346281
      • cover image ACM SIGPLAN Notices
        ACM SIGPLAN Notices  Volume 43, Issue 3
        ASPLOS '08
        March 2008
        339 pages
        ISSN:0362-1340
        EISSN:1558-1160
        DOI:10.1145/1353536
        Issue’s Table of Contents
      • cover image ACM SIGARCH Computer Architecture News
        ACM SIGARCH Computer Architecture News  Volume 36, Issue 1
        ASPLOS '08
        March 2008
        339 pages
        ISSN:0163-5964
        DOI:10.1145/1353534
        Issue’s Table of Contents
      • cover image ACM SIGOPS Operating Systems Review
        ACM SIGOPS Operating Systems Review  Volume 42, Issue 2
        ASPLOS '08
        March 2008
        339 pages
        ISSN:0163-5980
        DOI:10.1145/1353535
        Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 March 2008

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. adaptivity
      2. efficiency
      3. inference
      4. microarchitecture
      5. performance
      6. power
      7. reconfigurablity
      8. regression
      9. simulation
      10. statistics

      Qualifiers

      • Research-article

      Conference

      ASPLOS08

      Acceptance Rates

      ASPLOS XIII Paper Acceptance Rate 31 of 127 submissions, 24%;
      Overall Acceptance Rate 535 of 2,713 submissions, 20%

      Upcoming Conference

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)13
      • Downloads (Last 6 weeks)5
      Reflects downloads up to 16 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media