skip to main content
10.1145/1165573.1165606acmconferencesArticle/Chapter ViewAbstractPublication PagesislpedConference Proceedingsconference-collections
Article

Thread-associative memory for multicore and multithreaded computing

Published: 04 October 2006 Publication History

Abstract

Presented in this paper is the thread-associative memory microarchitecture for multicore and multithreaded processor design. Memory contention among concurrent threads in chip multithreaded processing has become a limiting factor for performance improvement. The proposed thread-associative memory addresses this challenge by incorporating thread-specific information explicitly into on-chip memory hardware. The proposed technique can be utilized at different levels of memory hierarchy. Furthermore, it is not just a technique for performance enhancement but also a solution for energy efficiency. Trace-driven simulations on a 32KB L1 data cache demonstrate 36.6% maximum performance improvement and up to 15.1% total energy reduction, with 20.3% dynamic energy reduction and 9.9% leakage energy reduction.

References

[1]
D. M. Tullsen, S. J. Eggers, and H. M. Levy, "Simultaneous multithreading: Maximizing on-chip parallelism,"Proc. Intl. Symp. Computer Architecture, pp. 392--403, Jun. 1995.
[2]
J. L. Lo, et al., "Converting Thread-Level Parallelism Into Instruction-Level Parallelism via Simultaneous Multithreading,"ACM Trans. on Computer Systems, pp. 322--354, Aug. 1997.
[3]
K. Olukotun phet al., "The Case for a Single Chip Multiprocessor," Proc. 7th Intl. Conf. Architectural Support for Programming Languages and Operating Systems, ACM Press, New York, pp. 2--11, 1996.
[4]
L. Hammond, B. Nayfeh, and K. Olukotun, "A Single-Chip Multiprocessor,"Computer, vol. 30, no. 9, pp. 79--85, Sept. 1997.
[5]
D. Chandra, et al., "Predicting inter-thread cache contention on a chip multi-processor architecture,"Proc. Intl. Symp. on High Performance Computer Architecture, pp. 340--351, Feb. 2005.
[6]
G. E. Suh, L. Rudolph, and S. Devadas, "Dynamic partitioning of shared cache memory,"The Journal of Supercomputing, pp. 7--26, Apr. 2004.
[7]
A. Settle, et al., "A Dynamically Reconfigurable Cache for Multithreaded Processors," The Journal of Embedded Computing (JEC), special issue, Dec. 2005.
[8]
S. J. Eggers, et al., "Simultaneous multithreading: A platform for next-generation processors,"IEEE Micro, pp. 12--19, 1997.
[9]
C. Zhang, and F. Vahid, "A self-tuning cache architecture for embedded systems," Design, Automation and Test Conference in Europe, pp. 142--147, 2004.
[10]
C. Zhang, et al., "A way-halting cache for low-energy high-performance systems," Low Power Electronics and Design, pp. 126--131, Aug. 2004.
[11]
J. Edler and M. D. Hill, "Dinero IV Trace-Driven Uniprocessor Cache Simulator," at http://www.cs.wisc.edu/~markhill/DineroIV/, 2006.
[12]
SPEC CPU2000 athttp://www.spec.org/cpu/.
[13]
A. Milenkovic and M. Milenkovic, "Exploiting Streams in Instruction and Data Address Trace Compression," Proceedings of the IEEE 6th Annual Workshop on Workload Characterization, pp. 99--107, Oct. 2003.

Cited By

View all

Index Terms

  1. Thread-associative memory for multicore and multithreaded computing

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ISLPED '06: Proceedings of the 2006 international symposium on Low power electronics and design
      October 2006
      446 pages
      ISBN:1595934626
      DOI:10.1145/1165573
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 October 2006

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. cache mapping
      2. memory system
      3. multicore
      4. multithreading

      Qualifiers

      • Article

      Conference

      ISLPED06
      Sponsor:
      ISLPED06: International Symposium on Low Power Electronics and Design
      October 4 - 6, 2006
      Bavaria, Tegernsee, Germany

      Acceptance Rates

      Overall Acceptance Rate 398 of 1,159 submissions, 34%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 13 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media