skip to main content
10.1145/1787275.1787302acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
poster

Exploitation of nested thread-level speculative parallelism on multi-core systems

Published: 17 May 2010 Publication History

Abstract

Multi-cores such as the Intel Core 2 Duo, AMD Barcelona and IBM POWER6 are becoming ubiquitous. The number of cores and the resulting hardware parallelism is poised to increase rapidly in the foreseeable future. Nested thread-level speculative parallelization has been proposed as a means to exploit the hardware parallelism of such systems. In this paper, we present a methodology to gauge the efficacy of nested thread-level speculation with increasing level of nesting.

References

[1]
AMD Confirms 12-Core Opteron Production, Februrary, 2010. http://www.hpcwire.com/blogs/AMD-Confirms-12-Core-Opteron-Production-85118242.html#38593146.
[2]
Multicore Watershed, March, 2010. http://www.hpcwire.com/blogs/Multicore-Watershed-86426622.htm.
[3]
H. Sutter and J. Larus. Software and the concurrency revolution. ACM Queue, 3(7), 2005.
[4]
A. Kejariwal, X. Tian, W. Li, M. Girkar, S. Kozhukhov, H. Saito, U. Banerjee, A. Nicolau, A. V. Veidenbaum, and C. D. Polychronopoulos. On the performance potential of different types of speculative thread-level parallelism. In Proceedings of the 20th ACM ICS, pages 24--35, 2006.
[5]
A. Kejariwal, X. Tian, M. Girkar, W. Li, S. Kozhukhov, H. Saito, U. Banerjee, A. Nicolau, A. V. Veidenbaum, and C. D. Polychronopoulos. Tight analysis of the performance potential of thread speculation using SPEC CPU2006. In Proceedings of the 12th ACM SIGPLAN Symposium on PPoPP, 2007.
[6]
A. Kejariwal, M. Girkar, X. Tian, H. Saito, A. Nicolau, A. V. Veidenbaum, and U. Banerjee. On the efficacy of call graph-level thread-level speculation. In Proceedings of the First Joint WOSP/SIPEW ICPE, pages 247--248, 2010.
[7]
J. Renau, J. Tuck, W. Liu, L. Ceze, K. Strauss, and J. Torrellas. Tasking with out-of-order spawn in TLS chip multiprocessors: Microarchitecture and compilation. In Proceedings of the 19th ACM ICS, pages 179--188, 2005.
[8]
SPEC CFP2006. http://www.spec.org/cpu2006/CFP2006.
[9]
P. Wu, A. Kejariwal, and C. Caşscaval. Compiler-driven dependence profiling to guide program parallelization. In Proceedings of the 21st International Workshop on LCPC, 2008.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CF '10: Proceedings of the 7th ACM international conference on Computing frontiers
May 2010
370 pages
ISBN:9781450300445
DOI:10.1145/1787275
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 May 2010

Check for updates

Author Tags

  1. performance
  2. thread-level speculation

Qualifiers

  • Poster

Conference

CF'10
Sponsor:
CF'10: Computing Frontiers Conference
May 17 - 19, 2010
Bertinoro, Italy

Acceptance Rates

CF '10 Paper Acceptance Rate 30 of 113 submissions, 27%;
Overall Acceptance Rate 273 of 785 submissions, 35%

Upcoming Conference

CF '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 24 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media