skip to main content
10.1145/165123.165150acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
Article
Free access

Evaluation of release consistent software distributed shared memory on emerging network technology

Published: 01 May 1993 Publication History

Abstract

We evaluate the effect of processor speed, network characteristics, and software overhead on the performance of release-consistent software distributed shared memory. We examine five different protocols for implementing release consistency: eager update, eager invalidate, lazy update, lazy invalidate, and a new protocol called lazy hybrid. This lazy hybrid protocol combines the benefits of both lazy update and lazy invalidate.
Our simulations indicate that with the processors and networks that are becoming available, coarse-grained applications such as Jacobi and TSP perform well, more or less independent of the protocol used. Medium-grained applications, such as Water, can achieve good performance, but the choice of protocol is critical. For sixteen processors, the best protocol, lazy hybrid, performed more than three times better than the worst, the eager update. Fine-grained applications such as Cholesky achieve little speedup regardless of the protocol used because of the frequency of synchronization operations and the high latency involved.
While the use of relaxed memory models, lazy implementations, and multiple-writer protocols has reduced the impact of false sharing, synchronization latency remains a serious problem for software distributed shared memory systems. These results suggest that the future work on software DSMs should concentrate on reducing the amount of synchronization or its effect.

References

[1]
S. V. Adve and M. D. Hill. A unified formalization of four shared-memory models. Technical Report CS-1051, University of Wisconsin, Madison, September 1991.
[2]
M. Ahamad, P.W. Hutto, and R. John. Implementing and programming causal distributed shared memory. In Proceedings of the l lth lnlernalional Conference on Distributed Computing Systems, pages 274-281, May 1991.
[3]
H.E. Bal and A.S. Tanenbaum. Distributed programming with shared data. In Proceedings of the 1988 International Conference on Computer Languages, pages 82-91, October 1988.
[4]
B.N. Bershad and M.J. Zekauskas. Midway: Shared memory parallel programming with entry consistency for distributed memory multiprocessors. Technical Report CMU-CS-91-170, Carnegie- Mellon University, September 1991.
[5]
J.B. Carter, J.K. Bennett, and W. Zwaenepoel. Implementation and performance of Munin. In Proceedings of the 13th A CM Symposium on Operating Systems Principles, pages 152-164, October 1991.
[6]
J.S. Chase, F.G. Amador, E.D. Lazowska, H.M. Levy, and R.J. Littlefield. The Amber system: Parallel programming on a network of multiprocessors. In Proceedings of the 12th A CM Symposium on Operating Systems Principles, pages 147-158, December 1989.
[7]
R. G. Covington, S. Dwarkadas, J. R. Jump, S. Madala, and J. B. Sinclair. The Efficient Simulation of Parallel Computer Systems. International Journal in Computer Simulation, 1:31-58, January 1991.
[8]
M. Dubois and C. Scheurich. Memory access dependencies in shared-memory multiprocessors. IEEE Transactions on Contpulers, 16(6):660-673, June 1990.
[9]
K. Gharachorloo, D. Lenoski, J. Laudon, P. Gibbons, A. Gupta, and J. Hennessy. Memory consistency and event ordering in scalable sharedmemory multiprocessors. In Proceedings of th,e 17th Annual International Symposium on Computer Architecture, pages 15-26, May 1990.
[10]
D.B. Johnson and W. Zwaenepoel. The Peregrine high-performance RPC system. Software: Practice and Experience, 23(2):201-221, February 1993.
[11]
P. Keleher, A. L. Cox, and W. Zwaenepoel. Lazy release consistency for software distributed shared memory. In Proceedings of the 19th Annual International Symposium on Computer Architecture'., pages 13-21, May 1992.
[12]
D. Lenoski, J. Laudon, K. Gharachorloo, A. Gupta, and J. Hennessy. The directory-based cache coherence protocol for the DASH multiprocessor. In Proceedings of the 17th Annual International Symposium on Computer Architecture, pages 148-159, May 1990.
[13]
K. Li and P. Hudak. Memory coherence in shared virtual memory systems. A CM Transactions on Computer Systems, 7(4):321-359, November 1989.
[14]
J.P. Singh, W.-D. Weber, and A. Gupta. SPLASH: Stanford parallel applications for shared-memory. Technical Report CSL-TR-91-469, Stanford University, April 1991.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ISCA '93: Proceedings of the 20th annual international symposium on computer architecture
June 1993
361 pages
ISBN:0818638109
DOI:10.1145/165123
  • cover image ACM SIGARCH Computer Architecture News
    ACM SIGARCH Computer Architecture News  Volume 21, Issue 2
    Special Issue: Proceedings of the 20th annual international symposium on Computer architecture (ISCA '93)
    May 1993
    348 pages
    ISSN:0163-5964
    DOI:10.1145/173682
    Issue’s Table of Contents

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1993

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

20ISCA93
Sponsor:
20ISCA93: 20th International Symposium on Computer Architecture
May 16 - 19, 1993
California, San Diego, USA

Acceptance Rates

Overall Acceptance Rate 543 of 3,203 submissions, 17%

Upcoming Conference

ISCA '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)68
  • Downloads (Last 6 weeks)20
Reflects downloads up to 24 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media