skip to main content
10.1145/143365.143503acmconferencesArticle/Chapter ViewAbstractPublication PagesasplosConference Proceedingsconference-collections
Article
Free access

A performance evaluation of optimal hybrid cache coherency protocols

Published: 01 September 1992 Publication History
First page of PDF

References

[1]
M. Accetta, R. Baron, W. Bolosky, D. Golub, R. Rashid, A. Tevanian, and M. Young. Mach: A new kernel foundation for UNIX development. In Proe. o/ the Summer 1986 USENIX Technical Conference and Exhibition, Pittsburgh, PA, June 1986.
[2]
A. Agarwal, R. Simoni, J. Hennessy, and M. Horowitz. An evaluation of directory schemes for cache coherence. In Proceedings o/the 15th Annual International Symposium on Computer Architecture, pages 280-289, June 1988.
[3]
James Archibald and Jeaat~Loup Baer. Cache coherence protocols: Evaluation using a multiprocessor simulation model. A CM Transactions on Computer Systems, 4(4):273-298, November 1986.
[4]
H. B. Bakogh, G. F. Grohoski, and R. K. Montoye. The IBM RISC System/6000 processor: Hardware overview. IBM Journal of Research and Development, 36(1):12-22, 1990.
[5]
John K. Bennett, John B. Carter, and Willy Zwaenepoel. Adaptive software cache management for distributed shared memory architectures. In Proceedings o/the 17th Annual International Symposium on Computer Architecture, pages 125-135, June 1990.
[6]
John K. Bennett, John B. Carter, and Willy Zwaenepoel. Munin: Distributed shared memory based on type-specific memory coherence, in Proceedings o/ the Second PPoPP, pages 168-176, March 1990. ACM SIGPLAN Notices 25:3.
[7]
B. N. Bershad, E. D. Lazowska, and H. M. Levy. Presto: A system for object-oriented parallel programming. Software- Practice and Experience, 18(8):713- 732, 1988.
[8]
W. J. Bolosky and M. L. Scott. A Trace-Based Comparison of Shared Memory Multiprocessors using Optimal Off-Line Analysis. Technical report, U. of Rochester CSD, December 1991.
[9]
William J. Bolosky, Michael L. Scott, Robert P. Fitzgerald, Robert J. Fowler, and Alan L. Cox. NUMA policies and their relation to memory architecture. In Proceedings o/the Fourth International Conference on Architectural Support/or Programming Languages and Operating Systems, pages 212 - 221, 1991.
[10]
James Boyle, Ralph Butler, Terrence Disz, Barnett Blickfeld, Ewing Lusk, and Ross Overbeek. Portable Programs/or Parallel Processors. Holt, Rinehart and Winston, 1987.
[11]
Lucien M. Censier and Paul Feautrier. A new solution to coherence problems in muRicache systems. IEEE Transactions on Computers, C-27(12):1112-1118, December 1978.
[12]
David R. Cheriton, Hendrik A. Goosen, and Philip Ms~ chanick. Restructuring a para}lel simulation to improve cache behavior in a shaxed-memory multiprocessor: A first experience. In Proceedings of the international Symposium on Shared-Memory Multiprocessing, pages 109-118, 1991.
[13]
Eric Cooper and Richard Draves. C Threads. Technical report, Carnegie-Mellon University, Computer Science Department, nov 1990.
[14]
Corollary, Inc. C-bus H Specification, Revision 1.0. Corollary, Inc., 2802 Kelvin Avenue, Irvine, California 92714, 1991.
[15]
Alan L. Cox. The Implementation and Evaluation o/a Coherent Memory Abstraction/or NUMA Multiproces. sops. PhD thesis, University of Rochester, Department of Computer Science, may 1992.
[16]
Alan L. Cox and Robert J. Fowler. The implementation of a coherent memory abstraction on a NUMA multiprocessor: Experiences with PLATINUM. In Proceedings o/the 12th A CM Symposium on Operating System Principles, pages 32-44, Litchfield Park, AZ, December 1989.
[17]
W. P. Crowley, C. P. Hendrickson, and T. E. Rudy. The SIMPLE code. Technical report, Lawrence Livermore Laboratory, 1978. UCID-17715.
[18]
Czarek Dubnicki and Thomas J. LeBlanc. Adjustable block size coherent caches. In Proceedings o/the 19th Annual International Symposium on Computer Archi. tecture, pages 170-180, 1992.
[19]
S. J. Eggers and T. E. Jeremiassen. Eliminating False Sharing. In Prec. 199I Intl. Conf. on Parallel Processing, pages 377-381, 1991. Volume I.
[20]
Susan J. Eggers and Randy H. Katz. A characterization of sharing in parallel programs and its application to coherency protocol evaluation. In Proceedings of the lath International Symposium on Computer Architecture, pages 373-382, 1988.
[21]
Susan j. Eggers and Randy H. Katz. Evaluating the performance of four snooping cache coherency protocols. Technical Report UCB/CSD 88/478, Computer Science Division (EECS), University of California, Berkeley, December 1988.
[22]
Susan J. Eggers and Randy H. Katz. The effect of sharing on the cache and bus performance of parallel programs. In Proceedings o/the Third international Conference on Arehicteetural Support for Program. ruing Languages and Operating Systems (ASPLOS- llI), pages 257-270, April 1989.
[23]
A. Garcia, D. Foster, and R. Freitas. The Advanced Computing Environment Multiprocessor Workstation. Research Report RC-14419, IBM T.J. Watson Research Center, March 1989.
[24]
Armando Gaxda. Efficient Rendering of Synthetic In. ages. PhD thesis, Massachusetts Institute of Technology, February 1988.
[25]
Aaron Goldberg and John itennessy. Mtooh A method for isolating memory bottlenecks in shared memory multiprocessor programs. In Proceedings of the 1991 Conference on Parallel Processing, pages II-251-II- 257, 1991.
[26]
J. R. Goodman, M. K. Vernon, and P. J. Woest. Efficient Synchronization Primitives for Large-Scale Cache-Coherent Multiprocessors. In Proceedings of the Third International Conference on Architectural Support for Programming Languages and Operating Systems, pages 64-75, April 1989.
[27]
James R. Goodman. Using cache memory to reduce processor-memory traffic. In Proceedings of the l Oth International Symposium on Computer Architecture, pages 124-131, 1983.
[28]
Norman P. Jouppi. Improving direct-mapped cache performance by the addition of a small fully, associative cache and prefetch buffers. In Proceedings of the 17th Annual International Symposium on Computer Architecture, pages 364-373, 1990.
[29]
A. R. Karlin, M. S. Manasse, L. Rudolph, and D. D. Sleator. Competitive snoopy caching. Algorithmica, pages 79-119, March 1988.
[30]
R. H. Katz, S. J. Eggers, D. A. Wood, C. L. PerMns, and R. G. Sheldon. Implementing a cache consistency protocol. In Proceedings of the l~th International Symposium on Computer Architecture, pages 276-283, 1985.
[31]
Daniel Lenoski, James Landon, Kourosh Gharachor- 1oo, Anoop Gupta, and John Hennessy. The directorybased cache coherence protocol for the DASH mulfiprocessor. In Proceedings o/the 17th Annual International Symposium on Computer Architecture, pages 148-159, June 1990.
[32]
John M. Mellor-Crummey and Michael L. Scott. Synchronization without contention. In Proceedings o/the Fourth International Conference on Architectural Support/or Programming Languages and Operating Systems, pages 269-278, April 1991.
[33]
MIPS Computer Systems, Inc. MIPS R~O00 Microprocessor User's Manual. Integrated Device Technology, Inc., 1991.
[34]
James M. Ortega and Robert O. Voigt. Solution of Partial Differential Equations on Vector an d Parallel Computers. SIAM Review, 27(2):149-240, June 1985.
[35]
J. P. Singh, W.-D. Weber, and A. Gnpta. SPLASH: Stanford Parallel Applications for Shared-Memory. Available by anonymous FTP, April 1991.
[36]
J. Stone and A. Norton. The VM/EPEX FORTRAN Preprocessor Reference. IBM, 1985. Research Report RGll408.
[37]
Charles P. Thacker and Lawrence C. Stewart. Firefly: a Multiprocessor Workstation. In Proceedings of the Second International Conference on Architectural Support/or Programming Languages and Operating Systems, pages 164-172, PaJo Alto, CA, October 1987. Oper. Sys. Review v21(4), 10/87.
[38]
Josep Torrellas, Monies S. Lam, and John L. ttennessy. Shared data placement optimizations to reduce multiprocessor cache miss rates. In International Conference on Parallel Processing, pages 11-266 - I1-270. The Pennsylvania State University Press, 1990.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ASPLOS V: Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
September 1992
308 pages
ISBN:0897915348
DOI:10.1145/143365
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 September 1992

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

ASPLOS92

Acceptance Rates

Overall Acceptance Rate 535 of 2,713 submissions, 20%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)103
  • Downloads (Last 6 weeks)9
Reflects downloads up to 04 Nov 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media