skip to main content
10.1145/1238844.1238852acmconferencesArticle/Chapter ViewAbstractPublication PageshoplConference Proceedingsconference-collections
Article

The design and development of ZPL

Published: 09 June 2007 Publication History

Abstract

ZPL is an implicitly parallel programming language, which means all instructions to implement and manage the parallelism are inserted by the compiler. It is the first implicitly parallel language to achieve performance portability, that is, consistent high performance across all (MIMD) parallel platforms. ZPL has been designed from first principles, and is founded on the CTA abstract parallel machine. A key enabler of ZPL's performance portability is its What You See Is What You Get (WYSIWYG) performance model. The paper describes the antecedent research on which ZPL was founded, the design principles used to build it incrementally, and the technical basis for its performance portability. Comparisons with other parallel programming approaches are included.

Supplementary Material

JPG File (m8-snyder-h.jpg)
slides.ppt (zpl.ppt)
The design and development of ZPL PowerPoint presentations
MOV File (m8-snyder-h.mov)

References

[1]
Lawrence Snyder. Type architecture, shared memory and the corollary of modest potential. Annual Review of Computer Science, 1:289--317, 1986.
[2]
Gil Lerman and Larry Rudolph. Parallel Evolution of Parallel Processors, Plenum Press, 1993.
[3]
G. C. Fox, M. A. Johnson, G. A. Lyzenga, S. W. Otto, J. K. Salmon and D. W. Walker. Solving Problems on Concurrent Processors, Prentice-Hall, 1988.
[4]
Lawrence Snyder, The Blue CHiP Project Description. Department of Computer Science Technical Report, Purdue University, 1980.
[5]
Calvin Lin and Lawrence Snyder. ZPL: An array sublanguage. In Proceedings of the Workshop on Languages and Compilers for Parallel Computing, 1993.
[6]
Calvin Lin and Lawrence Snyder. SIMPLE performance results in ZPL. In Proceedings of the Workshop on Languages and Compilers for Parallel Computing, 1994.
[7]
Lawrence Snyder. Parallel programming and the poker programming environment. Computer, 17(7):27--36, July 1984.
[8]
Lawrence Snyder. Introduction to the configurable, highly parallel computer. Computer, 15(1):47--56, January 1982.
[9]
Richard O'Keefe, The Craft of Prolog, MIT Press, 1994.
[10]
Simon Peyton Jones and P. L. Wadler. A static semantics for Haskell. University of Glasgow, 1992.
[11]
Jorg Keller, Christoph W. Kessler and Jesper Larsson Traff. Practical PRAM Programming, John Wiley, 2000.
[12]
Richard J. Anderson and Lawrence Snyder. A comparison of shared and nonshared memory models of parallel computation. Proceedings of the IEEE, 79(4):480--487, April 1991.
[13]
Alan M. Turing. On computable numbers, with an application to the Entscheidungsproblem, Proc. London Math. Soc., 2(42):230--265, 1936.
[14]
John Backus. Can programming be liberated from the von Neumann style? Communications of the ACM 21(8):613--641, 1978.
[15]
J. T. Schwartz. Ultracomputers. ACM Transactions on Programming Languages and Systems, 2(4):484--521, 1980.
[16]
I.D. Scherson and A. S. Youssef. Interconnection Networks for High-Performance Parallel Computers. IEEE Computer Society Press, 1994
[17]
K. Bolding, M. L. Fulgham and L. Snyder. The case for Chaotic adaptive routing. IEEE Trans. Computers 46(12): 1281--1291, 1997.
[18]
David Culler, Richard Karp, David Patterson, Abhijit Sahay, Klaus Erik Schauser, Eunice Santos, Ramesh Subramonian, and Thorsten von Eicken. LogP: towards a realistic model of parallel computation, ACM Symposium on Principles and Practice of Parallel Programming, 1993.
[19]
Benjamin Ylvisaker, Brian Van Essen and Carl Ebeling. A Type Architecture for Hybrid Micro-Parallel Computers. In IEEE Symposium on Field-Programmable Custom Computing Machines, 2006.
[20]
David G. Socha. Supporting Fine-Grain Computation on Distributed memory Parallel Computers. PhD Dissertation, University of Washington, 1991.
[21]
Calvin Lin, Jin-ling Lee and Lawrence Snyder. Programming SIMPLE for parallel portability, In U. Banerjee, D. Gelernter, A. Nicolau and D. Padua (eds.), Languages and Compilers for Parallel Computing, Springer-Verlag, pp. 84--98, 1992.
[22]
Calvin Lin. The Portability of Parallel Programs Across MIMD Computers. PhD Dissertation, University of Washington, 1992.
[23]
Ton Ahn Ngo and Lawrence Snyder. On the influence of programming models on shared memory computer performance. In Proceedings of the Scalable High Performance Computing Conference, 1992.
[24]
Ton Anh Ngo. The Role of Performance Models in Parallel Programming and Languages. PhD Dissertation, University of Washington, 1997.
[25]
Raymond Greenlaw and Lawrence Snyder. Achieving speedups for APL on an SIMD distributed memory machine. International Journal of Parallel Programming, 19(2):111---127, April 1990.
[26]
Walter S. Brainerd, Charles H. Goldberg and Jeanne C. Adams. Programmer's Guide to Fortran 90, 3rd Ed. Springer, 1996.
[27]
High Performance Fortran Language Specification Version 1.0 (1992) High Performance Fortran Forum, May 3, 1993. {34} Bradford L. Chamberlain. The Design and Implementation of a Region-Based Parallel Language. PhD Dissertation, University of Washington, 2001.
[28]
Bradford L. Chamberlain, Sung-Eun Choi, and Lawrence Snyder. A compiler abstraction for machine independent parallel communication generation. In Proceedings of the Workshop on Languages and Compilers for Parallel Computing, 1997.
[29]
M. R. Haghighat and C. D. Polychronopoulos. Symbolic analysis for parallelizing compilers. ACM Transactions on Programming Languages and Systems, 18(4):477-518, 1996.
[30]
J. R. Rose and G. L. Steele Jr. C*: An extended C language for data parallel programming. In Proceedings Second International Conference on Supercomputing, 1987.
[31]
Bradford L. Chamberlain, Sung-Eun Choi, E Christopher Lewis, Calvin Lin, Lawrence Snyder, and W. Derrick Weathersby. ZPL's WYSIWYG performance model. In Proceedings of the IEEE Workshop on High-Level Parallel Programming Models and Supportive Environments, 1998.
[32]
Sung-Eun Choi. Machine Independent Communication Optimization. PhD Dissertation, University of Washington, March 1999.
[33]
Bradford L. Chamberlain. The Design and Implementation of a Region-based Parallel Programming Language. PhD Dissertation, University of Washington, 2001.
[34]
Bradford L. Chamberlain, E Christopher Lewis, and Lawrence Snyder. Problem space promotion and its evaluation as a technique for efficient parallel computation. In Proceedings of the ACM International Conference on Supercomputing, 1999.
[35]
Robert van de Geijn and Jerrell Watts. SUMMA: Scalable Universal Matrix Multiplication Algorithm. Concurrency: Practice and Experience, 1998.
[36]
R. E. Cypher. J. L. C. Sanz and L. Snyder. Algorithms for image component labeling on SIMD mesh connected computers. IEEE Transactions on Computers, 39(2):276--281, 1990.
[37]
MasPar Programming Language (ANSI C-compatible MPL) Reference Manual Document Part Number: 9302-0001 Revision: A3 July 1992.
[38]
Gregory R. Watson. The Design and Implementation of a Parallel Relative Debugger. PhD Dissertation, Monash University, 2000.
[39]
W. P. Crowley et al. The SIMPLE code. Technical Report UCID 17715, Lawrence Livermore Laboratory, February 1978.
[40]
Lawrence Snyder. A Programmer's Guide to ZPL. MIT Press, Cambridge, MA, 1999. (The language changed in small ways and has been extended; it is now most accurately described in Chapter 2 of Chamberlain {33}.)
[41]
Steven J. Deitz, Bradford L. Chamberlain, Sung-Eun Choi, and Lawrence Snyder. The design and implementation of a parallel array operator for the arbitrary remapping of data. In Proceedings of the ACM Conference on Principles and Practice of Parallel Programming, 2003.
[42]
Adam Beguelin, Jack Dongara, Al Geist, Robert Manchek, and Vaidy Sunderam. User guide to PVM. Oak Ridge National Laboratory, Oak Ridge TN 378 316367, 1993.
[43]
M. Snir, S. W. Otto, S. Huss-Lederman, D. W. Walker, and J. Dongarra. MPI: the Complete Reference. MIT Press, Cambridge, MA, USA, 1996.
[44]
Bradford L. Chamberlain, Sung-Eun Choi, and Lawrence Snyder. A compiler abstraction for machine-independent parallel communication generation. In Proceedings of the Workshop on Languages and Compilers for Parallel Computing, 1997.
[45]
Sung-Eun Choi. Machine Independent Communication Optimization. PhD Dissertation, University of Washington, March 1999.
[46]
E Christopher Lewis, Calvin Lin, and Lawrence Snyder. The implementation and evaluation of fusion and contraction in array languages. In Proceedings of the ACM Conference on Programming Language Design and Implementation, 1998.
[47]
E Christopher Lewis. Achieving Robust Performance in Parallel Programming Languages. PhD Dissertation, University of Washington, February 2001.
[48]
W. Derrick Weathersby. Machine-Independent Compiler Optimizations for Collective Communication. PhD Dissertation, University of Washington, August 1999.
[49]
Steven J. Deitz, Bradford L. Chamberlain, and Lawrence Snyder. Eliminating redundancies in sum-of-product array computations. In Proceedings of the ACM International Conference on Supercomputing, 2001.
[50]
Bradford L. Chamberlain, Sung-Eun Choi, E Christopher Lewis, Calvin Lin, Lawrence Snyder, and W. Derrick Weathersby. Factor-join: A unique approach to compiling array languages for parallel machines. In Proceedings of the Workshop on Languages and Compilers for Parallel Computing, 1996.
[51]
Philip S. Abrams. An APL Machine, PhD Dissertation. Stanford University, SLAC Report 114, 1970.
[52]
Ulrich Ruede. Mathematical and computational techniques for multilevel adaptive methods, Frontiers in Applied Mathematics, 13, SIAM, 1993.
[53]
Guy E. Blelloch. Vector Models for Data-Parallel Computing. MIT Press, 1990.
[54]
R. E. Ladner and M. J. Fischer. Parallel prefix computation. In Proceedings of the IEEE International Conference on Parallel Processing, 1977.
[55]
Steven J. Deitz. High-Level Programming Language Abstractions for Advanced and Dynamic Parallel Computations. PhD Dissertation, University of Washington, February 2005.
[56]
Steven J. Deitz, Bradford L. Chamberlain, and Lawrence Snyder. High-level language support for user-defined reductions. Journal of Supercomputing, 23(1), 2002.
[57]
E Christopher Lewis and Lawrence Snyder. Pipelining wavefront computations: Experiences and performance. In Proceedings of the IEEE Workshop on High-Level Parallel Programming Models and Supportive Environments, May 2000.
[58]
Bradford L. Chamberlain, E Christopher Lewis, and Lawrence Snyder. Problem space promotion and its evaluation as a technique for efficient parallel computation. In Proceedings of the ACM International Conference on Supercomputing, 1999.
[59]
Calvin Lin, Lawrence Snyder, Ruth Anderson, Brad Chamberlain, Sung-Eun Choi, George Forman, E. Christopher Lewis, and W. Derrick Weathersby. ZPL vs. HPF: A Comparison of Performance and Programming Style, TR # 95-11-05 (available online from the University of Washington CSE technical report archive).
[60]
David Callahan, Bradford L. Chamberlain, and Hans P. Zima. The Cascade High Productivity Language. In 9th International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS 2004), pp 52--60, April 2004.
[61]
Philippe Charles, Christian Grothoff, Vijay Saraswat, Christopher Donawa, Allan Kielstra, Kemal Ebcioglu, Christoph von Praun, and Vivek Sarkar. X10: An object-oriented approach to non-uniform cluster computing. In 20th OOPSLA, pp. 519--538, 2005.
[62]
David Bailey, Tim Harris, William Saphir, Rob van der Wijngaart, AlexWoo, and Maurice Yarrow. The NAS parallel benchmarks 2.0. Technical Report NAS-95-020, Nasa Ames Research Center, Moffet Field, CA, December 1995.
[63]
Bradford L. Chamberlain, Steven J. Deitz, and Lawrence Snyder. A comparative study of the NAS MG benchmark across parallel languages and architectures. In Proceedings of the ACM Conference on Supercomputing, 2000.
[64]
R. Alverson, D. Callahan, D. Cummings, B. Koblenz, A. Porterfield, and B. Smith. The Tera Computer System. ACM SIGARCH Computer Architecture News, 18(3):1 - 6, 1990.
[65]
Michael A. Hiltzik. Dealers in Lightning: Xerox PARC and the Dawn of the Computer Age, Harper Collins, 1999.
[66]
Eric Steven Raymond and Rob W. Landley. The Art of Unix Usability, Creative Commons, 2004. http://www.catb.org/~esr/writings/taouu/html/ch02.html.
[67]
Calvin Lin and Lawrence Snyder. A comparison of programming models for shared memory multiprocessors. In Proceedings of the IEEE International Conference on Parallel Processing, 1990.
[68]
M. Kandemir, P. Banerjee, A. Choudhary, J. Ramanujam and N. Shenoy. A global communication optimization technique based on data-flow analysis and linear algebra. ACM Transactions on Programming Languages and Systems 21(6):1251--1297, 1999.
[69]
A. J. Wagner, L. Giraud and C. E. Scott. Simulation of a cusped bubble rising in a viscoelastic fluid with a new numerical method, Computer Physics Communications, 129(3):227--232, 2000.
[70]
Robert W. Numrich, John Reid, and Kieun Kim. Writing a multigrid solver using Co-array Fortran. In Proceedings of the Fourth International Workshop on Applied Parallel Computing, 1998.
[71]
S. B. Scholz. A case study: Effects of WITH-loop-folding on the NAS benchmark MG in SAC. In Proceedings of IFL '98, Springer-Verlag, 1998.

Cited By

View all

Index Terms

  1. The design and development of ZPL

                                Recommendations

                                Comments

                                Information & Contributors

                                Information

                                Published In

                                cover image ACM Conferences
                                HOPL III: Proceedings of the third ACM SIGPLAN conference on History of programming languages
                                June 2007
                                483 pages
                                ISBN:9781595937667
                                DOI:10.1145/1238844
                                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                                Sponsors

                                Publisher

                                Association for Computing Machinery

                                New York, NY, United States

                                Publication History

                                Published: 09 June 2007

                                Permissions

                                Request permissions for this article.

                                Check for updates

                                Author Tags

                                1. CTA
                                2. WYSIWYG performance model
                                3. parallel language design
                                4. performance portability
                                5. problem space promotion
                                6. regions
                                7. type architecture

                                Qualifiers

                                • Article

                                Conference

                                HOPL-III '07
                                Sponsor:

                                Acceptance Rates

                                HOPL III Paper Acceptance Rate 12 of 12 submissions, 100%;
                                Overall Acceptance Rate 12 of 12 submissions, 100%

                                Contributors

                                Other Metrics

                                Bibliometrics & Citations

                                Bibliometrics

                                Article Metrics

                                • Downloads (Last 12 months)6
                                • Downloads (Last 6 weeks)0
                                Reflects downloads up to 06 Jan 2025

                                Other Metrics

                                Citations

                                Cited By

                                View all

                                View Options

                                Login options

                                View options

                                PDF

                                View or Download as a PDF file.

                                PDF

                                eReader

                                View online with eReader.

                                eReader

                                Media

                                Figures

                                Other

                                Tables

                                Share

                                Share

                                Share this Publication link

                                Share on social media