- Article, June 2001
LogGPS: a parallel computational model for synchronization analysis
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 133–142
https://doi.org/10.1145/379539.379592
We present a new parallel computational model, named LogGPS, which captures synchronization.
The LogGPS model is an extension of the LogGP model, which abstracts communication on parallel platforms. Although the LogGP model captures long messages with ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Blocking and array contraction across arbitrarily nested loops using affine partitioning
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 103–112
https://doi.org/10.1145/379539.379586
Applicable to arbitrary sequences and nests of loops, affine partitioning is a program transformation framework that unifies many previously proposed loop transformations, including unimodular transforms, fusion, fission, reindexing, scaling and ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Scalable queue-based spin locks with timeout
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 44–52
https://doi.org/10.1145/379539.379566
Queue-based spin locks allow programs with busy-wait synchronization to scale to very large multiprocessors, without fear of starvation or performance-destroying contention. So-called try locks, traditionally based on non-scalable test-and-set locks, ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Language support for Morton-order matrices
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 24–33
https://doi.org/10.1145/379539.379559
The uniform representation of 2-dimensional arrays serially in Morton order (or Z order) supports both their iterative scan with cartesian indices and their divide-and-conquer manipulation as quaternary trees. This data structure is important ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Pointer and escape analysis for multithreaded programs
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 12–23
https://doi.org/10.1145/379539.379553
This paper presents a new combined pointer and escape analysis for multithreaded programs. The algorithm uses a new abstraction called parallel interaction graphs to analyze the interactions between threads and extract precise points-to, escape, and ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Reference idempotency analysis: a framework for optimizing speculative execution
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 2–11
https://doi.org/10.1145/379539.379547
Recent proposals for multithreaded architectures allow threads with unknown dependences to execute speculatively in parallel. These architectures use hardware speculative storage to buffer uncertain data, track data dependences and roll back incorrect ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Statistical scalability analysis of communication operations in distributed applications
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 123–132
https://doi.org/10.1145/379539.379590
Current trends in high performance computing suggest that users will soon have widespread access to clusters of multiprocessors with hundreds, if not thousands, of processors. This unprecedented degree of parallelism will undoubtedly expose scalability ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Efficiency vs. portability in cluster-based network servers
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 113–122
https://doi.org/10.1145/379539.379589
Efficiency and portability are conflicting objectives for cluster-based network servers that distribute the clients' requests across the cluster based on the actual content requested. Our work is based on the observation that this efficiency vs. ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
High-level adaptive program optimization with ADAPT
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 93–102
https://doi.org/10.1145/379539.379583
Compile-time optimization is often limited by a lack of target machine and input data set knowledge. Without this information, compilers may be forced to make conservative assumptions to preserve correctness and to avoid performance degradation. In ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Source-level global optimizations for fine-grain distributed shared memory systems
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 83–92
https://doi.org/10.1145/379539.379578
This paper describes and evaluates the use of aggressive static analysis in Jackal, a fine-grain Distributed Shared Memory (DSM) system for Java. Jackal uses an optimizing, source-level compiler rather than the binary rewriting techniques employed by ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Dynamic adaptation to available resources for parallel computing in an autonomous network of workstations
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 72–81
https://doi.org/10.1145/379539.379572
Networks of workstations (NOWs), which are generally composed of autonomous compute elements networked together, are an attractive parallel computing platform since they offer high performance at low cost. The autonomous nature of the environment, ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Accurate data redistribution cost estimation in software distributed shared memory systems
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 62–71
https://doi.org/10.1145/379539.379570
Distributing data is one of the key problems in implementing efficient distributed-memory parallel programs. The problem becomes more difficult in programs where data redistribution between computational phases is considered. The global data ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Contention elimination by replication of sequential sections in distributed shared memory programs
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 53–61
https://doi.org/10.1145/379539.379568
In shared memory programs contention often occurs at the transition between a sequential and a parallel section of the code. As all threads start executing the parallel section, they often access data just modified by the thread that executed the ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Efficient load balancing for wide-area divide-and-conquer applications
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Pages 34–43
https://doi.org/10.1145/379539.379563
Divide-and-conquer programs are easily parallelized by letting the programmer annotate potential parallelism in the form of spawn and sync constructs. To achieve efficient program execution, the generated work load has to be balanced evenly among the ...
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7
- Article, June 2001
Parallel computing in 2010: opportunities and challenges in a networked world
PPoPP '01: Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, Page 1
https://doi.org/10.1145/379539.379546
Also Published in:
ACM SIGPLAN Notices: Volume 36 Issue 7