Language types

Applied Filters

People

Publications

Conferences

Publication Date

24 Results for: Book/Issue: PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,823,348 records)|Limit your search to The ACM Full-Text Collection (772,531 records)

Showing 1 - 20of24 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

poster
February 2011
Active pebbles: a programming model for highly parallel fine-grained data-driven computations
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 305–306https://doi.org/10.1145/1941553.1941601

A variety of programming models exist to support large-scale, distributed memory, parallel computation. These programming models have historically targeted coarse-grained applications with natural locality such as those found in a variety of scientific ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
10
341
Metrics
Total Citations10
Total Downloads341
Last 12 Months6
Last 6 weeks0
Get Access
poster
February 2011
A wait-free NCAS library for parallel applications with timing constraints
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 301–302https://doi.org/10.1145/1941553.1941599

We introduce our major ideas of a wait-free, linearizable, and disjoint access parallel NCAS library, called rtNCAS. It focuses the construction of wait-free data structure operations (DSO) in real-time circumstances. rtNCAS is able to conditionally ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
0
149
Metrics
Total Citations0
Total Downloads149
Last 12 Months5
Last 6 weeks0
Get Access
poster
February 2011
Two examples of parallel programming without concurrency constructs (PP-CC)
- Chen Ding
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 299–300https://doi.org/10.1145/1941553.1941598
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
0
306
Metrics
Total Citations0
Total Downloads306
Last 12 Months5
Last 6 weeks0
Get Access
poster
February 2011
Evaluating graph coloring on GPUs
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 297–298https://doi.org/10.1145/1941553.1941597

This paper evaluates features of graph coloring algorithms implemented on graphics processing units (GPUs), comparing coloring heuristics and thread decompositions. As compared to prior work on graph coloring for other parallel architectures, we find ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
25
529
Metrics
Total Citations25
Total Downloads529
Last 12 Months27
Last 6 weeks1
Get Access
poster
February 2011
Time skewing made simple
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 295–296https://doi.org/10.1145/1941553.1941596

Time skewing and loop tiling has been known for a long time to be a highly beneficial acceleration technique for nested loops especially on bandwidth hungry multi-core processors, but it is little used in practice because efficient implementations ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
14
237
Metrics
Total Citations14
Total Downloads237
Last 12 Months9
Last 6 weeks0
Get Access
poster
February 2011
Kremlin: like gprof, but for parallelization
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 293–294https://doi.org/10.1145/1941553.1941595

This paper overviews Kremlin, a software profiling tool designed to assist the parallelization of serial programs. Kremlin accepts a serial source code, profiles it, and provides a list of regions that should be considered in parallelization. Unlike a ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
15
269
Metrics
Total Citations15
Total Downloads269
Last 12 Months7
Last 6 weeks0
Get Access
poster
February 2011
Weak atomicity under the x86 memory consistency model
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 291–292https://doi.org/10.1145/1941553.1941594

We consider the problem of building a weakly atomic Software Transactional Memory (STM), that provides Single (Global) Lock Atomicity (SLA) while adhering to the x86 memory consistency model (x86-MM).

Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
0
257
Metrics
Total Citations0
Total Downloads257
Last 12 Months7
Last 6 weeks1
Get Access
research-article
February 2011
Achieving a single compute device image in OpenCL for multiple GPUs
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 277–288https://doi.org/10.1145/1941553.1941591

In this paper, we propose an OpenCL framework that combines multiple GPUs and treats them as a single compute device. Providing a single virtual compute device image to the user makes an OpenCL application written for a single GPU portable to the ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
117
2,214
Metrics
Total Citations117
Total Downloads2,214
Last 12 Months44
Last 6 weeks7
Get Access
research-article
February 2011
Accelerating CUDA graph algorithms at maximum warp
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 267–276https://doi.org/10.1145/1941553.1941590

Graphs are powerful data representations favored in many computational domains. Modern GPUs have recently shown promising results in accelerating computationally challenging graph problems but their performance suffered heavily when the graph structure ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
313
2,014
Metrics
Total Citations313
Total Downloads2,014
Last 12 Months90
Last 6 weeks17
Get Access
research-article
February 2011
The STAPL parallel container framework
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 235–246https://doi.org/10.1145/1941553.1941586

The Standard Template Adaptive Parallel Library (STAPL) is a parallel programming infrastructure that extends C++ with support for parallelism. It includes a collection of distributed data structures called pContainers that are thread-safe, concurrent ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
43
398
Metrics
Total Citations43
Total Downloads398
Last 12 Months16
Last 6 weeks3
Get Access
research-article
February 2011
Lifeline-based global load balancing
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 201–212https://doi.org/10.1145/1941553.1941582

On shared-memory systems, Cilk-style work-stealing has been used to effectively parallelize irregular task-graph based applications such as Unbalanced Tree Search (UTS). There are two main difficulties in extending this approach to distributed memory. ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
98
1,044
Metrics
Total Citations98
Total Downloads1,044
Last 12 Months31
Last 6 weeks1
Get Access
research-article
February 2011
Lock-free and scalable multi-version software transactional memory
- Sérgio Miguel Fernandes,
- João Cachopo
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 179–188https://doi.org/10.1145/1941553.1941579

Software Transactional Memory (STM) was initially proposed as a lock-free mechanism for concurrency control. Early implementations had efficiency limitations, and soon obstruction-free proposals appeared, to tackle this problem, often simplifying STM ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
62
625
Metrics
Total Citations62
Total Downloads625
Last 12 Months30
Last 6 weeks3
Get Access
research-article
February 2011
Transaction communicators: enabling cooperation among concurrent transactions
- Victor Luchangco,
- Virendra J. Marathe
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 169–178https://doi.org/10.1145/1941553.1941578

In this paper, we propose to extend transactional memory with transaction communicators, special objects through which concurrent transactions can communicate: changes by one transaction to a communicator can be seen by concurrent transactions before ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
20
316
Metrics
Total Citations20
Total Downloads316
Last 12 Months9
Last 6 weeks0
Get Access
research-article
February 2011
Communicating memory transactions
- Mohsen Lesani,
- Jens Palsberg
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 157–168https://doi.org/10.1145/1941553.1941577

Many concurrent programming models enable both transactional memory and message passing. For such models, researchers have built increasingly efficient implementations and defined reasonable correctness criteria, while it remains an open problem to ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
21
331
Metrics
Total Citations21
Total Downloads331
Last 12 Months7
Last 6 weeks0
Get Access
research-article
February 2011
Thread contracts for safe parallelism
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 125–134https://doi.org/10.1145/1941553.1941573

We build a framework of thread contracts, called Accord, that allows programmers to annotate their concurrency co-ordination strategies. Accord annotations allow programmers to declaratively specify the parts of memory that a thread may read or write ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
10
262
Metrics
Total Citations10
Total Downloads262
Last 12 Months7
Last 6 weeks0
Get Access
research-article
February 2011
ScalaExtrap: trace-based communication extrapolation for spmd programs
- Xing Wu,
- Frank Mueller
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 113–122https://doi.org/10.1145/1941553.1941569

Performance modeling for scientific applications is important for assessing potential application performance and systems procurement in high-performance computing (HPC). Recent progress on communication tracing opens up novel opportunities for ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
55
335
Metrics
Total Citations55
Total Downloads335
Last 12 Months6
Last 6 weeks0
Get Access
research-article
February 2011
ULCC: a user-level facility for optimizing shared cache performance on multicores
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 103–112https://doi.org/10.1145/1941553.1941568

Scientific applications face serious performance challenges on multicore processors, one of which is caused by access contention in last level shared caches from multiple running threads. The contention increases the number of long latency memory ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
63
504
Metrics
Total Citations63
Total Downloads504
Last 12 Months9
Last 6 weeks1
Get Access
research-article
February 2011
OoOJava: software out-of-order execution
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 57–68https://doi.org/10.1145/1941553.1941563

Developing parallel software using current tools can be challenging. Even experts find it difficult to reason about the use of locks and often accidentally introduce race conditions and deadlocks into parallel software. OoOJava is a compiler-assisted ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
40
425
Metrics
Total Citations40
Total Downloads425
Last 12 Months42
Last 6 weeks5
Get Access
research-article
February 2011
Copperhead: compiling an embedded data parallel language
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 47–56https://doi.org/10.1145/1941553.1941562

Modern parallel microprocessors deliver high performance on applications that expose substantial fine-grained data parallelism. Although data parallelism is widely available in many computations, implementing data parallel algorithms in low-level ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
149
910
Metrics
Total Citations149
Total Downloads910
Last 12 Months22
Last 6 weeks1
Get Access
research-article
February 2011
A domain-specific approach to heterogeneous parallelism
PPoPP '11: Proceedings of the 16th ACM symposium on Principles and practice of parallel programmingPages 35–46https://doi.org/10.1145/1941553.1941561

Exploiting heterogeneous parallel hardware currently requires mapping application code to multiple disparate programming models. Unfortunately, general-purpose programming models available today can yield high performance but are too low-level to be ...
Also Published in:
ACM SIGPLAN Notices: Volume 46 Issue 8
123
1,166
Metrics
Total Citations123
Total Downloads1,166
Last 12 Months29
Last 6 weeks5
Get Access