- research-article, April 2017
Sound Loop Superoptimization for Google Native Client
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 45, Issue 1, Pages 313–326, https://doi.org/10.1145/3093337.3037754
Software fault isolation (SFI) is an important technique for the construction of secure operating systems, web browsers, and other extensible software. We demonstrate that superoptimization can dramatically improve the performance of Google Native ...
Also Published in:
ASPLOS '17: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450344654
- research-article, April 2017
What Scalable Programs Need from Transactional Memory
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 45, Issue 1, Pages 105–118, https://doi.org/10.1145/3093337.3037750
Transactional memory (TM) has been the focus of numerous studies, and it is supported in processors such as the IBM Blue Gene/Q and Intel Haswell. Many studies have used the STAMP benchmark suite to evaluate their designs. However, the speedups obtained ...
Also Published in:
ASPLOS '17: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450344654
- research-article, April 2017
An Architecture Supporting Formal and Compositional Binary Analysis
- Joseph McMahan,
- Michael Christensen,
- Lawton Nichols,
- Jared Roesch,
- Sung-Yee Guo,
- Ben Hardekopf,
- Timothy Sherwood
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 45, Issue 1, Pages 177–191, https://doi.org/10.1145/3093337.3037733
Building a trustworthy life-critical embedded system requires deep reasoning about the potential effects that sequences of machine instructions can have on full system operation. Rather than trying to analyze complete binaries and the countless ways ...
Also Published in:
ASPLOS '17: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450344654
- research-article, April 2017
ProRace: Practical Data Race Detection for Production Use
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 45, Issue 1, Pages 149–162, https://doi.org/10.1145/3093337.3037708
This paper presents ProRace, a dynamic data race detector practical for production runs. It is lightweight, but still offers high race detection capability. To track memory accesses, ProRace leverages instruction sampling using the performance ...
Also Published in:
ASPLOS '17: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450344654
- research-article, March 2016
Generating Configurable Hardware from Parallel Patterns
- Raghu Prabhakar,
- David Koeplinger,
- Kevin J. Brown,
- HyoukJoong Lee,
- Christopher De Sa,
- Christos Kozyrakis,
- Kunle Olukotun
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 44, Issue 2, Pages 651–665, https://doi.org/10.1145/2980024.2872415
In recent years the computing landscape has seen an increasing shift towards specialized accelerators. Field programmable gate arrays (FPGAs) are particularly promising for the implementation of these accelerators, as they offer significant performance ...
Also Published in:
ASPLOS '16: Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450340915
- research-article, March 2016
TxRace: Efficient Data Race Detection Using Commodity Hardware Transactional Memory
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 44, Issue 2, Pages 159–173, https://doi.org/10.1145/2980024.2872384
Detecting data races is important for debugging shared-memory multithreaded programs, but the high runtime overhead prevents the wide use of dynamic data race detectors. This paper presents TxRace, a new software data race detector that leverages ...
Also Published in:
ASPLOS '16: Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450340915
- research-article, March 2015
Reduced Hardware NOrec: A Safe and Scalable Hybrid Transactional Memory
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 43, Issue 1, Pages 59–71, https://doi.org/10.1145/2786763.2694393
Because of hardware TM limitations, software fallbacks are the only way to make TM algorithms guarantee progress. Nevertheless, all known software fallbacks to date, from simple locks to sophisticated versions of the NOrec Hybrid TM algorithm, have ...
Also Published in:
ASPLOS '15: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450328357
- research-article, March 2015
Temporally Bounding TSO for Fence-Free Asymmetric Synchronization
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 43, Issue 1, Pages 45–58, https://doi.org/10.1145/2786763.2694374
This paper introduces a temporally bounded total store ordering (TBTSO) memory model, and shows that it enables nonblocking fence-free solutions to asymmetric synchronization problems, such as those arising in memory reclamation and biased locking.
...
Also Published in:
ASPLOS '15: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450328357
- research-article, March 2015
iThreads: A Threading Library for Parallel Incremental Computation
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 43, Issue 1, Pages 645–659, https://doi.org/10.1145/2786763.2694371
Incremental computation strives for efficient successive runs of applications by re-executing only those parts of the computation that are affected by a given input change instead of recomputing everything from scratch. To realize these benefits ...
Also Published in:
ASPLOS '15: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450328357
- research-article, March 2015
Asynchronized Concurrency: The Secret to Scaling Concurrent Search Data Structures
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 43, Issue 1, Pages 631–644, https://doi.org/10.1145/2786763.2694359
We introduce "asynchronized concurrency (ASCY)," a paradigm consisting of four complementary programming patterns. ASCY calls for the design of concurrent search data structures (CSDSs) to resemble that of their sequential counterparts. We argue that ...
Also Published in:
ASPLOS '15: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450328357
- research-article, March 2015
Synchronization Using Remote-Scope Promotion
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 43, Issue 1, Pages 73–86, https://doi.org/10.1145/2786763.2694350
Heterogeneous system architecture (HSA) and OpenCL define scoped synchronization to facilitate low overhead communication across a subset of threads. Scoped synchronization works well for static sharing patterns, where consumer threads are known a ...
Also Published in:
ASPLOS '15: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450328357
- column, June 2014
Blueshell: a platform for rapid prototyping of multiprocessor NoCs and accelerators
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 41, Issue 5, Pages 107–117, https://doi.org/10.1145/2641361.2641379
The rapid increase in FPGA logic capacity has enabled the prototyping of multiprocessor Network-on-Chip (NoC) architectures. However, the design space exploration of these complex architectures is highly time consuming with traditional methodologies for ...
- research-article, February 2014
Data-parallel finite-state machines
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 42, Issue 1, Pages 529–542, https://doi.org/10.1145/2654822.2541988
A finite-state machine (FSM) is an important abstraction for solving several problems, including regular-expression matching, tokenizing text, and Huffman decoding. FSM computations typically involve data-dependent iterations with unpredictable memory-...
Also Published in:
ASPLOS '14: Proceedings of the 19th international conference on Architectural support for programming languages and operating systems: ISBN 9781450323055
- research-article, February 2014
Fence-free work stealing on bounded TSO processors
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 42, Issue 1, Pages 413–426, https://doi.org/10.1145/2654822.2541987
Work stealing is the method of choice for load balancing in task parallel programming languages and frameworks. Yet despite considerable effort invested in optimizing work stealing task queues, existing algorithms issue a costly memory fence when ...
Also Published in:
ASPLOS '14: Proceedings of the 19th international conference on Architectural support for programming languages and operating systems: ISBN 9781450323055
- research-article, February 2014
RelaxReplay: record and replay for relaxed-consistency multiprocessors
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 42, Issue 1, Pages 223–238, https://doi.org/10.1145/2654822.2541979
Record and Deterministic Replay (RnR) of multithreaded programs on relaxed-consistency multiprocessors has been a long-standing problem. While there are designs that work for Total Store Ordering (TSO), finding a general solution that is able to record ...
Also Published in:
ASPLOS '14: Proceedings of the 19th international conference on Architectural support for programming languages and operating systems: ISBN 9781450323055
- research-article, February 2014
Transactionalizing legacy code: an experience report using GCC and Memcached
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 42, Issue 1, Pages 399–412, https://doi.org/10.1145/2654822.2541960
The addition of transactional memory (TM) support to existing languages provides the opportunity to create new software from scratch using transactions, and also to simplify or extend legacy code by replacing existing synchronization with language-...
Also Published in:
ASPLOS '14: Proceedings of the 19th international conference on Architectural support for programming languages and operating systems: ISBN 9781450323055
- research-article, June 2013
Orchestrated scheduling and prefetching for GPGPUs
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 41, Issue 3, Pages 332–343, https://doi.org/10.1145/2508148.2485951
In this paper, we present techniques that coordinate the thread scheduling and prefetching decisions in a General Purpose Graphics Processing Unit (GPGPU) architecture to better tolerate long memory latencies. We demonstrate that existing warp ...
Also Published in:
ISCA '13: Proceedings of the 40th Annual International Symposium on Computer Architecture: ISBN 9781450320795
- research-article, March 2013
OWL: cooperative thread array aware scheduling techniques for improving GPGPU performance
- Adwait Jog,
- Onur Kayiran,
- Nachiappan Chidambaram Nachiappan,
- Asit K. Mishra,
- Mahmut T. Kandemir,
- Onur Mutlu,
- Ravishankar Iyer,
- Chita R. Das
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 41, Issue 1, Pages 395–406, https://doi.org/10.1145/2490301.2451158
Emerging GPGPU architectures, along with programming models like CUDA and OpenCL, offer a cost-effective platform for many applications by providing high thread level parallelism at lower energy budgets. Unfortunately, for many general-purpose ...
Also Published in:
ASPLOS '13: Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems: ISBN 9781450318709
- research-article, March 2013
ConAir: featherweight concurrency bug recovery via single-threaded idempotent execution
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 41, Issue 1, Pages 113–126, https://doi.org/10.1145/2490301.2451129
Many concurrency bugs are hidden in deployed software and cause severe failures for end-users. When they finally manifest and become known by developers, they are difficult to fix correctly. To support end-users, we need techniques that help software ...
Also Published in:
ASPLOS '13: Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems: ISBN 9781450318709
- research-article, March 2013
Cooperative empirical failure avoidance for multithreaded programs
ACM SIGARCH Computer Architecture News (SIGARCH), Volume 41, Issue 1, Pages 39–50, https://doi.org/10.1145/2490301.2451121
Concurrency errors in multithreaded programs are difficult to find and fix. We propose Aviso, a system for avoiding schedule-dependent failures. Aviso monitors events during a program's execution and, when a failure occurs, records a history of events ...
Also Published in:
ASPLOS '13: Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems: ISBN 9781450318709