Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- tutorialMarch 2018
An Analysis of x86-64 Inline Assembly in C Programs
ACM SIGPLAN Notices (SIGPLAN), Volume 53, Issue 3Pages 84–99https://doi.org/10.1145/3296975.3186418C codebases frequently embed nonportable and unstandardized elements such as inline assembly code. Such elements are not well understood, which poses a problem to tool developers who aspire to support C code. This paper investigates the use of x86-64 ...
Also Published in:
VEE '18: Proceedings of the 14th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments: ISBN 9781450355797 - research-articleApril 2017
Typed Architectures: Architectural Support for Lightweight Scripting
- Channoh Kim,
- Jaehyeok Kim,
- Sungmin Kim,
- Dooyoung Kim,
- Namho Kim,
- Gitae Na,
- Young H. Oh,
- Hyeon Gyu Cho,
- Jae W. Lee
ACM SIGPLAN Notices (SIGPLAN), Volume 52, Issue 4Pages 77–90https://doi.org/10.1145/3093336.3037726Dynamic scripting languages are becoming more and more widely adopted not only for fast prototyping but also for developing production-grade applications. They provide high-productivity programming environments featuring high levels of abstraction with ...
Also Published in:
ASPLOS '17: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450344654 - research-articleApril 2017
Translation-Triggered Prefetching
ACM SIGPLAN Notices (SIGPLAN), Volume 52, Issue 4Pages 63–76https://doi.org/10.1145/3093336.3037705We propose translation-enabled memory prefetching optimizations or TEMPO, a low-overhead hardware mechanism to boost memory performance by exploiting the operating system's (OS) virtual memory subsystem. We are the first to make the following ...
Also Published in:
ASPLOS '17: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450344654 - research-articleApril 2017
Efficient Address Translation for Architectures with Multiple Page Sizes
ACM SIGPLAN Notices (SIGPLAN), Volume 52, Issue 4Pages 435–448https://doi.org/10.1145/3093336.3037704Processors and operating systems (OSes) support multiple memory page sizes. Superpages increase Translation Lookaside Buffer (TLB) hits, while small pages provide fine-grained memory protection. Ideally, TLBs should perform well for any distribution of ...
Also Published in:
ASPLOS '17: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450344654 - research-articleMarch 2015Best Paper
GhostRider: A Hardware-Software System for Memory Trace Oblivious Computation
ACM SIGPLAN Notices (SIGPLAN), Volume 50, Issue 4Pages 87–101https://doi.org/10.1145/2775054.2694385This paper presents a new, co-designed compiler and architecture called GhostRider for supporting privacy preserving computation in the cloud. GhostRider ensures all programs satisfy a property called memory-trace obliviousness (MTO): Even an adversary ...
Also Published in:
ASPLOS '15: Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems: ISBN 9781450328357 -
- research-articleJune 2014
Improving performance of loops on DIAM-based VLIW architectures
ACM SIGPLAN Notices (SIGPLAN), Volume 49, Issue 5Pages 135–144https://doi.org/10.1145/2666357.2597825Recent studies show that very long instruction word (VLIW) architectures, which inherently have wide datapath (e.g. 128 or 256 bits for one VLIW instruction word), can benefit from dynamic implied addressing mode (DIAM) and can achieve lower power ...
Also Published in:
LCTES '14: Proceedings of the 2014 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems: ISBN 9781450328777 - research-articleFebruary 2014
ASC: automatically scalable computation
ACM SIGPLAN Notices (SIGPLAN), Volume 49, Issue 4Pages 575–590https://doi.org/10.1145/2644865.2541985We present an architecture designed to transparently and automatically scale the performance of sequential programs as a function of the hardware resources available. The architecture is predicated on a model of computation that views program execution ...
Also Published in:
ASPLOS '14: Proceedings of the 19th international conference on Architectural support for programming languages and operating systems: ISBN 9781450323055 - research-articleFebruary 2014
Quasar: resource-efficient and QoS-aware cluster management
ACM SIGPLAN Notices (SIGPLAN), Volume 49, Issue 4Pages 127–144https://doi.org/10.1145/2644865.2541941Cloud computing promises flexibility and high performance for users and high cost-efficiency for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both cost effectiveness and future scalability.
We present Quasar, a ...
Also Published in:
ASPLOS '14: Proceedings of the 19th international conference on Architectural support for programming languages and operating systems: ISBN 9781450323055 - research-articleJune 2013
LUCAS: latency-adaptive unified cluster assignment and instruction scheduling
ACM SIGPLAN Notices (SIGPLAN), Volume 48, Issue 5Pages 45–54https://doi.org/10.1145/2499369.2465565Clustered VLIW architectures are statically scheduled wide-issue architectures that combine the advantages of wide-issue processors along with the power and frequency scalability of clustered designs. Being statically scheduled, they require that the ...
Also Published in:
LCTES '13: Proceedings of the 14th ACM SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems: ISBN 9781450320856 - research-articleJune 2013
Improving processor efficiency by statically pipelining instructions
ACM SIGPLAN Notices (SIGPLAN), Volume 48, Issue 5Pages 33–44https://doi.org/10.1145/2499369.2465559A new generation of applications requires reduced power consumption without sacrificing performance. Instruction pipelining is commonly used to meet application performance requirements, but some implementation aspects of pipelining are inefficient with ...
Also Published in:
LCTES '13: Proceedings of the 14th ACM SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems: ISBN 9781450320856 - research-articleFebruary 2013
From relational verification to SIMD loop synthesis
ACM SIGPLAN Notices (SIGPLAN), Volume 48, Issue 8Pages 123–134https://doi.org/10.1145/2517327.2442529Existing pattern-based compiler technology is unable to effectively exploit the full potential of SIMD architectures. We present a new program synthesis based technique for auto-vectorizing performance critical innermost loops. Our synthesis technique ...
Also Published in:
PPoPP '13: Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming: ISBN 9781450319225 - research-articleMarch 2011
A case for neuromorphic ISAs
ACM SIGPLAN Notices (SIGPLAN), Volume 46, Issue 3Pages 145–158https://doi.org/10.1145/1961296.1950385The desire to create novel computing systems, paired with recent advances in neuroscientific understanding of the brain, has led researchers to develop neuromorphic architectures that emulate the brain. To date, such models are developed, trained, and ...
Also Published in:
ASPLOS XVI: Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems: ISBN 9781450302661 - research-articleJune 2009
Modulo scheduling without overlapped lifetimes
ACM SIGPLAN Notices (SIGPLAN), Volume 44, Issue 7Pages 1–10https://doi.org/10.1145/1543136.1542454This paper describes complementary software- and hardware-based approaches for handling overlapping register lifetimes that occur in modulo scheduled loops. Modulo scheduling takes the N-instructions in a loop body and constructs an M-stage software ...
Also Published in:
LCTES '09: Proceedings of the 2009 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems: ISBN 9781605583563 - keynoteFebruary 2009
How to build programmable multi-core chips
ACM SIGPLAN Notices (SIGPLAN), Volume 44, Issue 4Pages 283–284https://doi.org/10.1145/1594835.1504179The arrival of multi-core chips has heightened interest in the discipline of parallel programming, a topic that has received much attention for many years. Computer architects have much to learn from sound principles for structuring software and ...
Also Published in:
PPoPP '09: Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming: ISBN 9781605583976 - research-articleJune 2008
Improving performance and reducing energy-delay with adaptive resource resizing for out-of-order embedded processors
ACM SIGPLAN Notices (SIGPLAN), Volume 43, Issue 7Pages 71–78https://doi.org/10.1145/1379023.1375668While Ultra Deep Submicron (UDSM) CMOS scaling gives embedded processor designers ample silicon budget to increase processor resources to improve performance, restrictions with the power budget and practically achievable operating clock frequencies act ...
Also Published in:
LCTES '08: Proceedings of the 2008 ACM SIGPLAN-SIGBED conference on Languages, compilers, and tools for embedded systems: ISBN 9781605581040 - articleJune 2006
Compiler-directed thermal management for VLIW functional units
ACM SIGPLAN Notices (SIGPLAN), Volume 41, Issue 7Pages 163–172https://doi.org/10.1145/1159974.1134674As processors, memories, and other components of today's embedded systems are pushed to higher performance in more enclosed spaces, processor thermal management is quickly becoming a limiting design factor. While previous proposals mostly approached ...
Also Published in:
LCTES '06: Proceedings of the 2006 ACM SIGPLAN/SIGBED conference on Language, compilers, and tool support for embedded systems: ISBN 159593362X - articleJune 2006
Auto-vectorization of interleaved data for SIMD
ACM SIGPLAN Notices (SIGPLAN), Volume 41, Issue 6Pages 132–143https://doi.org/10.1145/1133255.1133997Most implementations of the Single Instruction Multiple Data (SIMD) model available today require that data elements be packed in vector registers. Operations on disjoint vector elements are not supported directly and require explicit data ...
Also Published in:
PLDI '06: Proceedings of the 27th ACM SIGPLAN Conference on Programming Language Design and Implementation: ISBN 1595933204 - articleJune 2005
Static strands: safely collapsing dependence chains for increasing embedded power efficiency
ACM SIGPLAN Notices (SIGPLAN), Volume 40, Issue 7Pages 127–136https://doi.org/10.1145/1070891.1065929Modern embedded processors are designed to maximize execution efficiency--the amount of performance achieved per unit of energy dissipated while meeting minimum performance levels. To increase this efficiency we propose utilizing static strands, ...
Also Published in:
LCTES '05: Proceedings of the 2005 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems: ISBN 1595930183 - articleOctober 2004
Helper threads via virtual multithreading on an experimental itanium® 2 processor-based platform
- Perry H. Wang,
- Jamison D. Collins,
- Hong Wang,
- Dongkeun Kim,
- Bill Greene,
- Kai-Ming Chan,
- Aamir B. Yunus,
- Terry Sych,
- Stephen F. Moore,
- John P. Shen
ACM SIGPLAN Notices (SIGPLAN), Volume 39, Issue 11Pages 144–155https://doi.org/10.1145/1037187.1024411Helper threading is a technology to accelerate a program by exploiting a processor's multithreading capability to run ``assist'' threads. Previous experiments on hyper-threaded processors have demonstrated significant speedups by using helper threads to ...
Also Published in:
ASPLOS XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems: ISBN 1581138040