- research-article, December 2024 (Just Accepted)
PACE: A Piece-Wise Approximate Floating-Point Divider with Runtime Configurability and High Energy Efficiency
ACM Transactions on Design Automation of Electronic Systems (TODAES), Just Accepted. https://doi.org/10.1145/3706634
Approximate computing emerges as a viable solution to enhance energy efficiency in applications sensitive to human perception, particularly on edge devices. This work introduces a novel piece-wise approximate floating-point divider that boasts resource ...
- research-article, December 2024
Watch Out for the Inherent Vulnerabilities in Developing Multi-tenant Cloud-FPGA: Communication Protocols
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 30, Issue 1, Article No.: 13, Pages 1–24. https://doi.org/10.1145/3702324
As FPGAs are being deployed in the cloud infrastructure for acceleration, the technology of multi-tenant FPGA has emerged as a topic of interest. This development has drawn considerable attention to its security issues. While previous research primarily ...
- research-article, December 2024 (Just Accepted)
ISOAcc: In-situ Shift Operation-based Accelerator For Efficient in-SRAM Multiplication
ACM Transactions on Design Automation of Electronic Systems (TODAES), Just Accepted. https://doi.org/10.1145/3707205
Digital SRAM-based CIM architectures must balance three critical factors: quantized neural network bitwidth, accuracy loss, and computational efficiency, each crucial to optimizing performance and efficiency. In Domain Specific Accelerators (DSAs), ...
- research-article, November 2024
Area-driven Boolean bi-decomposition by function approximation
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 30, Issue 1, Article No.: 5, Pages 1–21. https://doi.org/10.1145/3698879
Bi-decomposition rewrites logic functions as the composition of simpler components. It is related to Boolean division, where a given function is rewritten as the product of a divisor and a quotient, but bi-decomposition can be defined for any Boolean ...
- research-article, November 2024
Performance Analysis of CNN Inference/Training with Convolution and Non-Convolution Operations on ASIC Accelerators
- Hadi Esmaeilzadeh,
- Soroush Ghodrati,
- Andrew B. Kahng,
- Sean Kinzer,
- Susmita Dey Manasi,
- Sachin S. Sapatnekar,
- Zhiang Wang
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 30, Issue 1, Article No.: 3, Pages 1–34. https://doi.org/10.1145/3696665
Today’s performance analysis frameworks for deep learning accelerators suffer from two significant limitations. First, although modern convolutional neural networks (CNNs) consist of many types of layers other than convolution, especially during training, ...
- research-article, September 2024
Estimating Power, Performance, and Area for On-Sensor Deployment of AR/VR Workloads Using an Analytical Framework
- Xiaoyu Sun,
- Xiaochen Peng,
- Sai Qian Zhang,
- Jorge Gomez,
- Win-San Khwa,
- Syed Shakib Sarwar,
- Ziyun Li,
- Weidong Cao,
- Zhao Wang,
- Chiao Liu,
- Meng-Fan Chang,
- Barbara De Salvo,
- Kerem Akarvardar,
- H.-S. Philip Wong
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 6, Article No.: 93, Pages 1–27. https://doi.org/10.1145/3670404
Augmented Reality and Virtual Reality have emerged as the next frontier of intelligent image sensors and computer systems. In these systems, 3D die stacking stands out as a compelling solution, enabling in situ processing capability of the sensory data ...
- research-article, July 2024
Removal of SAT-Hard Instances in Logic Obfuscation Through Inference of Functionality
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 4, Article No.: 71, Pages 1–23. https://doi.org/10.1145/3674903
Logic obfuscation is a prominent approach to protect intellectual property within integrated circuits during fabrication. Many attacks on logic locking have been proposed, particularly in the Boolean satisfiability (SAT) attack family, leading to the ...
- research-article, July 2024
An Open-Source ML-Based Full-Stack Optimization Framework for Machine Learning Accelerators
- Hadi Esmaeilzadeh,
- Soroush Ghodrati,
- Andrew Kahng,
- Joon Kyung Kim,
- Sean Kinzer,
- Sayak Kundu,
- Rohan Mahapatra,
- Susmita Dey Manasi,
- Sachin Sapatnekar,
- Zhiang Wang,
- Ziqing Zeng
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 4, Article No.: 68, Pages 1–33. https://doi.org/10.1145/3664652
Parameterizable machine learning (ML) accelerators are the product of recent breakthroughs in ML. To fully enable their design space exploration (DSE), we propose a physical-design-driven, learning-based prediction framework for hardware-accelerated deep ...
- research-article, July 2024
A Single Bitline Highly Stable, Low Power With High Speed Half-Select Disturb Free 11T SRAM Cell
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 4, Article No.: 67, Pages 1–13. https://doi.org/10.1145/3653675
A half-select disturb-free 11T (HF11T) static random access memory (SRAM) cell with low power, better stability and high speed is presented in this paper. The proposed SRAM cell works well with bit-interleaving design, which enhances soft-error immunity. ...
- research-article, June 2024
Semi-Permanent Stuck-At Fault injection attacks on Elephant and GIFT lightweight ciphers
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 4, Article No.: 66, Pages 1–32. https://doi.org/10.1145/3662734
Fault attacks pose a potent threat to modern cryptographic implementations, particularly those used in physically approachable embedded devices in IoT environments. Information security in such resource-constrained devices is ensured using lightweight ...
- research-article, June 2024
Load Balanced PIM-Based Graph Processing
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 4, Article No.: 61, Pages 1–22. https://doi.org/10.1145/3659951
Graph processing is widely used for many modern applications, such as social networks, recommendation systems, and knowledge graphs. However, processing large-scale graphs on traditional Von Neumann architectures is challenging due to the irregular graph ...
- research-article, May 2024
WCPNet: Jointly Predicting Wirelength, Congestion and Power for FPGA Using Multi-Task Learning
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 3, Article No.: 57, Pages 1–19. https://doi.org/10.1145/3656170
To speed up the design closure and improve the QoR of FPGA, supervised single-task machine learning techniques have been used to predict individual design metrics based on placement results. However, the design objective is to achieve optimal performance ...
- research-article, May 2024
SEDONUT: A Single Event Double Node Upset Tolerant SRAM for Terrestrial Applications
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 3, Article No.: 52, Pages 1–13. https://doi.org/10.1145/3651985
Radiation and its effect on neighboring nodes are critical not only for space applications but also for terrestrial applications at modern lower-technology nodes. This may cause static random-access memory (SRAM) failures due to single- and multi-node ...
- research-article, April 2024
Comparative Analysis of Dynamic Power Consumption of Parallel Prefix Adder
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 3, Article No.: 49, Pages 1–22. https://doi.org/10.1145/3651984
The Newcomb-Benford law, also known as Benford's law, is the law of anomalous numbers stating that in many real-life numerical datasets, including physical and statistical ones, numbers have a small initial digit. Numbers irregularity observed in nature ...
- research-article, March 2024
D3PBO: Dynamic Domain Decomposition-based Parallel Bayesian Optimization for Large-scale Analog Circuit Sizing
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 3, Article No.: 44, Pages 1–25. https://doi.org/10.1145/3643811
Bayesian optimization (BO) is an efficient global optimization method for expensive black-box functions, but the expansion for high-dimensional problems and large sample budgets still remains a severe challenge. In order to extend BO for large-scale ...
- research-article, February 2024
Scalable and Accelerated Self-healing Control Circuit Using Evolvable Hardware
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 2, Article No.: 31, Pages 1–29. https://doi.org/10.1145/3634682
Controllers are mission-critical components of any electronic design. By sending control signals, they decide which and when other data path elements must operate. Faults, especially Single Event Upset (SEU) occurrence in these components, can lead to ...
- survey, January 2024
A Survey on Approximate Multiplier Designs for Energy Efficiency: From Algorithms to Circuits
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 1, Article No.: 23, Pages 1–37. https://doi.org/10.1145/3610291
Given the stringent requirements of energy efficiency for Internet-of-Things edge devices, approximate multipliers, as a basic component of many processors and accelerators, have been constantly proposed and studied for decades, especially in error-...
- research-article, December 2023
Flip: Data-centric Edge CGRA Accelerator
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 1, Article No.: 22, Pages 1–25. https://doi.org/10.1145/3631118
Coarse-Grained Reconfigurable Arrays (CGRA) are promising edge accelerators due to the outstanding balance in flexibility, performance, and energy efficiency. Classic CGRAs statically map compute operations onto the processing elements (PE) and route the ...
- research-article, December 2023
NeuroCool: Dynamic Thermal Management of 3D DRAM for Deep Neural Networks through Customized Prefetching
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 1, Article No.: 19, Pages 1–35. https://doi.org/10.1145/3630012
Deep neural network (DNN) implementations are typically characterized by huge datasets and concurrent computation, resulting in a demand for high memory bandwidth due to intensive data movement between processors and off-chip memory. Performing DNN ...
- research-article, December 2023
Construction of All Multilayer Monolithic RSMTs and Its Application to Monolithic 3D IC Routing
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 29, Issue 1, Article No.: 17, Pages 1–28. https://doi.org/10.1145/3626958
Monolithic three-dimensional (M3D) integration allows ultra-thin silicon tier stacking in a single package. The high-density stacking is acquiring interest and is becoming more popular for smaller footprint areas, shorter wirelength, higher performance, ...