Volume 16, Issue 2, June 2023
Editor:
  • Deming Chen
Publisher:
  • Association for Computing Machinery, New York, NY, United States
ISSN: 1936-7406
EISSN: 1936-7414
research-article
Open Access
Efficient Compilation and Mapping of Fixed Function Combinational Logic onto Digital Signal Processors Targeting Neural Network Inference and Utilizing High-level Synthesis

Recent efforts for improving the performance of neural network (NN) accelerators that meet today’s application requirements have given rise to a new trend of logic-based NN inference relying on fixed function combinational logic. Mapping such large ...

research-article
FPGA Acceleration of Probabilistic Sentential Decision Diagrams with High-level Synthesis

Probabilistic Sentential Decision Diagrams (PSDDs) provide efficient methods for modeling and reasoning with probability distributions in the presence of massive logical constraints. PSDDs can also be synthesized from graphical models such as Bayesian ...

research-article
Open Access
Hardware-accelerated Real-time Drift-awareness for Robust Deep Learning on Wireless RF Data

Proactive and intelligent management of network resource utilization (RU) using deep learning (DL) can significantly improve the efficiency and performance of the next generation of wireless networks. However, variations in wireless RU are often affected ...

research-article
Open Access
A Survey on FPGA Cybersecurity Design Strategies

This article presents a critical literature review on the security aspects of field-programmable gate array (FPGA) devices. FPGA devices present unique challenges to cybersecurity through their reconfigurable nature. The article also pays special ...

research-article
Open Access
Automatic Creation of High-bandwidth Memory Architectures from Domain-specific Languages: The Case of Computational Fluid Dynamics

Numerical simulations can help solve complex problems. Most of these algorithms are massively parallel and thus good candidates for FPGA acceleration thanks to spatial parallelism. Modern FPGA devices can leverage high-bandwidth memory technologies, but ...

research-article
Hardware Optimizations of Fruit-80 Stream Cipher: Smaller than Grain

Fruit-80, which emerged as an ultra-lightweight stream cipher with 80-bit secret key, is oriented toward resource-constrained devices in the Internet of Things. In this article, we propose area and speed optimization architectures of Fruit-80 on FPGAs. ...

research-article
Open Access
FlexCNN: An End-to-end Framework for Composing CNN Accelerators on FPGA

With reduced data reuse and parallelism, recent convolutional neural networks (CNNs) create new challenges for FPGA acceleration. Systolic arrays (SAs) are efficient, scalable architectures for convolutional layers, but without proper optimizations, their ...

research-article
Open Access
Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-switched Inter-FPGA Networks

While FPGA accelerator boards and their respective high-level design tools are maturing, there is still a lack of multi-FPGA applications, libraries, and not least, benchmarks and reference implementations towards sustained HPC usage of these devices. As ...

research-article
Open Access
VCSN: Virtual Circuit-Switching Network for Flexible and Simple-to-Operate Communication in HPC FPGA Cluster

FPGA clusters promise to play a critical role in high-performance computing (HPC) systems in the near future due to their flexibility and high power efficiency. The operation of large-scale general-purpose FPGA clusters on which multiple users run diverse ...

research-article
Improving Energy Efficiency of CGRAs with Low-Overhead Fine-Grained Power Domains

To effectively minimize static power for a wide range of applications, power domains for coarse-grained reconfigurable array (CGRA) architectures need to be more fine-grained than those found in a typical application-specific integrated circuit. However, ...

research-article
Adaptive Selection and Clustering of Partial Reconfiguration Modules for Modern FPGA Design Flow

Dynamic Partial Reconfiguration (DPR) on FPGAs has attracted significant research interest in recent years since it provides benefits such as reduced area and flexible functionality. However, due to the lack of supporting synthesis tools in the current ...

research-article
SASA: A Scalable and Automatic Stencil Acceleration Framework for Optimized Hybrid Spatial and Temporal Parallelism on HBM-based FPGAs

Stencil computation is one of the fundamental computing patterns in many application domains such as scientific computing and image processing. While there are promising studies that accelerate stencils on FPGAs, there lacks an automated acceleration ...

research-article
Deterministic Approach for Range-enhanced Reconfigurable Packet Classification Engine

Reconfigurable hardware is a promising technology for implementing firewalls, routing mechanisms, and new protocols for evolving high-performance network systems. This work presents a novel deterministic approach for a Range-enhanced Reconfigurable Packet ...

SECTION: Special Issue: FPT 2021
introduction
Open Access
Introduction to the Special Issue on FPT 2021

research-article
Toward Software-like Debugging for FPGAs via Checkpointing and Transaction-based Co-Simulation

Checkpoint-based debugging flows have recently been developed that allow the user to move the design state back and forth between an FPGA and a simulator. They provide a software-like debugging experience by combining the speed of hardware execution and ...

research-article
QiCells: A Modular RFSoC-based Approach to Interface Superconducting Quantum Bits

Quantum computers will be a revolutionary extension of the heterogeneous computing world. They consist of many quantum bits (qubits) and require a careful design of the interface between the classical computer architecture and the quantum processor. For ...

research-article
Public Access
Algorithm-hardware Co-optimization for Energy-efficient Drone Detection on Resource-constrained FPGA

Convolutional neural network (CNN)-based object detection has achieved very high accuracy; e.g., single-shot multi-box detectors (SSDs) can efficiently detect and localize various objects in an input image. However, they require a high amount of ...
