Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- posterFebruary 2023
HMLib: Efficient Data Transfer for HLS Using Host Memory
FPGA '23: Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate ArraysFebruary 2023, Page 50https://doi.org/10.1145/3543622.3573157Streaming applications compose an important portion of the workloads that FPGAs may accelerate but suffer from inefficient data movement. The inefficiency stems from copying data indirectly into the FPGA DRAM rather than directly into its on-chip memory, ...
- extended-abstractApril 2021
Can SYCL and OpenCL meet the challenges of functional safety?
IWOCL '21: Proceedings of the 9th International Workshop on OpenCLApril 2021, Article No.: 16, Page 1https://doi.org/10.1145/3456669.3456688Open standards are being looked at as an attractive alternative to proprietary solutions by the automotive domain to enable sensor fusion systems in cheap mass-market vehicles. Open standards specification for SYCL, OpenCL and Vulkan were not always ...
- short-paperJune 2020
Pedal to the Bare Metal: Road Traffic Simulation on FPGAs Using High-Level Synthesis
SIGSIM-PADS '20: Proceedings of the 2020 ACM SIGSIM Conference on Principles of Advanced Discrete SimulationJune 2020, Pages 117–121https://doi.org/10.1145/3384441.3395979The performance of Agent-based Traffic Simulations (ABTS) has been shown to benefit tremendously from offloading to accelerators such as GPUs. In the search for the most suitable hardware platform, reconfigurable hardware is a natural choice. Some ...
- posterFebruary 2020
FeCaffe: FPGA-enabled Caffe with OpenCL for Deep Learning Training and Inference on Intel Stratix 10
FPGA '20: Proceedings of the 2020 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2020, Page 314https://doi.org/10.1145/3373087.3375389Deep learning has becoming increasingly more popular in recent years, and there are many popular frameworks in the market accordingly, such as Caffe, TensorFlow and Pytorch. All these frameworks natively support CPUs and GPGPUs. However, FPGAs still ...
- posterFebruary 2020
Advanced Dataflow Programming using Actor Machines for High-Level Synthesis
FPGA '20: Proceedings of the 2020 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2020, Page 310https://doi.org/10.1145/3373087.3375330The use of parallelism has increased drastically in recent years. Parallel platforms come in many forms: multi-core processors, embedded hybrid solutions such as multi-processor system-on-chip with reconfigurable logic, and cloud datacenters with multi-...
-
- research-articleFebruary 2020
Boyi: A Systematic Framework for Automatically Deciding the Right Execution Model of OpenCL Applications on FPGAs
FPGA '20: Proceedings of the 2020 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2020, Pages 299–309https://doi.org/10.1145/3373087.3375313FPGA vendors provide OpenCL software development kits for easier programmability, with the goal of replacing the time-consuming and error-prone register-transfer level (RTL) programming. Many studies explore optimization methods (e.g., loop unrolling, ...
- research-articleFebruary 2020
Using OpenCL to Enable Software-like Development of an FPGA-Accelerated Biophotonic Cancer Treatment Simulator
FPGA '20: Proceedings of the 2020 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2020, Pages 86–96https://doi.org/10.1145/3373087.3375300The simulation of light propagation through tissues is important for medical applications, such as photodynamic therapy (PDT) for cancer treatment. To optimize PDT an inverse problem, which works backwards from a desired distribution of light to the ...
- research-articleSeptember 2019
Simulation of Random Network of Hodgkin and Huxley Neurons with Exponential Synaptic Conductances on an FPGA Platform
BCB '19: Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health InformaticsSeptember 2019, Pages 653–657https://doi.org/10.1145/3307339.3343460Field-programmable gate arrays (FPGAs) are becoming a promising choice as a heterogeneous computing component when floating-point optimized architectures are added to the current FPGAs. The maturing high-level synthesis tools offer a streamlined design ...
- research-articleMay 2019
Transitioning Spiking Neural Network Simulators to Heterogeneous Hardware
SIGSIM-PADS '19: Proceedings of the 2019 ACM SIGSIM Conference on Principles of Advanced Discrete SimulationMay 2019, Pages 115–126https://doi.org/10.1145/3316480.3322893Spiking neural networks (SNN) are among the most computationally intensive types of simulation models, with node counts on the order of up to 10^11. Currently, there is intensive research into hardware platforms suitable to support large-scale SNN ...
Analysis and Modeling of Collaborative Execution Strategies for Heterogeneous CPU-FPGA Architectures
- Sitao Huang,
- Li-Wen Chang,
- Izzat El Hajj,
- Simon Garcia de Gonzalo,
- Juan Gómez-Luna,
- Sai Rahul Chalamalasetti,
- Mohamed El-Hadedy,
- Dejan Milojicic,
- Onur Mutlu,
- Deming Chen,
- Wen-mei Hwu
ICPE '19: Proceedings of the 2019 ACM/SPEC International Conference on Performance EngineeringApril 2019, Pages 79–90https://doi.org/10.1145/3297663.3310305Heterogeneous CPU-FPGA systems are evolving towards tighter integration between CPUs and FPGAs for improved performance and energy efficiency. At the same time, programmability is also improving with High Level Synthesis tools (e.g., OpenCL Software ...
- posterFebruary 2019
Nuclear Reactor Simulations on OpenCL FPGA Platform
FPGA '19: Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2019, Page 304https://doi.org/10.1145/3289602.3293983Field-programmable gate arrays (FPGAs) are becoming a promising choice as a heterogeneous computing component for scientific computing when floating-point optimized architectures are added to the current FPGAs. The maturing high-level synthesis (HLS) ...
- posterFebruary 2019
Base64 Encoding on OpenCL FPGA Platform
FPGA '19: Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2019, Page 116https://doi.org/10.1145/3289602.3293932Base64 encoding has many applications on the Web. Previous studies are focused on improving the efficiency of Base64 encoding on central processing units (CPUs). As field-programmable gate arrays (FPGAs) are becoming promising heterogeneous computing ...
- research-articleFebruary 2019
AFFIX: Automatic Acceleration Framework for FPGA Implementation of OpenVX Vision Algorithms
FPGA '19: Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2019, Pages 252–261https://doi.org/10.1145/3289602.3293907Computer vision algorithms are computationally expensive and difficult to implement efficiently. Field Programmable Gate Arrays (FPGA)s offer a promising direction to reduce the computation cost by exploiting hardware parallelism. However, it is ...
- posterFebruary 2018
Evaluation of OpenCL Performance-oriented Optimizations for Streaming Kernels on the FPGA: (Abstract Only)
FPGA '18: Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2018, Page 287https://doi.org/10.1145/3174243.3174967The streaming applications efficiently and High-level synthesis (HLS) tools allow people without complex hardware design knowledge to evaluate an application on FPGAs, there is an opportunity and a need to understand where OpenCL and FPGA can play in ...
- posterFebruary 2018
Optimizations of Sequence Alignment on FPGA: A Case Study of Extended Sequence Alignment (Abstact Only)
FPGA '18: Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2018, Page 285https://doi.org/10.1145/3174243.3174958Detecting similarities between sequences is an important part of Bioinformatics. In this poster, we explore the use of high-level synthesis tool and a field-programmable gate array (FPGA) for optimizing a sequence alignment algorithm. We demonstrate the ...
- research-articleFebruary 2018
Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL
FPGA '18: Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2018, Pages 153–162https://doi.org/10.1145/3174243.3174248Recent developments in High Level Synthesis tools have attracted software programmers to accelerate their high-performance computing applications on FPGAs. Even though it has been shown that FPGAs can compete with GPUs in terms of performance for ...
- research-articleFebruary 2017
Improving the Performance of OpenCL-based FPGA Accelerator for Convolutional Neural Network
FPGA '17: Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysFebruary 2017, Pages 25–34https://doi.org/10.1145/3020078.3021698OpenCL FPGA has recently gained great popularity with emerging needs for workload acceleration such as Convolutional Neural Network (CNN), which is the most popular deep learning architecture in the domain of computer vision. While OpenCL enhances the ...
- posterJanuary 2017
POSTER: MAPA: An Automatic Memory Access Pattern Analyzer for GPU Applications
PPoPP '17: Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel ProgrammingJanuary 2017, Pages 443–444https://doi.org/10.1145/3018743.3019034Various existing optimization and memory consistency management techniques for GPU applications rely on memory access patterns of kernels. However, they suffer from poor practicality because they require explicit user interventions to extract kernel ...
Also Published in:
ACM SIGPLAN Notices: Volume 52 Issue 8, August 2017 - posterSeptember 2016
POSTER: hVISC: A Portable Abstraction for Heterogeneous Parallel Systems
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and CompilationSeptember 2016, Pages 443–445https://doi.org/10.1145/2967938.2976039Programming heterogeneous parallel systems can be extremely complex because a single system may include multiple different parallelism models, instruction sets, and memory hierarchies, and different systems use different combinations of these features. ...
- research-articleSeptember 2016
Online Scalability Characterization of Data-Parallel Programs on Many Cores
PACT '16: Proceedings of the 2016 International Conference on Parallel Architectures and CompilationSeptember 2016, Pages 191–205https://doi.org/10.1145/2967938.2967960We present an accurate online scalability prediction model for data-parallel programs on NUMA many-core systems. Memory contention is considered to be the major limiting factor of program scalability as data parallelism limits the amount of ...