skip to main content
10.1145/3195970.3195999acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article
Public Access

Automated accelerator generation and optimization with composable, parallel and pipeline architecture

Published: 24 June 2018 Publication History

Abstract

CPU-FPGA heterogeneous architectures feature flexible acceleration of many workloads to advance computational capabilities and energy efficiency in today's datacenters. This advantage, however, is often overshadowed by the poor programmability of FPGAs. Although recent advances in high-level synthesis (HLS) significantly improve the FPGA programmability, it still leaves programmers facing the challenge of identifying the optimal design configuration in a tremendous design space. In this paper we propose the composable, parallel and pipeline (CPP) microarchitecture as an accelerator design template to substantially reduce the design space. Also, by introducing the CPP analytical model to capture the performance-resource trade-offs, we achieve efficient, analytical-based design space exploration. Furthermore, we develop the AutoAccel framework to automate the entire accelerator generation process. Our experiments show that the AutoAccel-generated accelerators outperform their corresponding software implementations by an average of 72x for a broad class of computation kernels.

References

[1]
Amazon EC2 F1 Instance. https://aws.amazon.com/ec2/instance-types/f1/
[2]
Intel to Start Shipping Xeons With FPGAs in Early 2016. http://www.eweek.com/servers/intel-to-start-shipping-xeons-with-fpgas-in-early-2016
[3]
Merlin Compiler. http://www.falcon-computing.com/index.php/solutions/merlin-compiler
[4]
SDAccel Development Environment. http://www.xilinx.com/products/design-tools/software-zone/sdaccel.html.
[5]
Xeon+FPGA Platform for the Data Center. https://www.ece.cmu.edu/~calcm/carl/lib/exe/fetch.php?media=carl15-gupta.pdf
[6]
R. S. Bird. 2006. Improving Saddleback Search: A Lesson in Algorithm Design. In Mathematics of Program Construction. Springer.
[7]
J. Cong et al. 2016. Source-to-Source Optimization for HLS. In FPGAs for Software Programmers. Springer International Publishing.
[8]
J. Cong et al. 2014. Combining Computation and Communication Optimizations in System Synthesis for Streaming Applications. In FPGA.
[9]
J. Cong et al. 2011. High-Level Synthesis for FPGAs: From Prototyping to Deployment. TCAD.
[10]
J. Cong et al. 2017. Bandwidth optimization through on-chip memory restructuring for HLS. In DAC.
[11]
D. Koeplinger et al. 2016. Automatic Generation of Efficient Accelerators for Reconfigurable Hardware. In ISCA.
[12]
A. Madhavan et al. 2014. Race Logic: A Hardware Acceleration for Dynamic Programming Algorithms. In ISCA.
[13]
S. B. Needleman et al. 1970. A general method applicable to the search for similarities in the amino acid sequence of two proteins. JMB.
[14]
J. Ouyang et al. 2014. Sda: Software-defined accelerator for largescale dnn systems. In Hot Chips.
[15]
L. Page et al. 1999. The PageRank citation ranking: Bringing order to the web. Technical Report. Stanford InfoLab.
[16]
N. K. Pham et al. 2015. Exploiting Loop-array Dependencies to Accelerate the Design Space Exploration with High Level Synthesis. In DATE.
[17]
L.-N. Pouchet et al. 2013. Polyhedral-based Data Reuse Optimization for Configurable Computing. In FPGA.
[18]
A. Putnam et al. 2014. A reconfigurable fabric for accelerating large-scale data-center services. In ISCA.
[19]
B. Reagen et al. 2014. Machsuite: Benchmarks for accelerator design and customized architectures. In IISWC.
[20]
Z. Wang et al. 2016. A performance analysis framework for optimizing OpenCL applications on FPGAs. In HPCA.
[21]
P. Zhang et al. 2015. CMOST: A System-level FPGA Compilation Framework. In DAC.
[22]
J. Zhao et al. 2017. COMBA: A Comprehensive Model-Based Analysis Framework for High Level Synthesis of Real Applications. In ICCAD.
[23]
G. Zhong et al. 2016. Lin-Analyzer: A high-level performance analysis tool for FPGA-based accelerators. In DAC.
[24]
G. Zhong et al. 2017. Design Space Exploration of FPGA-based Accelerators with Multi-level Parallelism. In DATE.
[25]
H. R. Zohouri et al. 2016. Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs. In SC.

Cited By

View all
  1. Automated accelerator generation and optimization with composable, parallel and pipeline architecture

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    DAC '18: Proceedings of the 55th Annual Design Automation Conference
    June 2018
    1089 pages
    ISBN:9781450357005
    DOI:10.1145/3195970
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 June 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    DAC '18
    Sponsor:
    DAC '18: The 55th Annual Design Automation Conference 2018
    June 24 - 29, 2018
    California, San Francisco

    Acceptance Rates

    Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

    Upcoming Conference

    DAC '25
    62nd ACM/IEEE Design Automation Conference
    June 22 - 26, 2025
    San Francisco , CA , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)219
    • Downloads (Last 6 weeks)26
    Reflects downloads up to 13 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media