General-purpose computing devices allow us to (1) customize computation after fabrication and (2) conserve area by reusing expensive active circuitry for different functions in time. We define RP-space, a restricted domain of the general-purpose architectural space focussed on reconfigurable computing architectures. Two dominant features differentiate reconfigurable from special-purpose architectures and account for most of the area overhead associated with RP devices: (1) instructions which tell the device how to behave, and (2) flexible interconnect which supports task dependent dataflow between operations. We can characterize RP-space by the allocation and structure of these resources and compare the efficiencies of architectural points across broad application characteristics. Conventional FPGAs fall at one extreme end of this space and their efficiency ranges over two orders of magnitude across the space of application characteristics. Understanding RP-space and its consequences allows us to pick the best architecture for a task and to search for more robust design points in the space. Our DPGA, a fine- grained computing device which adds small, on-chip instruction memories to FPGAs is one such design point. For typical logic applications and finite- state machines, a DPGA can implement tasks in one-third the area of a traditional FPGA. TSFPGA, a variant of the DPGA which focuses on heavily time-switched interconnect, achieves circuit densities close to the DPGA, while reducing typical physical mapping times from hours to seconds. Rigid, fabrication-time organization of instruction resources significantly narrows the range of efficiency for conventional architectures. To avoid this performance brittleness, we developed MATRIX, the first architecture to defer the binding of instruction resources until run-time, allowing the application to organize resources according to its needs. Our focus MATRIX design point is based on an array of 8-bit ALU and register-file building blocks interconnected via a byte-wide network. With today''s silicon, a single chip MATRIX array can deliver over 10 Gop/s (8-bit ops). On sample image processing tasks, we show that MATRIX yields 10-20x the computational density of conventional processors. Understanding the cost structure of RP-space helps us identify these intermediate architectural points and may provide useful insight more broadly in guiding our continual search for robust and efficient general-purpose computing structures.
Cited By
- Przybylski A, Haque K and Beckett P The Bel array Proceedings of the Australasian Computer Science Week Multiconference, (1-4)
- Richardson J, George A, Cheng K and Lam H (2016). Analysis of Fixed, Reconfigurable, and Hybrid Devices with Computational, Memory, I/O, & Realizable-Utilization Metrics, ACM Transactions on Reconfigurable Technology and Systems, 10:1, (1-21), Online publication date: 28-Dec-2016.
- Liu H, Rajavel S and Akoglu A (2013). Integration of Net-Length Factor with Timing- and Routability-Driven Clustering Algorithms, ACM Transactions on Reconfigurable Technology and Systems, 6:3, (1-21), Online publication date: 1-Oct-2013.
- Takano S (2012). Design and analysis of adaptive processor, ACM Transactions on Reconfigurable Technology and Systems, 5:1, (1-34), Online publication date: 1-Mar-2012.
- Richfield S Dealing with the "itanium effect" (abstract only) Proceedings of the 19th ACM/SIGDA international symposium on Field programmable gate arrays, (277-277)
- Cardoso J, Diniz P and Weinhardt M (2010). Compiling for reconfigurable computing, ACM Computing Surveys, 42:4, (1-65), Online publication date: 1-Jun-2010.
- Williams J, Massie C, George A, Richardson J, Gosrani K and Lam H (2010). Characterization of Fixed and Reconfigurable Multi-Core Devices for Application Acceleration, ACM Transactions on Reconfigurable Technology and Systems, 3:4, (1-29), Online publication date: 1-Nov-2010.
- Beckett P (2009). Power scalability in a mesh-connected reconfigurable architecture, ACM Transactions on Embedded Computing Systems, 9:2, (1-28), Online publication date: 1-Oct-2009.
- Chu A, Miller S and Sima M Reconfigurable solutions for very-long arithmetic with applications in cryptography Proceedings of the 18th ACM Great Lakes symposium on VLSI, (59-64)
- Paul S and Bhunia S Reconfigurable computing using content addressable memory for improved performance and resource usage Proceedings of the 45th annual Design Automation Conference, (786-791)
- Matsuyama K, Amagasaki M, Nakayama H, Yamaguchi R, Iida M and Sueyoshi T Evaluating variable-grain logic cells using heterogeneous technology mapping Proceedings of the 3rd international conference on Reconfigurable computing: architectures, tools and applications, (142-154)
- Hauck S and DeHon A (2007). Reconfigurable Computing, 10.5555/1564780, Online publication date: 2-Nov-2007.
- Brandolese C, Fornaciari W, Pomante L, Salice F and Sciuto D (2006). Affinity-Driven System Design Exploration for Heterogeneous Multiprocessor SoC, IEEE Transactions on Computers, 55:5, (508-519), Online publication date: 1-May-2006.
- DeHon A Design of programmable interconnect for sublithographic programmable logic arrays Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-programmable gate arrays, (127-137)
- Tomono M, Nakanishi M, Watanabe K and Yamashita S Event-oriented computing with reconfigurable platform Proceedings of the 2005 Asia and South Pacific Design Automation Conference, (1248-1251)
- Ahmadinia A, Bobda C, Koch D, Majer M and Teich J Task scheduling for heterogeneous reconfigurable computers Proceedings of the 17th symposium on Integrated circuits and system design, (22-27)
- DeHon A and Wilson M Nanowire-based sublithographic programmable logic arrays Proceedings of the 2004 ACM/SIGDA 12th international symposium on Field programmable gate arrays, (123-132)
- Siemers C and Winterstein V (2003). The Universal Configurable Block/Machine—An Approach for a Configurable SoC-Architecture, The Journal of Supercomputing, 26:3, (309-331), Online publication date: 1-Nov-2003.
- Swahn B and Hassoun S Hardware Scheduling for Dynamic Adaptability using External Profiling and Hardware Threading Proceedings of the 2003 IEEE/ACM international conference on Computer-aided design
- Leijten-Nowak K and van Meerbergen J An FPGA architecture with enhanced datapath functionality Proceedings of the 2003 ACM/SIGDA eleventh international symposium on Field programmable gate arrays, (195-204)
- Akoglu A, Dasu A, Sudarsanam A, Srinivasan M and Panchanathan S Pattern Recognition Tool to Detect Reconfigurable Patterns in MPEG4 Video Processing Proceedings of the 16th International Parallel and Distributed Processing Symposium
- Sciuto D, Salice F, Pomante L and Fornaciari W Metrics for design space exploration of heterogeneous multiprocessor embedded systems Proceedings of the tenth international symposium on Hardware/software codesign, (55-60)
- Hartenstein R A decade of reconfigurable computing Proceedings of the conference on Design, automation and test in Europe, (642-649)
- Mitsuyama Y, Andales Z, Onoye T and Shirakawa I A dynamically reconfigurable hardware-based cipher chip Proceedings of the 2001 Asia and South Pacific Design Automation Conference, (11-12)
- DeHon A (2000). The Density Advantage of Configurable Computing, Computer, 33:4, (41-49), Online publication date: 1-Apr-2000.
- DeHon A Balancing interconnect and computation in a reconfigurable computing array (or, why you don't really want 100% LUT utilization) Proceedings of the 1999 ACM/SIGDA seventh international symposium on Field programmable gate arrays, (69-78)
- Marshall A, Stansfield T, Kostarnov I, Vuillemin J and Hutchings B A reconfigurable arithmetic array for multimedia applications Proceedings of the 1999 ACM/SIGDA seventh international symposium on Field programmable gate arrays, (135-143)
- DeHon A and Wawrzynek J Reconfigurable computing Proceedings of the 36th annual ACM/IEEE Design Automation Conference, (610-615)
- Moreno J, Cabestany J, Madrenas J, Cantó E, Faura J and Insenser J Approaching Evolvable Hardware to Reality Proceedings of the 7th International Conference on Microelectronics for Neural, Fuzzy and Bio-Inspired Systems
- Sanchez E, Haenni J, Beuchat J, Stauffer A, Perez-Uribe A and Sipper M (1999). Static and Dynamic Configurable Systems, IEEE Transactions on Computers, 48:6, (556-564), Online publication date: 1-Jun-1999.
Recommendations
Pipeline Reconfigurable DSP for Dynamically Reconfigurable Architectures
Dynamically reconfigurable architectures, such as NATURE, achieve high logic density and low reconfiguration latency compared to traditional field-programmable gate arrays. Unlike fine-grained NATURE, reconfigurable DSP block incorporated NATURE ...
Implementation of FFT on General-Purpose Architectures for FPGA
This paper describes two general-purpose architectures targeted to Field Programmable Gate Array FPGA implementation. The first architecture is based on the coupling of a coarse-grain reconfigurable array with a general-purpose processor core. The ...