skip to main content
research-article

A high-level clustering algorithm targeting dual Vdd FPGAs

Published: 03 October 2008 Publication History

Abstract

Recent advanced power optimizations deployed in commercial FPGAs, laid out a roadmap towards FPGA devices that can be integrated into ultra low power systems. In this article, we present a high-level design tool to support the process of mapping an application onto a FPGA device with dual supply voltages. Our main contribution in this paper is an algorithm, which creates voltage scaling ready clusters by utilizing the timing slack available in the designs. We propose to first create clusters of CLBs within a given CLB-level netlist. This clustering algorithm intends to group chains of CLBs possessing similar amounts of timing slack along their critical path together. Once these clusters are identified, they are placed onto respective Vdd partitions on the device. We have evaluated different dual Vdd fabrics and the potential gain in power consumption is explored. When a subset of the logic blocks on the device can be driven by low Vdd levels (either with a dedicated low Vdd supply or with a programmable selection between low and high Vdd levels for these blocks) this affects placement and routing. As a result the maximum frequency of the designs may be affected. In order to evaluate the overall impact of creating voltage islands, we measured the Energy-Delay Product for our benchmark designs. We observed that the Energy-Delay product can be decreased by 26.9% when the placement of the designs into different voltage levels is guided by our clustering algorithm.

References

[1]
Akrout, C., Bialas, J., Canada, M., Cawthron, D., Corr, J., Davari, B., Floyd, R., Geissler, S., Goldblatt, R., Houle, R., Kartschoke, P., Kramer, D., McCormick, P., Rohrer, N., Salem, G., Schulz, R., Su, L., and Whitney, L. 1998. A 480 MHz RISC microprocessor in a 0.12 μm Leff CMOS technology with copper interconnects. IEEE Solid-State Circ. 33, 11, 1609--1616.
[2]
Betz, V., Rose, J., and Marquardt, A. 1999. Architecture and CAD for Deep-Submicron FPGAs. Kluwer Academic Publishers, Norwell, MA.
[3]
Boemo, E. I., Rivera, G. G.d., Lopez-Buedo, S., and Meneses, J. 1995. Some notes on power management on FPGA-based systems. Lecture Notes in Computer Science, Springer-Verlag, Berlin, 149--157.
[4]
Bozorgzadeh, E., Ghiasi, S., and Sarrafzadeh, M. 2004. Optimal integer delay budget assignment on directed acyclic graphs. IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst. 23, 8, 1184--1199.
[5]
Brasen, D. and Saucier, G. 1998. Using cone structures for circuit partitioning into FPGA packages. IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst. 17, 7, 592--600.
[6]
Chan, P. K., Schlag, M. D. F., and Zien, J. Y. 1995. Spectral-based multi-way FPGA partitioning. Proceedings of the International Symposium on Field Programmable Gate Arrays (FPGA).
[7]
Chang, D. and Marek-Sadowska, M. 1999. Partitioning sequential circuits on dynamically reconfigurable FPGAs. IEEE Trans. Comput. 48, 6, 565--578.
[8]
Chen, C., Hwang, T., and Liu, C.L. 1997. Low power FPGA design?A re-engineering approach. Proceedings of the Design Automation Conference (DAC).
[9]
Chen, C., Srivastava, A., and Sarrafzadeh, M. 2001. On gate level power optimization using dual-supply voltages. IEEE Trans. VLSI Syst. 9, 5, 616--629.
[10]
Chen, D., Cong, J., and Fan, Y. 2003. Low power high-level synthesis for FPGA architectures. Proceedings of the International Symposium on Low Power Electronic Design (ISLPED).
[11]
Chen, D., Cong, J., Li, F., and He, L. 2004. Low-power technology mapping for FPGA architectures with dual supply voltage. Proceedings of the International Symposium on Field-Programmable Gate Arrays (FPGA).
[12]
Cheng, L., Wong, P., Li, F., Lin, Y., and He, L. 2005. Device and architecture co-optimization for FPGA power reduction. Proceedings of the IEEE/ACM Design Automation Conference (DAC).
[13]
Gayasen, A., Lee, K., Vijaykrishnan, N., Kandemir, M., Irwin, M. J., and Tuan, T. 2004a. A dual-VDD low power FPGA architecture. Field Programmable Logic and its Applications (FPL).
[14]
Gayasen, A., Tsai, Y., Vijaykrishnan, N., Kandemir, M., Irwin, M. J., and Tuan, T. 2004b. Reducing leakage energy in FPGAs using region-constrained placement. Proceedings of the International Symposium on Field Programmable Gate Arrays (FPGA).
[15]
Govil, K., Chan, E., and Wasserman, H. 1995. Comparing algorithms for dynamic speed setting of a low power CPU. Proceedings of the International Conference on Mobile Computing and Networking.
[16]
Hamada, M., Takahashi, M., Arakida, H., Chiba, A., Terazawa, T., Ishikawa, T., Kanazawa, M., Igarashi, M., Usami, K., and Kuroda, T. 1998. A top-down low power design technique using clustered voltage scaling with variable supply-voltage scheme. Proceedings of the Custom Integrated Circuits Conference.
[17]
Hu, Y., Lin, Y., He, L., and Tuan, T. 2006. Simultaneous time slack budgeting and retiming for dual-Vdd FPGA power reduction. Proceedings of the IEEE/ACM Design Automation Conference (DAC).
[18]
Iyer, A. and Marculescu, D. 2002. Power efficiency of voltage scaling in multiple clock, multiple voltage cores. Proceedings of the International Conference on Computer Aided Design (ICCAD).
[19]
Kao, J. T. and Chandrakasan, A. P. 2000. Dual-threshold voltage techniques for low-power digital circuits. IEEE J. Solid-State Circ. 35, 7, 1009--1018.
[20]
Kuznar, R., Brglez, F., and Kozminski, K. 1993. Cost minimization of partitions into multiple devices. In Proceedings of the Design Automation Conference (DAC).
[21]
Lamoureux, J. and Wilton, S. J. E. 2003. On the interaction between power-aware FPGA CAD algorithms. Proceedings of the IEEE/ACM International Conference on Computer Aided Design (ICCAD).
[22]
Li, F., Lin, Y., and He, L. 2004a. FPGA power reduction using configurable dual-Vdd. In Proceedings of the IEEE/ACM Design Automation Conference (DAC).
[23]
Li, F., Lin, Y., and He, L. 2004b. Vdd programmability to reduce FPGA interconnect power. Proceedings of the IEEE/ACM International Conference on Computer Aided Design (ICCAD).
[24]
Li, F., Lin, Y., He, L., and Cong, J. 2004c. Low-power FPGA using predefined dual-Vdd/dual-Vt fabrics. Proceedings of the International Symposium on Field-Programmable Gate Arrays (FPGA).
[25]
Li, H., Katkoori, S., and Mak, W. 2004d. Power minimization algorithms for LUT-based FPGA technology mapping. ACM Trans. Des. Automa. Electron. Syst. 9, 1, 33--51.
[26]
Lin, Y. and He, L. 2006a. Dual-Vdd interconnect with chip-level time slack allocation for FPGA power reduction. IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst. 25, 10, 2023--2034.
[27]
Lin, Y., Hu, Y., He, L., and Raghunatan, V. 2006b. An efficient chip-level time slack allocation algorithm for Dual-Vdd FPGA power reduction. Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED).
[28]
Lin, Y., Li, F., and He, L. 2005. Routing track duplication with fine-grained power-gating for FPGA interconnect power reduction. Proceedings of the Conference on Asia South Pacific Design Automation (ASPDAC).
[29]
Liu, H. and Wong, D. F. 1999. Circuit partitioning for dynamically reconfigurable FPGAs. Proceedings of the International Symposium on Field Programmable Gate Arrays (FPGA).
[30]
Mondal, S. and Ogrenci Memik, S. 2005. A low power FPGA routing architecture. Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS).
[31]
Mukherjee, R. and Ogrenci Memik, S. 2004. Power-driven partitioning. In Proceedings of the Field Programmable Logic and its Applications (FPL).
[32]
Poon, K. K. W. and Wilton, S. J. E. 2005. A detailed power model for field-programmable gate arrays. ACM Trans. Des. Automa. Electron. Syst. 10, 2, 119--132.
[33]
Puri, R., Stok, L., Cohn, J., Kung, D., Pan, D., Sylvester, D., Srivastava, A., and Kulkarni, S. 2003. Pushing ASIC performance in power envelope. Proceedings of the IEEE/ACM Design Automation Conference (DAC).
[34]
Simunic, T., Benini, L., Acquaviva, A., Glynn, P. W., and Micheli, G. D. 2001. Dynamic voltage scaling and power management for portable systems. Proceedings of the IEEE/ACM Design Automation Conference (DAC).
[35]
Sirichotiyakul, S., Edwards T., Oh C., Zuo J., Dharchoudhury A., Panda R., and Blaauw, D. 1999. Standby power minimization through simultaneous threshold voltage selection and circuit sizing. Proceedings of the Design Automation Conference (DAC).
[36]
Usami, K. and Horowitz, M. 1995. Clustered voltage scaling techniques for low-power design. Proceedings of the International Symposium on Low Power Electronics and Design (ISLPED).
[37]
Usami, K. and Igarashi, M. 2000. Low-power design methodology and applications utilizing dual supply voltages. Proceedings of the Asia South Pacific Design Automation Conference.
[38]
Wang, Q. and Vrudhula, S. 2002. Algorithms for minimizing standby power in deep submicron, dual-Vt CMOS circuits. IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst. 21, 3, 306--318.
[39]
Wang, Z. H., Liu, E. C., Lai, J., and Wang, T. C. 2001. Power minimization in LUT-based FPGA technology mapping. Proceedings of the Asia South Pacific Design Automation Conference (ASPDAC).
[40]
Yeh, C., Chang, M., Chang, S., and Jone, W. 1999. Gate-level design exploiting dual supply voltages for power-driven applications. Proceedings of the IEEE/ACM Design Automation Conference (DAC).

Cited By

View all
  • (2018)Multi-Vdd Design for Content Addressable Memories (CAM): A Power-Delay Optimization AnalysisJournal of Low Power Electronics and Applications10.3390/jlpea80300258:3(25)Online publication date: 30-Jul-2018

Index Terms

  1. A high-level clustering algorithm targeting dual Vdd FPGAs

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Design Automation of Electronic Systems
    ACM Transactions on Design Automation of Electronic Systems  Volume 13, Issue 4
    September 2008
    328 pages
    ISSN:1084-4309
    EISSN:1557-7309
    DOI:10.1145/1391962
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Journal Family

    Publication History

    Published: 03 October 2008
    Accepted: 01 May 2008
    Revised: 01 July 2007
    Received: 01 May 2007
    Published in TODAES Volume 13, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Dynamic power
    2. clustering
    3. field programmable gate arrays
    4. partitioning
    5. placement
    6. voltage scaling

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 03 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Multi-Vdd Design for Content Addressable Memories (CAM): A Power-Delay Optimization AnalysisJournal of Low Power Electronics and Applications10.3390/jlpea80300258:3(25)Online publication date: 30-Jul-2018

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media