skip to main content
10.1145/3461648.3463851acmconferencesArticle/Chapter ViewAbstractPublication PagescpsweekConference Proceedingsconference-collections
research-article

Data-flow-sensitive fault-space pruning for the injection of transient hardware faults

Published: 22 June 2021 Publication History

Abstract

In the domain of safety-critical systems, fault injection campaigns on ISA-level have become a widespread approach to systematically assess the resilience of a system with respect to transient hardware faults. However, experimentally injecting all possible faults to achieve full fault-space coverage is infeasible in practice. Hence, pruning techniques, such as def/use pruning are commonly applied to reduce the campaign size by grouping injections that surely provoke the same erroneous behavior. We describe data-flow pruning, a new data-flow sensitive fault-space pruning method that extends on def/use-pruning by also considering the instructions’ semantics when deriving fault-equivalence sets. By tracking the information flow for each bit individually across the respective instructions and considering their fault-masking capability, data-flow pruning (DFP) has to plan fewer pilot injections as it derives larger fault-equivalence sets. Like def/use pruning, DFP is precise and complete and it can be used as a direct replacement/alternative in existing software-based fault-injection tools. Our prototypical implementation so far considers local fault equivalence for five types of instructions. In our experimental evaluation, this already reduces the number of necessary injections by up to 18 percent compared to def/use pruning.

References

[1]
J. Aidemark, P. Folkesson, and J. Karlsson. 2001. Path-based error coverage prediction. In Proceedings Seventh International On-Line Testing Workshop. 14–20. https://doi.org/10.1109/OLT.2001.937811
[2]
Jean Arlat, Martine Aguera, Louis Amat, Yves Crouzet, Jean-Charles Fabre, Jean-Claude Laprie, Eliane Martins, and David Powell. 1990. Fault Injection for Dependability Validation: A Methodology and Some Applications. IEEE Transactions on Software Engineering, 16, 2 (1990), Feb., 166–182. issn:0098-5589 https://doi.org/10.1109/32.44380
[3]
Alasdair Armstrong, Thomas Bauereiss, Brian Campbell, Alastair Reid, Kathryn E. Gray, Robert M. Norton, Prashanth Mundkur, Mark Wassell, Jon French, Christopher Pulte, Shaked Flur, Ian Stark, Neel Krishnaswami, and Peter Sewell. 2019. ISA Semantics for ARMv8-A, RISC-V, and CHERI-MIPS. In Proc. 46th ACM SIGPLAN Symposium on Principles of Programming Languages. https://doi.org/10.1145/3290384 Proc. ACM Program. Lang. 3, POPL, Article 71.
[4]
Ghazanfar Asadi and Mehdi Baradaran Tahoori. 2005. An analytical approach for soft error rate estimation in digital circuits. In Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on. 2991–2994. https://doi.org/10.1109/ISCAS.2005.1465256
[5]
Raul Barbosa, Jonny Vinter, Peter Folkesson, and Johan Karlsson. 2005. Assembly-Level Pre-injection Analysis for Improving Fault Injection Efficiency. In Dependable Computing - EDCC 5. Springer Berlin Heidelberg, Berlin, Heidelberg. 246–262. isbn:978-3-540-32019-7 https://doi.org/10.1007/11408901_19
[6]
Christian Bartsch, Carlos Villarraga, Dominik Stoffel, and Wolfgang Kunz. 2017. A HW/SW Cross-Layer Approach for Determining Application-Redundant Hardware Faults in Embedded Systems. Journal of Electronic Testing, 33, 1 (2017), 02, 77–92. issn:0923-8174, 1573-0727 https://doi.org/10.1007/s10836-017-5643-3
[7]
Robert C Baumann. 2005. Radiation-induced soft errors in advanced semiconductor technologies. IEEE Transactions on Device and Materials Reliability, 5, 3 (2005), 305–316. https://doi.org/10.1109/TDMR.2005.853449
[8]
Alfredo Benso and Paolo Ernesto Prinetto. 2003. Fault injection techniques and tools for embedded systems reliability evaluation. Kluwer Academic Publishers, Boston, Dordrecht, London. isbn:1-4020-7589-8
[9]
L. Berrojo, I. Gonzalez, F. Corno, M. S. Reorda, G. Squillero, L. Entrena, and C. Lopez. 2002. New techniques for speeding-up fault-injection campaigns. In Design, Automation & Test in Europe Conference & Exhibition 2002 (DATE ’02). IEEE Computer Society Press, Washington, DC, USA. 847–852. issn:1530–1591 https://doi.org/10.1109/DATE.2002.998398
[10]
Hyungmin Cho, S. Mirkhani, Chen-Yong Cher, J.A. Abraham, and S. Mitra. 2013. Quantitative evaluation of soft error injection techniques for robust system design. In Proceedings of the 50th annual Design Automation Conference. 1–10. issn:0738-100X https://doi.org/10.1145/2463209.2488859
[11]
C. Constantinescu. 2003. Trends and challenges in VLSI circuit reliability. Micro, IEEE, 23, 4 (2003), July, 14–19. issn:0272-1732 https://doi.org/10.1109/MM.2003.1225959
[12]
Björn Döbel, Horst Schirmeier, and Michael Engel. 2013. Investigating the Limitations of PVF for Realistic Program Vulnerability Assessment. In Proceedings of the 5th HiPEAC Workshop on Design for Reliability (DFR ’13). Berlin, Germany.
[13]
Mojtaba Ebrahimi, Mohammad Hadi Moshrefpour, Mohammad Saber Golanbari, and Mehdi B Tahoori. 2016. Fault injection acceleration by simultaneous injection of non-interacting faults. In Proceedings of the 53rd Annual Design Automation Conference. 25. https://doi.org/10.1145/2897937.2898023
[14]
Mojtaba Ebrahimi, Nour Sayed, Maryam Rashvand, and Mehdi B Tahoori. 2015. Fault injection acceleration by architectural importance sampling. In Hardware/Software Codesign and System Synthesis (CODES+ ISSS), 2015 International Conference on. 212–219. https://doi.org/10.1109/CODESISSS.2015.7331384
[15]
B. Fang, Q. Lu, K. Pattabiraman, M. Ripeanu, and S. Gurumurthi. 2016. ePVF: An Enhanced Program Vulnerability Factor Methodology for Cross-Layer Resilience Analysis. In 2016 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). 168–179. issn:2158-3927 https://doi.org/10.1109/DSN.2016.24
[16]
Johannes Grinschgl, Armin Krieg, Christian Steger, Reinhold Weiss, Holger Bock, and Josef Haid. 2012. Efficient fault emulation using automatic pre-injection memory access analysis. In SOC Conference (SOCC), 2012 IEEE International. 277–282. https://doi.org/10.1109/SOCC.2012.6398361
[17]
Ulf Gunneflo, Johan Karlsson, and Jan Torin. 1989. Evaluation of Error Detection Schemes Using Fault Injection by Heavy-ion Radiation. In Proceedings of the 19th International Symposium on Fault-Tolerant Computing (FTCS-19). IEEE Computer Society Press, 340–347. https://doi.org/10.1109/FTCS.1989.105590
[18]
M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge, and R. B. Brown. 2001. MiBench: A free, commercially representative embedded benchmark suite. In Proceedings of the Fourth Annual IEEE International Workshop on Workload Characterization. WWC-4 (Cat. No.01EX538). 3–14. https://doi.org/10.1109/WWC.2001.990739
[19]
Jens Guthoff and Volkmar Sieh. 1995. Combining software-implemented and simulation-based fault injection into a single fault injection method. In Proceedings of the 25rd International Symposium on Fault-Tolerant Computing (FTCS-25). IEEE Computer Society Press, 196–206. https://doi.org/10.1109/FTCS.1995.466978
[20]
Siva Kumar Sastry Hari, Sarita V. Adve, Helia Naeimi, and Pradeep Ramachandran. 2012. Relyzer: Exploiting Application-Level Fault Equivalence to Analyze Application Resiliency to Transient Faults. In Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS ’12). ACM Press, New York, NY, USA. isbn:978-1-4503-0759-8 https://doi.org/10.1145/2150976.2150990
[21]
Siva Kumar Sastry Hari, Sarita V Adve, Helia Naeimi, and Pradeep Ramachandran. 2012. Relyzer: Exploiting application-level fault equivalence to analyze application resiliency to transient faults. In ACM SIGPLAN Notices. 47, 123–134. https://doi.org/10.1145/2189750.2150990
[22]
Siva Kumar Sastry Hari, Sarita V Adve, Helia Naeimi, and Pradeep Ramachandran. 2013. Relyzer: Application resiliency analyzer for transient faults. IEEE Micro, 33, 3 (2013), 58–66. https://doi.org/10.1109/MM.2013.30
[23]
Martin Hiller, Arshad Jhumka, and Neeraj Suri. 2002. PROPANE: An Environment for Examining the Propagation of Errors in Software. In Proceedings of the 2002 ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA ’02). ACM, New York, NY, USA. 81–85. isbn:1-58113-562-9 https://doi.org/10.1145/566172.566184
[24]
Mei-Chen Hsueh, Timothy K. Tsai, and Ravishankar K. Iyer. 1997. Fault Injection Techniques and Tools. IEEE Computer, 30, 4 (1997), April, 75–82. issn:0018-9162 https://doi.org/10.1109/2.585157
[25]
IEC 61508-3. 1998. IEC 61508-3: - Functional safety of electrical/electronic/programmable electronic safety-related systems – Part 3: Software requirements. International Electrotechnical Commission, Geneva, Switzerland.
[26]
ISO 26262-6. 2018. ISO 26262-6:2018: Road vehicles – Functional safety – Part 6: Product development at the software level. International Organization for Standardization, Geneva, Switzerland.
[27]
ISO 26262-9. 2018. ISO 26262-9:2018: Road vehicles – Functional safety – Part 9: Automotive Safety Integrity Level (ASIL)-oriented and safety-oriented analyses. International Organization for Standardization, Geneva, Switzerland.
[28]
R. Leveugle, A. Calvez, P. Maistri, and P. Vanhauwaert. 2009. Statistical Fault Injection: Quantified Error and Confidence. In Proceedings of the Conference on Design, Automation and Test in Europe (DATE ’09). European Design and Automation Association, 3001 Leuven, Belgium, Belgium. 502–506. isbn:978-3-9810801-5-5 https://doi.org/10.1109/DATE.2009.5090716
[29]
G. Li, K. Pattabiraman, S. K. S. Hari, M. Sullivan, and T. Tsai. 2018. Modeling Soft-Error Propagation in Programs. In 2018 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). 27–38. issn:2158-3927 https://doi.org/10.1109/DSN.2018.00016
[30]
Jianli Li and Qingping Tan. 2013. SmartInjector: Exploiting Intelligent Fault Injection for SDC Rate Analysis. In Proceedings of the International Conference on Defect and Fault Tolerance in VLSI and Nanotechnology Systems(DFT ’13). IEEE Computer Society Press, 236–242. https://doi.org/10.1109/DFT.2013.6653612
[31]
Qining Lu, M. Farahani, Jiesheng Wei, A. Thomas, and K. Pattabiraman. 2015. LLFI: An Intermediate Code-Level Fault Injection Tool for Hardware Faults. In Software Quality, Reliability and Security (QRS), 2015 IEEE International Conference on. 11–16. https://doi.org/10.1109/QRS.2015.13
[32]
Henrique Madeira, Mário Rela, Francisco Moreira, and João Gabriel Silva. 1994. RIFLE: A general purpose pin-level fault injector. In Proceedings of the 1st European Dependable Computing Conference (EDCC ’94), Klaus Echtle, Dieter Hammer, and David Powell (Eds.). Springer-Verlag, 197–216. isbn:978-3-540-58426-1 https://doi.org/10.1007/3-540-58426-9_132
[33]
Shubu Mukherjee. 2008. Architecture Design for Soft Errors. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA. isbn:978-0-12-369529-1
[34]
Bin Nie, Lishan Yang, Adwait Jog, and Evgenia Smirni. 2018. Fault site pruning for practical reliability analysis of GPGPU applications. In 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). 749–761. https://doi.org/10.1109/MICRO.2018.00066
[35]
Oskar Pusz, Daniel Kiechle, Christian Dietrich, and Daniel Lohmann. 2019. Program-Structure–Guided Approximation of Large Fault Spaces. In 2019 24th Pacific Rim International Symposium on Dependable Computing (PRDC’19). IEEE Computer Society Press, Washington, DC, USA. issn:1555-094X https://doi.org/10.1109/PRDC47002.2019.00044
[36]
P. Ramachandran, P. Kudva, J. Kellington, J. Schumann, and P. Sanda. 2008. Statistical Fault Injection. In 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN). 122–127. issn:1530-0889 https://doi.org/10.1109/DSN.2008.4630080
[37]
Behrooz Sangchoolie, Roger Johansson, and Johan Karlsson. 2017. Light-Weight Techniques for Improving the Controllability and Efficiency of ISA-Level Fault Injection Tools. In Dependable Computing (PRDC), 2017 IEEE 22nd Pacific Rim International Symposium on. 68–77. https://doi.org/10.1109/PRDC.2017.18
[38]
Thiago Santini, Christoph Borchert, Christian Dietrich, Horst Schirmeier, Martin Hoffmann, Olaf Spinczyk, Daniel Lohmann, Flávio Rech Wagner, and Paolo Rech. 2017. Effectiveness of Software-Based Hardening for Radiation-Induced Soft Errors in Real-Time Operating Systems. In Proceedings of the 2017 Conference on Architecture of Computing Systems (ARCS ’17). Springer-Verlag, Heidelberg, Germany. https://doi.org/10.1007/978-3-319-54999-6_1
[39]
Horst Schirmeier, Christoph Borchert, and Olaf Spinczyk. 2014. Rapid Fault-Space Exploration by Evolutionary Pruning. In International Conference on Computer Safety, Reliability, and Security, Andrea Bondavalli and Felicita Di Giandomenico (Eds.). Springer International Publishing, Cham. 17–32. isbn:978-3-319-10506-2
[40]
Horst Schirmeier, Christoph Borchert, and Olaf Spinczyk. 2015. Avoiding Pitfalls in Fault-Injection Based Comparison of Program Susceptibility to Soft Errors. In Proceedings of the 45th International Conference on Dependable Systems and Networks (DSN ’15). IEEE Computer Society Press, Washington, DC, USA. 12 pages. https://doi.org/10.1109/DSN.2015.44
[41]
Horst Schirmeier, Martin Hoffmann, Christian Dietrich, Michael Lenz, Daniel Lohmann, and Olaf Spinczyk. 2015. FAIL*: An Open and Versatile Fault-Injection Framework for the Assessment of Software-Implemented Hardware Fault Tolerance. In Proceedings of the 11th European Dependable Computing Conference (EDCC ’15), Pierre Sens (Ed.). 245–255. https://doi.org/10.1109/EDCC.2015.28
[42]
Bernard Schmidt, Carlos Villarraga, Thomas Fehmel, Jörg Bormann, Markus Wedler, Minh Nguyen, Dominik Stoffel, and Wolfgang Kunz. 2013. A New Formal Verification Approach for Hardware-Dependent Embedded System Software. IPSJ Transactions on System LSI Design Methodology, 6, 0 (2013), 135–145. issn:1882-6687 https://doi.org/10.2197/ipsjtsldm.6.135
[43]
Premkishore Shivakumar, Michael Kistler, Stephen W. Keckler, Doug Burger, and Lorenzo Alvisi. 2002. Modeling the Effect of Technology Trends on the Soft Error Rate of Combinational Logic. In Proceedings of the 32nd International Conference on Dependable Systems and Networks (DSN ’02). IEEE Computer Society Press, Washington, DC, USA. 389–398. https://doi.org/10.1109/DSN.2002.1028924
[44]
D Todd Smith, Barry W Johnson, Joseph A Profeta, and Daniele G Bozzolo. 1995. A method to determine equivalent fault classes for permanent and transient faults. In Reliability and Maintainability Symposium, 1995. Proceedings., Annual. 418–424. https://doi.org/10.1109/RAMS.1995.513278
[45]
V. Sridharan and D. R. Kaeli. 2009. Eliminating microarchitectural dependency from Architectural Vulnerability. In 2009 IEEE 15th International Symposium on High Performance Computer Architecture. 117–128. issn:1530-0897 https://doi.org/10.1109/HPCA.2009.4798243
[46]
R. Venkatagiri, K. Ahmed, A. Mahmoud, S. Misailovic, D. Marinov, C. W. Fletcher, and S. V. Adve. 2019. gem5-Approxilyzer: An Open-Source Tool for Application-Level Soft Error Analysis. In 2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). 214–221. https://doi.org/10.1109/DSN.2019.00033
[47]
R. Venkatagiri, A. Mahmoud, S. K. S. Hari, and S. V. Adve. 2016. Approxilyzer: Towards a systematic framework for instruction-level approximate computing and its application to hardware resiliency. In 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). 1–14. https://doi.org/10.1109/MICRO.2016.7783745

Cited By

View all

Index Terms

  1. Data-flow-sensitive fault-space pruning for the injection of transient hardware faults

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      LCTES 2021: Proceedings of the 22nd ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems
      June 2021
      162 pages
      ISBN:9781450384728
      DOI:10.1145/3461648
      • General Chair:
      • Jörg Henkel,
      • Program Chair:
      • Xu Liu
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 22 June 2021

      Permissions

      Request permissions for this article.

      Check for updates

      Badges

      Author Tags

      1. bit flip
      2. fault injection
      3. fault-space pruning
      4. functional correctness
      5. reliability
      6. single event upset

      Qualifiers

      • Research-article

      Conference

      LCTES '21
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 116 of 438 submissions, 26%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)33
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 03 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media