Details
Original language | English |
---|---|
Title of host publication | 2014 IEEE 15th International Symposium on High-Assurance Systems Engineering |
Pages | 33-40 |
Number of pages | 8 |
ISBN (electronic) | 978-1-4799-3466-9, 978-1-4799-3465-2 |
Publication status | Published - 6 Mar 2014 |
Externally published | Yes |
Event | 2014 IEEE 15th International Symposium on High-Assurance Systems Engineering, HASE 2014 - Miami, FL, United States Duration: 9 Jan 2014 → 11 Jan 2014 |
Abstract
Arithmetic error coding schemes (AN codes) are a well known and effective technique for soft error mitigation. Although coding theory being a rich area of mathematics, their implementation seems to be fairly easy. However, compliance with the theory can be lost easily while moving towards an actual implementation - finally jeopardizing the aspired fault-tolerance characteristics. In this paper, we present our experiences and lessons learned from implementing AN codes in the Cored dependable voter. We focus on the challenges and pitfalls in the transition from maths to machine code for a binary computer from a systems perspective. Our results show, that practical misconceptions (such as the use of prime numbers) and architecture-dependent implementation glitches occur on every stage of this transition. We identify typical pitfalls and describe practical measures to find and resolve them. Our measures eliminate all remaining SDCs in the Cored voter, which is validated by an extensive fault-injection campaign that covers 100 percent of the fault space for 1-bit and 2-bit errors.
Keywords
- AN code, Arithmetic error coding, Fault injection, Redundancy, Soft errors, Software-based fault tolerance
ASJC Scopus subject areas
- Computer Science(all)
- Software
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
2014 IEEE 15th International Symposium on High-Assurance Systems Engineering. 2014. p. 33-40.
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - A Practitioner’s Guide to Software-based Soft-Error Mitigation Using AN-Codes
AU - Hoffmann, Martin
AU - Ulbrich, Peter
AU - Dietrich, Christian
AU - Schirmeier, Horst
AU - Lohmann, Daniel
AU - Schroder-Preikschat, Wolfgang
PY - 2014/3/6
Y1 - 2014/3/6
N2 - Arithmetic error coding schemes (AN codes) are a well known and effective technique for soft error mitigation. Although coding theory being a rich area of mathematics, their implementation seems to be fairly easy. However, compliance with the theory can be lost easily while moving towards an actual implementation - finally jeopardizing the aspired fault-tolerance characteristics. In this paper, we present our experiences and lessons learned from implementing AN codes in the Cored dependable voter. We focus on the challenges and pitfalls in the transition from maths to machine code for a binary computer from a systems perspective. Our results show, that practical misconceptions (such as the use of prime numbers) and architecture-dependent implementation glitches occur on every stage of this transition. We identify typical pitfalls and describe practical measures to find and resolve them. Our measures eliminate all remaining SDCs in the Cored voter, which is validated by an extensive fault-injection campaign that covers 100 percent of the fault space for 1-bit and 2-bit errors.
AB - Arithmetic error coding schemes (AN codes) are a well known and effective technique for soft error mitigation. Although coding theory being a rich area of mathematics, their implementation seems to be fairly easy. However, compliance with the theory can be lost easily while moving towards an actual implementation - finally jeopardizing the aspired fault-tolerance characteristics. In this paper, we present our experiences and lessons learned from implementing AN codes in the Cored dependable voter. We focus on the challenges and pitfalls in the transition from maths to machine code for a binary computer from a systems perspective. Our results show, that practical misconceptions (such as the use of prime numbers) and architecture-dependent implementation glitches occur on every stage of this transition. We identify typical pitfalls and describe practical measures to find and resolve them. Our measures eliminate all remaining SDCs in the Cored voter, which is validated by an extensive fault-injection campaign that covers 100 percent of the fault space for 1-bit and 2-bit errors.
KW - AN code
KW - Arithmetic error coding
KW - Fault injection
KW - Redundancy
KW - Soft errors
KW - Software-based fault tolerance
UR - http://www.scopus.com/inward/record.url?scp=84898643057&partnerID=8YFLogxK
U2 - 10.1109/hase.2014.14
DO - 10.1109/hase.2014.14
M3 - Conference contribution
AN - SCOPUS:84898643057
SP - 33
EP - 40
BT - 2014 IEEE 15th International Symposium on High-Assurance Systems Engineering
T2 - 2014 IEEE 15th International Symposium on High-Assurance Systems Engineering, HASE 2014
Y2 - 9 January 2014 through 11 January 2014
ER -