Skip to Main content Skip to Navigation
Conference papers

A context saving fault tolerant approach for a shared memory many-core architecture

Abstract : Mechanisms for runtime fault-tolerance in many-core architectures are mandatory to cope with transient and permanent faults. This issue is even more relevant with aggressive technology nodes due to process variability, aging effects, and susceptibility to upsets, among other factors. This work proposes to save periodically the context and to re-schedule tasks to the last reliable known state and avoid the faulty processor. This technique is implemented on an embedded multicore architecture named P2012. The proposed fault-tolerant approach induces a limited overhead of 9.37% in an industrial image processing application while guaranteeing a full-error recovery if any error is detected.
Document type :
Conference papers
Complete list of metadatas

https://hal-cea.archives-ouvertes.fr/cea-01817862
Contributor : Léna Le Roy <>
Submitted on : Monday, June 18, 2018 - 2:28:55 PM
Last modification on : Monday, February 10, 2020 - 6:14:16 PM

Identifiers

Collections

Citation

E. Wachter, N. Ventroux, Fernando Moraes. A context saving fault tolerant approach for a shared memory many-core architecture. 2015 IEEE International Symposium on Circuits and Systems (ISCAS), May 2015, Lisbon, Portugal. pp.1570-1573, ⟨10.1109/ISCAS.2015.7168947⟩. ⟨cea-01817862⟩

Share

Metrics

Record views

97