Onboard computer fault-tolerance detection and mitigation

The aim for this thesis is to design a software library responsible for preventing, detecting, handling and logging faults caused by radiation in a representable flight computer system based on the Cobham Gaisler GR740 quad-core LEON4FT processor chip. The LEON processor family is commonly used in s...

Full description

Bibliographic Details
Main Author: Olofsson, Gustav
Format: Others
Language:English
Published: Luleå tekniska universitet, Institutionen för system- och rymdteknik 2020
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-81407
id ndltd-UPSALLA1-oai-DiVA.org-ltu-81407
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ltu-814072020-11-17T05:43:11ZOnboard computer fault-tolerance detection and mitigationengOlofsson, GustavLuleå tekniska universitet, Institutionen för system- och rymdteknik2020Onboard ComputerExceptionsError HandlingRTOSComputer SystemsDatorsystemThe aim for this thesis is to design a software library responsible for preventing, detecting, handling and logging faults caused by radiation in a representable flight computer system based on the Cobham Gaisler GR740 quad-core LEON4FT processor chip. The LEON processor family is commonly used in space applications and it is based on the open SPARC instruction set and has been extended with fault tolerant features to cope with both on-chip radiation effects as well as upsets in external memory. The new GR740 device introduces a new computer architecture with multiple buses as compared to previous chips, Level-2 cache and a memory scrubber accelerating fault mitigation in external SDRAM memories. As the processor system design keeps getting more complex it also requires software to handle more hardware and new events, including central handling and logging routines of faults. The report describes the analysis performed to identify sources of faults and proposed suitable mitigation techniques, the established software requirements and how they are translated into a software architecture, then implemented and finally demonstrated on hardware. Along with this, it is shown how the developed demonstrator application software library can be integrated into the RTEMS real-time operating system commonly used in European space missions. The results are based on the demonstrator execution, and the results show that the software is functionally working and validates that the performance of the scrubber matches the derived scrubbing timings. After the project is completed, the software library design will be evaluated for use in Cobham Gaisler’s payload computer platform for the GOMX-5 mission. Radiation upsets will be emulated by injecting faults while running the developed API on demonstrator applications. Furthermore, implementation of software into NASA cFS/cFE will be analysed. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-81407application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Onboard Computer
Exceptions
Error Handling
RTOS
Computer Systems
Datorsystem
spellingShingle Onboard Computer
Exceptions
Error Handling
RTOS
Computer Systems
Datorsystem
Olofsson, Gustav
Onboard computer fault-tolerance detection and mitigation
description The aim for this thesis is to design a software library responsible for preventing, detecting, handling and logging faults caused by radiation in a representable flight computer system based on the Cobham Gaisler GR740 quad-core LEON4FT processor chip. The LEON processor family is commonly used in space applications and it is based on the open SPARC instruction set and has been extended with fault tolerant features to cope with both on-chip radiation effects as well as upsets in external memory. The new GR740 device introduces a new computer architecture with multiple buses as compared to previous chips, Level-2 cache and a memory scrubber accelerating fault mitigation in external SDRAM memories. As the processor system design keeps getting more complex it also requires software to handle more hardware and new events, including central handling and logging routines of faults. The report describes the analysis performed to identify sources of faults and proposed suitable mitigation techniques, the established software requirements and how they are translated into a software architecture, then implemented and finally demonstrated on hardware. Along with this, it is shown how the developed demonstrator application software library can be integrated into the RTEMS real-time operating system commonly used in European space missions. The results are based on the demonstrator execution, and the results show that the software is functionally working and validates that the performance of the scrubber matches the derived scrubbing timings. After the project is completed, the software library design will be evaluated for use in Cobham Gaisler’s payload computer platform for the GOMX-5 mission. Radiation upsets will be emulated by injecting faults while running the developed API on demonstrator applications. Furthermore, implementation of software into NASA cFS/cFE will be analysed.
author Olofsson, Gustav
author_facet Olofsson, Gustav
author_sort Olofsson, Gustav
title Onboard computer fault-tolerance detection and mitigation
title_short Onboard computer fault-tolerance detection and mitigation
title_full Onboard computer fault-tolerance detection and mitigation
title_fullStr Onboard computer fault-tolerance detection and mitigation
title_full_unstemmed Onboard computer fault-tolerance detection and mitigation
title_sort onboard computer fault-tolerance detection and mitigation
publisher Luleå tekniska universitet, Institutionen för system- och rymdteknik
publishDate 2020
url http://urn.kb.se/resolve?urn=urn:nbn:se:ltu:diva-81407
work_keys_str_mv AT olofssongustav onboardcomputerfaulttolerancedetectionandmitigation
_version_ 1719357947303165952