Survivable Software

This podcast episode focuses on survivable software and fault tolerance techniques for space systems, using real-world examples like the Ariane 5 disaster to illustrate the critical importance of robust software design. It covers technical strategies such as software and hardware monitoring, time-space partitioning, checkpoint and restart, and the use of dissimilar software and Byzantine fault detection to improve reliability. The discussion emphasizes that complete software testing is impossible, so layered fault tolerance approaches are essential to prevent catastrophic failures in aerospace applications.

Presenters:

Dr. Robert Hodson


Similar Videos