EFTOS

EFTOS: A software framework for more dependable embedded HPC applications

Within the ESPRIT project EFTOS (Embedded Fault-Tolerant Supercomputing), a framework is developed to integrate fault tolerance flexibly and easily into distributed embedded HPC applications. The framework consists of a variety of reusable fault tolerance modules acting at different levels. It avoids the cost and performance overhead of generic operating-system-level and hardware-level fault tolerance mechanisms, while at the same time relieving application developers of the burden of ad hoc fault tolerance programming. Integrating this functionality into real embedded applications validates the approach and yields promising results.