Fault tolerance

Home > Computer Science > High-Performance Computing > Distributed Computing > Fault tolerance

This is the ability of a system to continue working in the event of failures or errors, usually through redundancy and other measures.