Detecting Distributed Termination When Processors Can Fail
Permanent Link(s)
Collections
Author
Lermen, C. W.
Schneider, Fred B.
Abstract
A collection of protocols to facilitate detection of the termination of a computation on a distributed system are developed. Communication is assumed to be accomplished by use of asynchronous broadcasting. It is argued that this is a reasonable assumption for a distributed system in light of advances in local networking. The protocols presented are all robust with respect to processor failures. They differ in their requirements - some make heavy use of the communications network at the end of a computation, while others spread the communications cost out through the computation. Problems of restarting failed processors are also addressed.
Date Issued
1980-12
Publisher
Cornell University
Keywords
Previously Published as
http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR80-449
Type
technical report