Now showing items 9-18 of 18

    • On the Optimum Checkpoint Selection Problem 

      Toueg, Sam; Babaoglu, Ozalp (Cornell University, 1983-02)
      We consider a model of computation consisting of a sequence of $n$ tasks. In the absence of failures, each task $i$ has a known completion time $t_{i}$. Checkpoints can be placed between any two consecutive tasks. At a ...
    • On the Reliability of Fault-Tolerant Distributed Computing Systems 

      Babaoglu, Ozalp (Cornell University, 1986-02)
      The designer of a fault-tolerant distributed system faces numerous alternatives. Using a stochastic model of processor failure times, we investigate design choices such as replication level, protocol running time, ...
    • Paralex: An Environment for Parellel Programming in Distributed Systems 

      Babaoglu, Ozalp; Alvisi, Lorenzo; Amoroso, Alessandro; Davoli, Renzo; Giachini, Luigi Alberto (Cornell University, 1991-12)
      Modern distributed systems consisting of powerful workstations and high-speed interconnection networks are an economical alternative to special-purpose super computers. The technical issues that need to be addressed in ...
    • Reliable Broadcast Protocols and Network Architecture: Tradeoffs and Lower Bounds 

      Babaoglu, Ozalp; Drummond, Rogerio; Stephenson, Patrick (Cornell University, 1986-05)
      Reliable Broadcast is a mechanism by which a processor in a distributed system disseminates a value to all other processors in the presence of both communication and processor failures. Protocols to achieve Reliable ...
    • Reliable Broadcasts Through Partial Broadcasts 

      Babaoglu, Ozalp; Stephenson, Patrick (Cornell University, 1985-12)
      In the Reliable Broadcast Problem, a processor disseminates a value to all other processors in a distributed system where both processors and communication components are subject to failures. We prove lower bounds for ...
    • Run-time Support for Dynamic Load Balancing and Debugging in Paralex 

      Babaoglu, Ozalp; Alvisi, Lorenzo; Amoroso, Alessandro; Davoli, Renzo; Giachini, Luigi Alberto (Cornell University, 1991-12)
      Paralex is a programming environment for developing and executing parallel applications in distributed systems. The user is spared complexities of distributed programming including remote execution, data representation, ...
    • Stopping Times of Distributed Consensus Protocols: A Probabilistic Analysis 

      Babaoglu, Ozalp (Cornell University, 1986-05)
      Given a model where each processor remains correct for an exponentially distributed random time and then fails independently of the others, we characterize system executions that permit the processors to reach consensus. ...
    • Streets of Byzantium: Network Architecture for Fast Reliable Broadcasts 

      Babaoglu, Ozalp; Drummond, Rogerio (Cornell University, 1985-06)
      A site broadcasting its local value to all other sites in a fault-prone environment is a fundamental paradigm in constructing reliable distributed systems. Time complexity lower bounds and network connectivity requirements ...
    • Time-Communication Tradeoffs for Reliable Broadcast Protocols 

      Babaoglu, Ozalp; Drummond, Rogerio (Cornell University, 1985-06)
      In the Reliable Broadcast Problem, a processor disseminates a value to all other processors in a distributed system where both processors and communication components are subject to failures. Solutions to this Reliable ...
    • Tools and Techniques for Adding Fault Tolerance to Distributed and Parallel Programs 

      Babaoglu, Ozalp (Cornell University, 1991-12)
      The scale of parallel computing systems is rapidly approaching dimensions where fault tolerance can no longer be ignored. No matter how reliable the individual components may be, the complexity of these systems results ...