Efficient Message Passing Interface (MPI) for Parallel Computing onClusters of Workstations

Bruck, Jehoshua; Dolev, Danny; Ho, Ching-Tien; Rosu, Marcel-Catalin; Strong, Ray

Efficient Message Passing Interface (MPI) for Parallel Computing onClusters of Workstations

dc.contributor.author	Bruck, Jehoshua	en_US
dc.contributor.author	Dolev, Danny	en_US
dc.contributor.author	Ho, Ching-Tien	en_US
dc.contributor.author	Rosu, Marcel-Catalin	en_US
dc.contributor.author	Strong, Ray	en_US
dc.date.accessioned	2007-04-23T16:26:52Z
dc.date.available	2007-04-23T16:26:52Z
dc.date.issued	1995-02	en_US
dc.description.abstract	Parallel computing on clusters of workstations and personal computers has very high potential, since it leverages existing hardware and software. Parallel programming environments offer the user a convenient way to express parallel computation and communication. In fact, recently, a Message Passing Interface (MPI) has been proposed as an industrial standard for writing "portable" message-passing parallel programs. The communication part of MPI consists of the usual point-to-point communication as well as collective communication. However, existing implementations of programming environments for clusters are built on top of a point-to-point communication layer (send and receive) over local area networks (LANs) and, as a result, suffer from poor performance in the collective communication part. In this paper, we present an efficient design and implementation of the collective communication part in MPI that is optimized for clusters of workstations. Our system consists of two main components: the MPI-CCL layer that includes the collective communication functionality of MPI and a User-level Reliable Transport Protocol (URTP) that interfaces with the LAN Data-link layer and leverages the fact that the LAN is a broadcast medium. Our system is integrated with the operating system via an efficient kernel extension mechanism that we developed. The kernel extension significantly improves the performance of our implementation as it can handle part of the communication overhead without involving user space. We have implemented our system on a collection of IBM RS/6000 workstations connected via a 10Mbit Ethernet LAN. Our performance measurements are taken from real scientific applications that run in a parallel mode by means of the MPI. The hypothesis behind our design is that system's performance will be bounded by interactions between the kernel and user space rather than by the bandwidth delivered by the LAN Data-Link Layer. Our results indicate that the performance of our MPI Broadcast (on top of Ethernet) is about twice as fast as a recently published software implementation of broadcast on top of ATM.	en_US
dc.format.extent	336028 bytes
dc.format.extent	395776 bytes
dc.format.extent	10 bytes
dc.format.mimetype	application/pdf
dc.format.mimetype	application/postscript
dc.format.mimetype	application/postscript
dc.identifier.citation	http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR95-1474	en_US
dc.identifier.uri	https://hdl.handle.net/1813/6083
dc.language.iso	en_US	en_US
dc.publisher	Cornell University	en_US
dc.subject	computer science	en_US
dc.subject	technical report	en_US
dc.title	Efficient Message Passing Interface (MPI) for Parallel Computing onClusters of Workstations	en_US
dc.type	technical report	en_US

Files

Original bundle

Now showing 1 - 3 of 3

Name:: 95-1474.pdf
Size:: 328.15 KB
Format:: Adobe Portable Document Format

Download

Name:: 95-1474.ps
Size:: 386.5 KB
Format:: Postscript Files

Download

Name:: junk.ps
Size:: 10 B
Format:: Postscript Files

Download

Collections

Computer Science Technical Reports