Abstract
An efficient design and implementation of the collective communication part of the Message Passing Interface (MPI), optimized for clusters of workstations, is described. The system, which consists of two main components, the MPI-CCL layer and a User-level Reliable Transport Protocol (URTP), is integrated with the operating system via an efficient kernel extension mechanism. The system is implemented on a collection of IBM RS/6000 workstations connected via a 10 Mbit/s Ethernet LAN. Results indicate that the MPI Broadcast (on top of Ethernet) is about twice as fast as a recently published software implementation of broadcast on top of ATM.
Original language | English |
---|---|
Pages | 64-73 |
Number of pages | 10 |
State | Published - 1995 |
Externally published | Yes |
Event | Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA'95 - Santa Barbara, CA, USA; Duration: 17 Jul 1995 → 19 Jul 1995 |
Conference
Conference | Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA'95 |
---|---|
City | Santa Barbara, CA, USA |
Period | 17/07/95 → 19/07/95 |