Abstract
An efficient design and implementation of the collective communication part in a Message Passing Interface (MPI) that is optimized for clusters of workstations is described. The system which consist of two main components, the MPI-CCL layer and a User-level Reliable Transport Protocol (URTP), is integrated with the operating system via an efficient kernel extension mechanism. The system is then implemented on a collection of IBM RS/6000 workstations connected via a 10Mbit Ethernet LAN. Results indicate that the performance of the MPI Broadcast (on top of Ethernet) is about twice as fast as a recently published software implementation of broadcast on top of ATM.
| Original language | English |
|---|---|
| Pages | 64-73 |
| Number of pages | 10 |
| DOIs | |
| State | Published - 1995 |
| Externally published | Yes |
| Event | Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA'95 - Santa Barbara, CA, USA Duration: 17 Jul 1995 → 19 Jul 1995 |
Conference
| Conference | Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA'95 |
|---|---|
| City | Santa Barbara, CA, USA |
| Period | 17/07/95 → 19/07/95 |