Abstract
We present a multivariate analysis technique called Co-plot that is especially suitable for samples with many variables and relatively few observations, as the data about workloads often is. Observations and variables are analyzed simultaneously. We find three stable clusters of highly correlated variables, but that the workloads themselves, on the other hand, are rather different from one another. Synthetic models for workload generation are also analyzed, and found to be reasonable; however, each model usually covers well one machine type. This leads us to conclude that a parameterized model of parallel workloads should be built, and we describe guidelines for such a model. Another feature that the models lack is self-similarity: We demonstrate that production logs exhibit this phenomenon in several attributes of the workload, and in contrast that the none of the synthetic models do.
Original language | English |
---|---|
Title of host publication | Job Scheduling Strategies for Parallel Processing - IPPS/SPDP 1999 Workshop, JSSPP 1999, Proceedings |
Editors | Dror G. Feitelson, Larry Rudolph |
Publisher | Springer Verlag |
Pages | 43-66 |
Number of pages | 24 |
ISBN (Print) | 3540666761, 9783540666769 |
DOIs | |
State | Published - 1999 |
Event | 5th Workshop on Job Scheduling Strategies for Parallel Processing, JSSPP 1999 held in conjunction with the IPPS/SPDP 1999 - San Juan, Puerto Rico Duration: 16 Apr 1999 → 16 Apr 1999 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 1659 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 5th Workshop on Job Scheduling Strategies for Parallel Processing, JSSPP 1999 held in conjunction with the IPPS/SPDP 1999 |
---|---|
Country/Territory | Puerto Rico |
City | San Juan |
Period | 16/04/99 → 16/04/99 |
Bibliographical note
Publisher Copyright:© Springer-Verlag Berlin Heidelberg 1999.