TY - GEN
T1 - Scientific workflow applications on amazon EC2
AU - Juve, Gideon
AU - Deelman, Ewa
AU - Vahi, Karan
AU - Mehta, Gaurang
AU - Berman, Benjamin P.
AU - Berriman, Bruce
AU - Maechling, Phil
PY - 2009
Y1 - 2009
N2 - The proliferation of commercial cloud computing providers has generated significant interest in the scientific computing community. Much recent research has attempted to determine the benefits and drawbacks of cloud computing for scientific applications. Although clouds have many attractive features, such as virtualization, on-demand provisioning, and "pay as you go" usage-based pricing, it is not clear whether they are able to deliver the performance required for scientific applications at a reasonable price. In this paper we examine the performance and cost of clouds from the perspective of scientific workflow applications. We use three characteristic workflows to compare the performance of a commercial cloud with that of a typical HPC system, and we analyze the various costs associated with running those workflows in the cloud. We find that the performance of clouds is not unreasonable given the hardware resources provided, and that performance comparable to HPC systems can be achieved given similar resources. We also find that the cost of running workflows on a commercial cloud can be reduced by storing data in the cloud rather than transferring it from outside.
AB - The proliferation of commercial cloud computing providers has generated significant interest in the scientific computing community. Much recent research has attempted to determine the benefits and drawbacks of cloud computing for scientific applications. Although clouds have many attractive features, such as virtualization, on-demand provisioning, and "pay as you go" usage-based pricing, it is not clear whether they are able to deliver the performance required for scientific applications at a reasonable price. In this paper we examine the performance and cost of clouds from the perspective of scientific workflow applications. We use three characteristic workflows to compare the performance of a commercial cloud with that of a typical HPC system, and we analyze the various costs associated with running those workflows in the cloud. We find that the performance of clouds is not unreasonable given the hardware resources provided, and that performance comparable to HPC systems can be achieved given similar resources. We also find that the cost of running workflows on a commercial cloud can be reduced by storing data in the cloud rather than transferring it from outside.
UR - http://www.scopus.com/inward/record.url?scp=77950143591&partnerID=8YFLogxK
U2 - 10.1109/ESCIW.2009.5408002
DO - 10.1109/ESCIW.2009.5408002
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:77950143591
SN - 9781424459452
T3 - e-science 2009 - Proceedings of the 2009 5th IEEE International Conference on e-Science Workshops
SP - 59
EP - 66
BT - e-science 2009 - Proceedings of the 2009 5th IEEE International Conference on e-Science Workshops
T2 - 2009 5th IEEE International Conference on e-Science Workshops, e-science 2009
Y2 - 9 December 2009 through 11 December 2009
ER -