Performance and Cost Analysis of the Supernova Factory on the Amazon AWS Cloud

Today, our picture of the Universe radically differs from that of just over a decade ago. We now know that the Universe is not only expanding as Hubble discovered in 1929, but that the rate of expansion is accelerating, propelled by mysterious new physics dubbed “Dark Energy”. This revolutionary dis...

Full description

Bibliographic Details
Main Authors: Keith R. Jackson, Krishna Muriki, Lavanya Ramakrishnan, Karl J. Runge, Rollin C. Thomas
Format: Article
Language:English
Published: Hindawi Limited 2011-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.3233/SPR-2011-0324
id doaj-a36e96f788d44bfe87943e331af1d066
record_format Article
spelling doaj-a36e96f788d44bfe87943e331af1d0662021-07-02T04:46:09ZengHindawi LimitedScientific Programming1058-92441875-919X2011-01-01192-310711910.3233/SPR-2011-0324Performance and Cost Analysis of the Supernova Factory on the Amazon AWS CloudKeith R. Jackson0Krishna Muriki1Lavanya Ramakrishnan2Karl J. Runge3Rollin C. Thomas4Lawrence Berkeley National Lab, Berkeley, CA, USALawrence Berkeley National Lab, Berkeley, CA, USALawrence Berkeley National Lab, Berkeley, CA, USALawrence Berkeley National Lab, Berkeley, CA, USALawrence Berkeley National Lab, Berkeley, CA, USAToday, our picture of the Universe radically differs from that of just over a decade ago. We now know that the Universe is not only expanding as Hubble discovered in 1929, but that the rate of expansion is accelerating, propelled by mysterious new physics dubbed “Dark Energy”. This revolutionary discovery was made by comparing the brightness of nearby Type Ia supernovae (which exploded in the past billion years) to that of much more distant ones (from up to seven billion years ago). The reliability of this comparison hinges upon a very detailed understanding of the physics of the nearby events. To further this understanding, the Nearby Supernova Factory (SNfactory) relies upon a complex pipeline of serial processes that execute various image processing algorithms in parallel on ~10 TBs of data. This pipeline traditionally runs on a local cluster. Cloud computing [Above the clouds: a Berkeley view of cloud computing, Technical Report UCB/EECS-2009-28, University of California, 2009] offers many features that make it an attractive alternative. The ability to completely control the software environment in a cloud is appealing when dealing with a community developed science pipeline with many unique library and platform requirements. In this context we study the feasibility of porting the SNfactory pipeline to the Amazon Web Services environment. Specifically we: describe the tool set we developed to manage a virtual cluster on Amazon EC2, explore the various design options available for application data placement, and offer detailed performance results and lessons learned from each of the above design options.http://dx.doi.org/10.3233/SPR-2011-0324
collection DOAJ
language English
format Article
sources DOAJ
author Keith R. Jackson
Krishna Muriki
Lavanya Ramakrishnan
Karl J. Runge
Rollin C. Thomas
spellingShingle Keith R. Jackson
Krishna Muriki
Lavanya Ramakrishnan
Karl J. Runge
Rollin C. Thomas
Performance and Cost Analysis of the Supernova Factory on the Amazon AWS Cloud
Scientific Programming
author_facet Keith R. Jackson
Krishna Muriki
Lavanya Ramakrishnan
Karl J. Runge
Rollin C. Thomas
author_sort Keith R. Jackson
title Performance and Cost Analysis of the Supernova Factory on the Amazon AWS Cloud
title_short Performance and Cost Analysis of the Supernova Factory on the Amazon AWS Cloud
title_full Performance and Cost Analysis of the Supernova Factory on the Amazon AWS Cloud
title_fullStr Performance and Cost Analysis of the Supernova Factory on the Amazon AWS Cloud
title_full_unstemmed Performance and Cost Analysis of the Supernova Factory on the Amazon AWS Cloud
title_sort performance and cost analysis of the supernova factory on the amazon aws cloud
publisher Hindawi Limited
series Scientific Programming
issn 1058-9244
1875-919X
publishDate 2011-01-01
description Today, our picture of the Universe radically differs from that of just over a decade ago. We now know that the Universe is not only expanding as Hubble discovered in 1929, but that the rate of expansion is accelerating, propelled by mysterious new physics dubbed “Dark Energy”. This revolutionary discovery was made by comparing the brightness of nearby Type Ia supernovae (which exploded in the past billion years) to that of much more distant ones (from up to seven billion years ago). The reliability of this comparison hinges upon a very detailed understanding of the physics of the nearby events. To further this understanding, the Nearby Supernova Factory (SNfactory) relies upon a complex pipeline of serial processes that execute various image processing algorithms in parallel on ~10 TBs of data. This pipeline traditionally runs on a local cluster. Cloud computing [Above the clouds: a Berkeley view of cloud computing, Technical Report UCB/EECS-2009-28, University of California, 2009] offers many features that make it an attractive alternative. The ability to completely control the software environment in a cloud is appealing when dealing with a community developed science pipeline with many unique library and platform requirements. In this context we study the feasibility of porting the SNfactory pipeline to the Amazon Web Services environment. Specifically we: describe the tool set we developed to manage a virtual cluster on Amazon EC2, explore the various design options available for application data placement, and offer detailed performance results and lessons learned from each of the above design options.
url http://dx.doi.org/10.3233/SPR-2011-0324
work_keys_str_mv AT keithrjackson performanceandcostanalysisofthesupernovafactoryontheamazonawscloud
AT krishnamuriki performanceandcostanalysisofthesupernovafactoryontheamazonawscloud
AT lavanyaramakrishnan performanceandcostanalysisofthesupernovafactoryontheamazonawscloud
AT karljrunge performanceandcostanalysisofthesupernovafactoryontheamazonawscloud
AT rollincthomas performanceandcostanalysisofthesupernovafactoryontheamazonawscloud
_version_ 1721339599610970112