Scalability of Parallel Scientific Applications on the Cloud

Cloud computing, with its promise of virtually infinite resources, seems to suit well in solving resource greedy scientific computing problems. To study the effects of moving parallel scientific applications onto the cloud, we deployed several benchmark applications like matrix–vector operations and...

Full description

Bibliographic Details
Main Authors: Satish Narayana Srirama, Oleg Batrashev, Pelle Jakovits, Eero Vainikko
Format: Article
Language:English
Published: Hindawi Limited 2011-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.3233/SPR-2011-0320
id doaj-14963dc593ee4e41a08805a1357162ff
record_format Article
spelling doaj-14963dc593ee4e41a08805a1357162ff2021-07-02T13:45:09ZengHindawi LimitedScientific Programming1058-92441875-919X2011-01-01192-39110510.3233/SPR-2011-0320Scalability of Parallel Scientific Applications on the CloudSatish Narayana Srirama0Oleg Batrashev1Pelle Jakovits2Eero Vainikko3Distributed Systems Group, Institute of Computer Science, University of Tartu, J. Liivi 2, Tartu, EstoniaDistributed Systems Group, Institute of Computer Science, University of Tartu, J. Liivi 2, Tartu, EstoniaDistributed Systems Group, Institute of Computer Science, University of Tartu, J. Liivi 2, Tartu, EstoniaDistributed Systems Group, Institute of Computer Science, University of Tartu, J. Liivi 2, Tartu, EstoniaCloud computing, with its promise of virtually infinite resources, seems to suit well in solving resource greedy scientific computing problems. To study the effects of moving parallel scientific applications onto the cloud, we deployed several benchmark applications like matrix–vector operations and NAS parallel benchmarks, and DOUG (Domain decomposition On Unstructured Grids) on the cloud. DOUG is an open source software package for parallel iterative solution of very large sparse systems of linear equations. The detailed analysis of DOUG on the cloud showed that parallel applications benefit a lot and scale reasonable on the cloud. We could also observe the limitations of the cloud and its comparison with cluster in terms of performance. However, for efficiently running the scientific applications on the cloud infrastructure, the applications must be reduced to frameworks that can successfully exploit the cloud resources, like the MapReduce framework. Several iterative and embarrassingly parallel algorithms are reduced to the MapReduce model and their performance is measured and analyzed. The analysis showed that Hadoop MapReduce has significant problems with iterative methods, while it suits well for embarrassingly parallel algorithms. Scientific computing often uses iterative methods to solve large problems. Thus, for scientific computing on the cloud, this paper raises the necessity for better frameworks or optimizations for MapReduce.http://dx.doi.org/10.3233/SPR-2011-0320
collection DOAJ
language English
format Article
sources DOAJ
author Satish Narayana Srirama
Oleg Batrashev
Pelle Jakovits
Eero Vainikko
spellingShingle Satish Narayana Srirama
Oleg Batrashev
Pelle Jakovits
Eero Vainikko
Scalability of Parallel Scientific Applications on the Cloud
Scientific Programming
author_facet Satish Narayana Srirama
Oleg Batrashev
Pelle Jakovits
Eero Vainikko
author_sort Satish Narayana Srirama
title Scalability of Parallel Scientific Applications on the Cloud
title_short Scalability of Parallel Scientific Applications on the Cloud
title_full Scalability of Parallel Scientific Applications on the Cloud
title_fullStr Scalability of Parallel Scientific Applications on the Cloud
title_full_unstemmed Scalability of Parallel Scientific Applications on the Cloud
title_sort scalability of parallel scientific applications on the cloud
publisher Hindawi Limited
series Scientific Programming
issn 1058-9244
1875-919X
publishDate 2011-01-01
description Cloud computing, with its promise of virtually infinite resources, seems to suit well in solving resource greedy scientific computing problems. To study the effects of moving parallel scientific applications onto the cloud, we deployed several benchmark applications like matrix–vector operations and NAS parallel benchmarks, and DOUG (Domain decomposition On Unstructured Grids) on the cloud. DOUG is an open source software package for parallel iterative solution of very large sparse systems of linear equations. The detailed analysis of DOUG on the cloud showed that parallel applications benefit a lot and scale reasonable on the cloud. We could also observe the limitations of the cloud and its comparison with cluster in terms of performance. However, for efficiently running the scientific applications on the cloud infrastructure, the applications must be reduced to frameworks that can successfully exploit the cloud resources, like the MapReduce framework. Several iterative and embarrassingly parallel algorithms are reduced to the MapReduce model and their performance is measured and analyzed. The analysis showed that Hadoop MapReduce has significant problems with iterative methods, while it suits well for embarrassingly parallel algorithms. Scientific computing often uses iterative methods to solve large problems. Thus, for scientific computing on the cloud, this paper raises the necessity for better frameworks or optimizations for MapReduce.
url http://dx.doi.org/10.3233/SPR-2011-0320
work_keys_str_mv AT satishnarayanasrirama scalabilityofparallelscientificapplicationsonthecloud
AT olegbatrashev scalabilityofparallelscientificapplicationsonthecloud
AT pellejakovits scalabilityofparallelscientificapplicationsonthecloud
AT eerovainikko scalabilityofparallelscientificapplicationsonthecloud
_version_ 1721328722173231104