Abstract | Cloud Computing has gained a lot of popularity in recent years because of the flexibility that it offers. In addition, there seems to be a rising interest in combining Parallel Computing, Cloud Computing and Big Data to create large scale scientific applications. WS-PGRADE is a gateway framework that allows users to create such applications by defining them as scientific workflows. This paper investigates how workflow systems and science gateways, such as WS-PGRADE, can be extended with data processing capabilities of Hadoop based on the MapReduce paradigm in the cloud. Analysis shows the methods described to integrate Hadoop with workflows and science gateways work well in different scenarios and can be used to create massively parallel applications for scientific analysis of Big Data. |
---|