Experimental Evaluation of Memory Configurations of Hadoop in Docker Environments
Abstract
Hadoop is widely used in these days for big data analytics. Docker, a new container technology, is hot nowadays, and is the new QuickStart option for Apache Hadoop. It is a trend to build Hadoop cluster in Docker environments... [ view full abstract ]
Hadoop is widely used in these days for big data analytics. Docker, a new container technology, is hot nowadays, and is the new QuickStart option for Apache Hadoop. It is a trend to build Hadoop cluster in Docker environments in clouds or clusters. However, how to make better use of hardware resources and improve Hadoop performance in Docker environments is a challenge for users. In this paper we study memory configurations of Hadoop in Docker environments, and analyse the performance of Hadoop while altering Hadoop’s memory configurations. We select two different applications (a CPU-intensive application – WordCount and a memory-intensive application – TeraSort) and measure their resource usages in terms of CPU and memory footprint. Extensive tests have been executed, the test results show that appropriate customization of memory configurations improves performance compared to the default memory parameter settings.
Authors
-
Xueyuan Wang
(Software Research Institute Athlone Institute of Technology)
-
Brian Lee
(Software Research Institute Athlone Institute of Technology)
-
Yuansong Qiao
(Software Research Institute Athlone Institute of Technology)
Topic Areas
Cloud Infrastructures , Cloud Computing
Session
CL1 » Cloud Infrastructures & Cloud Computing 1 (14:40 - Tuesday, 21st June, MS105)
Paper
Experimental_Evaluation_of_Memory_Configurations_of_Hadoop_in_Docker_Environments_issc-2016_Final.pdf