Towards Optimal Resource Provisioning for Running MapReduce Programs in Public Clouds

TitleTowards Optimal Resource Provisioning for Running MapReduce Programs in Public Clouds
Publication TypeConference Paper
Year of Publication2011
AuthorsFengguang Tian, Keke Chen
Conference NameIEEE Conference on Cloud Computing
KeywordsCloud Computing, MapReduce, Performance Modeling, Resource Provisioning
Abstract

Running MapReduce programs in the public cloud introduces the important problem: how to optimize resource provisioning to minimize the financial charge for a specific job? In this paper, we study the whole process of MapReduce processing and build up a cost function that explicitly models the relationship between the amount of input data, the available system resources (Map and Reduce slots), and the complexity of the Reduce function for the target MapReduce job. The model parameters can be learned from test runs with a small number of nodes. Based on this cost model, we can solve a number of decision problems, such as the optimal amount of resources that can minimize the financial cost with a time deadline or minimize the time under certain financial budget. Experimental results show that this cost model performs well on tested MapReduce programs.

Full Text

Fengguang Tian, Keke Chen, 'Towards Optimal Resource Provisioning for Running MapReduce Programs in Public Clouds', IEEE Conference on Cloud Computing, Washington DC, 2011.

Related Files: