Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === Cloud web applications can scale resources according to dynamic workload using auto-scaling technique. In this thesis, we focus on scaling containers with multiple sizes. Our goal is to minimize the container adjustment cost and resource insufficiency while mai...

Full description

Bibliographic Details
Main Authors:	Yi-Lin Cheng, 鄭以琳
Other Authors:	Pang-Feng Liu
Format:	Others
Language:	en_US
Published:	2017
Online Access:	http://ndltd.ncl.edu.tw/handle/rk33m8

id	ndltd-TW-105NTU05392114
record_format	oai_dc
spelling	ndltd-TW-105NTU053921142019-05-15T23:39:40Z http://ndltd.ncl.edu.tw/handle/rk33m8 Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint 滿足資源使用限制及多種容器尺寸之容器數量自動調整 Yi-Lin Cheng 鄭以琳碩士國立臺灣大學資訊工程學研究所 105 Cloud web applications can scale resources according to dynamic workload using auto-scaling technique. In this thesis, we focus on scaling containers with multiple sizes. Our goal is to minimize the container adjustment cost and resource insufficiency while maintaining high resource utilization. We first propose a dynamic programming scaling algorithm as a baseline. This dynamic programming algorithm scales the containers optimally when given the future workload. Then, we present two greedy scaling algorithms that work without future workload information. We also propose a heuristic scaling algorithm that predicts the future resource demand using Gradient Boosting Regression. This algorithm first predicts the future workload for a short period of time, then makes its scaling decisions using the optimal dynamic programming algorithm. We conduct experiments with two realistic workload traces and compare those algorithms under different parameter settings. It is very challenging to minimize the container adjustment cost and the resource insufficiency at the same time, so we discuss the trade-off between these two goals in various situations. The experiments show that when the cost to start new servers is much more important than resource insufficiency penalty, our short-term prediction approach will increase the total cost by only 9.6%, and decrease the utilization by only 10%, when compared with the dynamic programming that knows the future workload. Pang-Feng Liu 劉邦鋒 2017 學位論文 ; thesis 33 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立臺灣大學 === 資訊工程學研究所 === 105 === Cloud web applications can scale resources according to dynamic workload using auto-scaling technique. In this thesis, we focus on scaling containers with multiple sizes. Our goal is to minimize the container adjustment cost and resource insufficiency while maintaining high resource utilization. We first propose a dynamic programming scaling algorithm as a baseline. This dynamic programming algorithm scales the containers optimally when given the future workload. Then, we present two greedy scaling algorithms that work without future workload information. We also propose a heuristic scaling algorithm that predicts the future resource demand using Gradient Boosting Regression. This algorithm first predicts the future workload for a short period of time, then makes its scaling decisions using the optimal dynamic programming algorithm. We conduct experiments with two realistic workload traces and compare those algorithms under different parameter settings. It is very challenging to minimize the container adjustment cost and the resource insufficiency at the same time, so we discuss the trade-off between these two goals in various situations. The experiments show that when the cost to start new servers is much more important than resource insufficiency penalty, our short-term prediction approach will increase the total cost by only 9.6%, and decrease the utilization by only 10%, when compared with the dynamic programming that knows the future workload.
author2	Pang-Feng Liu
author_facet	Pang-Feng Liu Yi-Lin Cheng 鄭以琳
author	Yi-Lin Cheng 鄭以琳
spellingShingle	Yi-Lin Cheng 鄭以琳 Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
author_sort	Yi-Lin Cheng
title	Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
title_short	Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
title_full	Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
title_fullStr	Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
title_full_unstemmed	Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
title_sort	container auto-scaling with multiple container sizes and resource utilization constraint
publishDate	2017
url	http://ndltd.ncl.edu.tw/handle/rk33m8
work_keys_str_mv	AT yilincheng containerautoscalingwithmultiplecontainersizesandresourceutilizationconstraint AT zhèngyǐlín containerautoscalingwithmultiplecontainersizesandresourceutilizationconstraint AT yilincheng mǎnzúzīyuánshǐyòngxiànzhìjíduōzhǒngróngqìchǐcùnzhīróngqìshùliàngzìdòngdiàozhěng AT zhèngyǐlín mǎnzúzīyuánshǐyòngxiànzhìjíduōzhǒngróngqìchǐcùnzhīróngqìshùliàngzìdòngdiàozhěng
_version_	1719151775187992576

Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint

Similar Items