Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint

Master's === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 105 === Cloud web applications can scale resources according to dynamic workload using auto-scaling techniques. In this thesis, we focus on scaling containers with multiple sizes. Our goal is to minimize the container adjustment cost and resource insufficiency while mai...

Bibliographic Details
Main Authors: Yi-Lin Cheng, 鄭以琳
Other Authors: Pang-Feng Liu
Format: Others
Language: en_US
Published: 2017
Online Access: http://ndltd.ncl.edu.tw/handle/rk33m8
id ndltd-TW-105NTU05392114
record_format oai_dc
spelling ndltd-TW-105NTU05392114 2019-05-15T23:39:40Z http://ndltd.ncl.edu.tw/handle/rk33m8 Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint 滿足資源使用限制及多種容器尺寸之容器數量自動調整 Yi-Lin Cheng 鄭以琳 Master's, National Taiwan University, Graduate Institute of Computer Science and Information Engineering, 105 Pang-Feng Liu 劉邦鋒 2017 學位論文 ; thesis 33 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 105 === Cloud web applications can scale resources according to dynamic workload using auto-scaling techniques. In this thesis, we focus on scaling containers of multiple sizes. Our goal is to minimize the container adjustment cost and resource insufficiency while maintaining high resource utilization. We first propose a dynamic programming scaling algorithm as a baseline. This algorithm scales the containers optimally when the future workload is given. We then present two greedy scaling algorithms that work without future workload information. We also propose a heuristic scaling algorithm that predicts the future resource demand using Gradient Boosting Regression. This algorithm first predicts the workload for a short period ahead, then makes its scaling decisions using the optimal dynamic programming algorithm. We conduct experiments with two realistic workload traces and compare the algorithms under different parameter settings. Minimizing the container adjustment cost and the resource insufficiency at the same time is very challenging, so we discuss the trade-off between these two goals in various situations. The experiments show that when the cost of starting new servers is much more important than the resource insufficiency penalty, our short-term prediction approach increases the total cost by only 9.6% and decreases the utilization by only 10%, compared with the dynamic programming algorithm that knows the future workload.
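The offline baseline described in the abstract can be illustrated with a small dynamic programming sketch. This is not the thesis's algorithm: the cost model (a fixed cost per container started or stopped plus a penalty per unit of unmet demand), the form of the utilization constraint (demand divided by capacity must be at least min_util, with the empty configuration always allowed), and all names such as plan_scaling, adjust_cost, penalty and min_util are assumptions made for illustration only.

```python
from itertools import product


def plan_scaling(demand, sizes, max_count, adjust_cost, penalty, min_util):
    """Return (total_cost, plan); plan[t] holds the container count per size.

    Hypothetical cost model, not taken from the thesis:
      adjust_cost * (containers started or stopped)
      + penalty * (demand not covered by capacity)
    """
    # Enumerate every configuration: a count in 0..max_count for each size.
    configs = list(product(range(max_count + 1), repeat=len(sizes)))
    capacity = {c: sum(n * s for n, s in zip(c, sizes)) for c in configs}

    def step_cost(prev, cur, d):
        changes = sum(abs(a - b) for a, b in zip(prev, cur))
        shortage = max(0, d - capacity[cur])
        return adjust_cost * changes + penalty * shortage

    def feasible(c, d):
        # Assumed utilization constraint: d / capacity >= min_util.
        # The empty configuration is always allowed, so a plan always exists.
        return capacity[c] == 0 or d >= min_util * capacity[c]

    start = tuple(0 for _ in sizes)
    # best[config] = (cheapest cost to be in config now, plan that achieves it)
    best = {start: (0.0, [])}
    for d in demand:
        nxt = {}
        for cur in configs:
            if not feasible(cur, d):
                continue
            cost, path = min(
                ((pc + step_cost(prev, cur, d), pp)
                 for prev, (pc, pp) in best.items()),
                key=lambda item: item[0],
            )
            nxt[cur] = (cost, path + [cur])
        best = nxt
    return min(best.values(), key=lambda item: item[0])


if __name__ == "__main__":
    # Sizes of 1 and 4 resource units, at most 2 containers of each size.
    print(plan_scaling([4, 4], [1, 4], 2, adjust_cost=1, penalty=10, min_util=0.5))
```

Because the sketch enumerates every configuration at every time step, it only suits small inputs. A short-term prediction approach such as the one the abstract describes would, under the same assumptions, call a planner like this on a forecast window instead of the true future demand.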
author2 Pang-Feng Liu
author_facet Pang-Feng Liu
Yi-Lin Cheng
鄭以琳
author Yi-Lin Cheng
鄭以琳
title Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/rk33m8