Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint
Master's === National Taiwan University (國立臺灣大學) === Graduate Institute of Computer Science and Information Engineering (資訊工程學研究所) === 105 === Cloud web applications can scale their resources to match a dynamic workload by using auto-scaling techniques. In this thesis, we focus on scaling containers of multiple sizes. Our goal is to minimize the container adjustment cost and resource insufficiency while mai...
Main Authors: | Yi-Lin Cheng 鄭以琳 |
---|---|
Other Authors: | Pang-Feng Liu 劉邦鋒 |
Format: | Others |
Language: | en_US |
Published: | 2017 |
Online Access: | http://ndltd.ncl.edu.tw/handle/rk33m8 |
id |
ndltd-TW-105NTU05392114 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-105NTU05392114 2019-05-15T23:39:40Z http://ndltd.ncl.edu.tw/handle/rk33m8 Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint 滿足資源使用限制及多種容器尺寸之容器數量自動調整 Yi-Lin Cheng 鄭以琳 Master's National Taiwan University (國立臺灣大學) Graduate Institute of Computer Science and Information Engineering (資訊工程學研究所) 105 Pang-Feng Liu 劉邦鋒 2017 thesis 33 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === National Taiwan University (國立臺灣大學) === Graduate Institute of Computer Science and Information Engineering (資訊工程學研究所) === 105 === Cloud web applications can scale their resources to match a dynamic workload by using auto-scaling techniques. In this thesis, we focus on scaling containers of multiple sizes. Our goal is to minimize the container adjustment cost and resource insufficiency while maintaining high resource utilization. We first propose a dynamic programming scaling algorithm as a baseline; it scales the containers optimally when the future workload is given. We then present two greedy scaling algorithms that work without future workload information. We also propose a heuristic scaling algorithm that predicts future resource demand using Gradient Boosting Regression: it first predicts the workload for a short period of time, then makes its scaling decisions using the optimal dynamic programming algorithm. We conduct experiments with two realistic workload traces and compare these algorithms under different parameter settings. Minimizing the container adjustment cost and the resource insufficiency at the same time is very challenging, so we discuss the trade-off between these two goals in various situations. The experiments show that when the cost of starting new servers is much more important than the resource insufficiency penalty, our short-term prediction approach increases the total cost by only 9.6% and decreases the utilization by only 10% compared with the dynamic programming algorithm that knows the future workload. |
author2 |
Pang-Feng Liu |
author_facet |
Pang-Feng Liu Yi-Lin Cheng 鄭以琳 |
author |
Yi-Lin Cheng 鄭以琳 |
spellingShingle |
Yi-Lin Cheng 鄭以琳 Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint |
author_sort |
Yi-Lin Cheng |
title |
Container Auto-Scaling with Multiple Container Sizes and Resource Utilization Constraint |
title_sort |
container auto-scaling with multiple container sizes and resource utilization constraint |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/rk33m8 |
_version_ |
1719151775187992576 |
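
The abstract describes a dynamic programming baseline that scales containers optimally when the future workload is known. The record does not give the thesis's actual cost model, so the following is only a minimal sketch under stated assumptions: the function name `plan_scaling`, its parameters, the per-container start cost, the linear insufficiency penalty, and the way the utilization constraint is checked are all hypothetical choices made for illustration, not the author's formulation.

```python
from itertools import product


def plan_scaling(demand, sizes, max_count, start_cost, penalty, min_util):
    """Toy DP over time steps; returns (total_cost, plan).

    Assumed model (not taken from the thesis):
      - a configuration is a tuple of container counts, one per size;
      - starting a container costs `start_cost`, stopping one is free;
      - each unit of unmet demand costs `penalty`;
      - a configuration is allowed only if demand / capacity >= min_util
        (the empty configuration is always allowed).
    """
    configs = list(product(range(max_count + 1), repeat=len(sizes)))
    capacity = {c: sum(n * s for n, s in zip(c, sizes)) for c in configs}

    def step_cost(prev, cur, d):
        # Adjustment cost for newly started containers plus shortfall penalty.
        started = sum(max(0, b - a) for a, b in zip(prev, cur))
        shortfall = max(0, d - capacity[cur])
        return start_cost * started + penalty * shortfall

    def allowed(cfg, d):
        cap = capacity[cfg]
        return cap == 0 or d / cap >= min_util

    empty = tuple(0 for _ in sizes)
    # best maps a configuration to (cheapest cost so far, path reaching it).
    best = {empty: (0.0, [])}
    for d in demand:
        nxt = {}
        for cur in configs:
            if not allowed(cur, d):
                continue
            cost, path = min(
                ((c + step_cost(prev, cur, d), p) for prev, (c, p) in best.items()),
                key=lambda item: item[0],
            )
            nxt[cur] = (cost, path + [cur])
        best = nxt
    return min(best.values(), key=lambda item: item[0])


if __name__ == "__main__":
    # Two container sizes (capacity 1 and 4), at most two containers of each.
    cost, plan = plan_scaling(
        demand=[4, 4], sizes=(1, 4), max_count=2,
        start_cost=1, penalty=10, min_util=0.5,
    )
    print(cost, plan)
```

The state space here is every combination of container counts, so the sketch is practical only for a few sizes and small counts. Raising `start_cost` relative to `penalty` makes the plan tolerate more shortfall in exchange for fewer container starts, which is the kind of trade-off the abstract discusses.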