Optimization And Implementation Of Application Auto-scaling Technology Based On Kubernetes

Posted on:2023-06-12

Degree:Master

Type:Thesis

Country:China

Candidate:W C Guo

Full Text:PDF

GTID:2568306914479064

Subject:Computer Science and Technology

Abstract/Summary:

With the development of cloud computing, container technology represented by Docker has gained rapid acceptance and become a preferred solution thanks to its light weight, low resource consumption, and fast startup. Kubernetes stands out for its powerful container orchestration capabilities and has become the de facto standard in that field. However, the current auto-scaling solution in Kubernetes cannot guarantee service quality in complex scenarios. This thesis analyzes the auto-scaling principle of Kubernetes, which is reactive and operates on one service at a time. As a result, scaling often lags behind load changes, and bottlenecks shift from one service to another. To address these problems, this thesis proposes the following optimization strategies for auto-scaling:

(1) A load prediction model based on an LSTM network and attention mechanisms. The model takes into account how various load indicators influence the predicted load. It uses a convolutional neural network to mine the time-series features of the load data and the correlations between different loads, then applies channel attention to weight the extracted features. Finally, it uses a bidirectional LSTM with temporal attention to make the prediction. Experiments on real load data show that the proposed model surpasses traditional algorithms such as LSTM in prediction accuracy, laying the foundation for optimizing Kubernetes auto-scaling.

(2) A scaling method based on deep reinforcement learning. The method uses the load prediction model from part (1) to model the reinforcement learning environment for scaling a multi-service application. It makes scaling decisions with an improved DQN and learns the optimal auto-scaling strategy through continuous interaction with the environment. The model can adjust the instance counts of multiple services simultaneously within one scaling decision cycle. Experiments on a Kubernetes cluster analyze the scaling effect and performance. The results show that the proposed method can respond in advance to traffic changes and scale the application as a whole, improving resource utilization while maintaining service quality.
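For context, the reactive, per-service behaviour that the abstract criticizes can be illustrated with the replica formula documented for the Kubernetes Horizontal Pod Autoscaler: desired replicas = ceil(current replicas × current metric / target metric), with no change when the ratio is within a tolerance (0.1 by default). The sketch below is only an illustration of that documented rule, not code from the thesis; the function name `desired_replicas` and the sample numbers are hypothetical.

```python
import math


def desired_replicas(current_replicas: int,
                     current_metric: float,
                     target_metric: float,
                     tolerance: float = 0.1) -> int:
    """Reactive replica count for ONE service, following the documented HPA rule.

    Hypothetical helper for illustration; it ignores min/max replica bounds,
    stabilization windows, and pod readiness handling.
    """
    ratio = current_metric / target_metric
    # Within the tolerance band the autoscaler leaves the replica count alone.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    # Otherwise scale in proportion to how far the metric is from its target.
    return math.ceil(current_replicas * ratio)


# Example: 4 replicas averaging 90% CPU against a 60% target.
print(desired_replicas(4, 90.0, 60.0))
```

Because the rule only sees the metric that has already been measured for a single service, it reacts after the load has changed and cannot coordinate with upstream or downstream services, which is the lag and bottleneck-shifting problem the thesis sets out to address.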

Keywords/Search Tags:

Kubernetes, load prediction, auto-scaling

Related items

1	Research On Container Auto-scaling Based On Kubernetes
2	Research On Flexible Scaling Strategy Of Container Resources Based On Kubernetes
3	Research And Implementation Of Workload Prediction And Auto-scaling In Container Cloud Platform
4	Design And Implementation Of Cloud-Native Application Platform Based On Kubernetes
5	Research On 5G VNF Auto Scaling Based On Load Prediction
6	Research On Kubernetes-oriented Automatic Container Elastic Scaling Technology
7	Design And Implementation Of Container Auto Scaling Algorithm Platform Based On Kubernetes
8	Research And Application Of Active Scaling And Load Balancing Algorithm For Micro-Services Based On Kubernetes
9	Design And Implementation Of Elastic Scaling And Dynamic Scheduling Strategy Based On Kubernetes
10	Research On Dynamic Scaling Technology Of Container Cluster Based On Combined Prediction Model