Kubernetes container cloud is currently a popular cloud computing technology, and its default elastic scaling method HPA(Horizontal Pod Autoscaler) can horizontally expand and shrink cloud native applications.However, this method is based on a single load index, which is difficult to apply to diversified cloud-native applications. In addition, the method performs elastic expansion based on the current load, so that the process of expansion and contraction has obvious hysteresis. This method is based on the sliding time window algorithm for elastic shrinkage, which is slow and easy to waste system resources.To solve these problems, an improved elastic stretching method is proposed in this paper. A dynamic weighted fusion algorithm is designed to fuse multiple load indicators into comprehensive load factors, which can fully reflect the comprehensive load of cloud native applications.CEEMDAN(Complete Ensemble Empirical Mode Decomposition with Adaptive Noise)-ARIMA(Autoregressive Integrated Moving Average Model) prediction model is proposed, and elastic expansion is realized in advance to cope with the burst traffic based on the predicted load value of the model.A method combining rapid capacity reduction and sliding time window is proposed to reduce system resource waste on the basis of ensuring application service quality.Experimental results show that compared with the HPA mechanism, the improved elastic scaling method shortens the average response time by 336.55% when dealing with the first burst traffic, reduces system resource usage by 50% after the traffic ends, and can quickly expand the capacity when encountering burst traffic again, with an average response time shortened by 66.83%.