Application of adaptive ensembles of machine learning methods to the problem of time series forecasting

T.A. Voloshin, K.S. Zaytsev, M.E. Dunaev

Abstract


The purpose of this work is to study the process of applying adaptive ensembles of machine learning methods to the problem of time series forecasting, mainly for streaming data processing. For this, experiments with different types of regression algorithms were carried out, both classical methods for ensembling decision trees (random forest, gradient boosting) and neural networks, in particular convolutional and recurrent ones, were studied. For research, time series of application performance metrics, generated with the Apache Kafka service, were used. To apply machine learning algorithms to the trained data, the series prediction problem was converted to a general supervised learning problem using the sliding window technique. During the experiments, predictions were made for a long interval, for this a recursive forecasting strategy was applied. To improve the results of analysis in streaming mode, an ensemble model was proposed, which, as new data became available, based on errors, recalculate the weights that determine the contribution made to the final prediction by each individual algorithm. The possibility of initializing these weights by pre-training the model on a part of the training set was explored. During the experiments, ensemble methods showed good results in terms of accuracy, and the proposed adaptive weighted averaging technique really turned out to be able to improve the forecasting efficiency. The results of using neural networks were less impressive, which may be due to the use of a relatively small sample of data for training.

Full Text:

PDF (Russian)

References


Chyon, F. A., Suman, M. N. H., Fahim, M. R. I., & Ahmmed, M. S. (2022). Time series analysis and predicting COVID-19 affected patients by ARIMA model using machine learning. Journal of virological methods, 301, 114433.

McClymont, H., Si, X., & Hu, W. (2023). Using weather factors and google data to predict COVID-19 transmission in Melbourne, Australia: A time-series predictive model. Heliyon, 9(3), e13782.

Seong, Byeongchan. (2020). Smoothing and forecasting mixed-frequency time series with vector exponential smoothing models. Economic Modelling. 91.

Necati Aksoy, Istemihan Genc. (2023). Predictive models development using gradient boosting based methods for solar power plants. Journal of Computational Science, 67, 101958

Ibrahim, Ahmed & Kashef, Rasha & Corrigan, Liam. (2021). Predicting market movement direction for bitcoin: A comparison of time series modeling methods. Computers & Electrical Engineering. 89. 106905.

Dercole, Fabio & Sangiorgio, Matteo & Schmirander, Yunus. (2020). An empirical assessment of the universality of ANNs to predict oscillatory time series. IFAC-PapersOnLine. 53. 1255-1260.

Cantero-Chinchilla, Sergio & Simpson, Chris & Ballisat, Alexander & Croxford, Anthony & Wilcox, Paul. (2022). Convolutional neural networks for ultrasound corrosion profile time series regression. NDT & E International. 133. 102756.

Alassafi, Madini & Jarrah, Mu'tasem & Al-Otaibi, Reem. (2021). Time Series Predicting of COVID-19 based on Deep Learning. Neurocomputing. 468.

Tessoni, V., Amoretti, M. (2022). Advanced statistical and machine learning methods for multi-step multivariate time series forecasting in predictive maintenance. Procedia Computer Science. 200. 748–757.

Zhang, Lingyu & Bian, Wenjie & Qu, Wenyi & Tuo, Liheng & Wang, Yunhai. (2021). Time series forecast of sales volume based on XGBoost. Journal of Physics: Conference Series. 1873.

Wibawa, A.P., Utama, A.B.P., Elmunsyah, H. et al. Time-series analysis with smoothed Convolutional Neural Network. J Big Data 9, 44 (2022).

Sako, K., Mpinda, B. N., & Rodrigues, P. C. (2022). Neural Networks for Financial Time Series Forecasting. Entropy (Basel, Switzerland), 24(5), 657.

Rajagukguk RA, Ramadhan RAA, Lee H-J. A Review on Deep Learning Models for Forecasting Time Series Data of Solar Irradiance and Photovoltaic Power. Energies. 2020; 13(24):6623.

Bontempi, Gianluca & Ben Taieb, Souhaib & Le Borgne, Yann-Aël. (2013). Machine Learning Strategies for Time Series Forecasting.

Shikun, Chen & Luc, Nguyen. (2022). RRMSE Voting Regressor: A weighting function based improvement to ensemble regression.

Yajiao Tang & Zhenyu Song & Yulin Zhu & Huaiyu Yuan & Maozhang Hou & Junkai Ji & Cheng Tang & Jianqiang Li. (2022). A survey on machine learning models for financial time series forecasting. Neurocomputing, 512, 363-380.

Aggarwal, Akarsh & Alshehri, Mohammed & Kumar, Manoj & Alfarraj, Osama & Sharma, Purushottam & Pardasani, Kamal. (2020). Landslide data analysis using various time-series forecasting models. Computers & Electrical Engineering. 88. 106858.

Runge, Jason & Saloux, Etienne. (2023). A comparison of prediction and forecasting artificial intelligence models to estimate the future energy demand in a district heating system. Energy. 269. 126661.

Hewamalage, Hansika & Bergmeir, Christoph & Bandara, Kasun. (2020). Global Models for Time Series Forecasting: A Simulation Study. Pattern Recognition. 124. 108441.

Hoi, Steven & Sahoo, Doyen & Lu, Jing & Zhao, Peilin. (2018). Online Learning: A Comprehensive Survey. Neurocomputing.

Alina Beygelzimer, Elad Hazan, Satyen Kale, and Haipeng Luo. (2015). Online gradient boosting.


Refbacks

  • There are currently no refbacks.


Abava  Кибербезопасность IT Congress 2024

ISSN: 2307-8162