COVID-19 Forecasting Using Recurrent Neural Network and Machine Learning
DOI:
https://doi.org/10.47611/jsrhs.v13i2.6799
Keywords:
Machine Learning, Recurrent Neural Networks, LSTM, COVID-19, SARIMAX, Deep Learning
Abstract
The complexity of COVID-19 variants and the dire consequences of their spread have motivated extensive research on modeling and predicting surges of newly emerging variants. Multiple factors influence the COVID-19 infection trend, including variant characteristics, the vaccination rate, the immune response of vaccinated individuals, and disease-prevention health policies. Advances in machine learning and neural network models, combined with the growth in computing power, have demonstrated outstanding potential for modeling and predicting epidemic diseases. This research models and predicts the combined COVID-19 variant infection trend using two time-series machine learning models, Holt-Winters exponential smoothing and the seasonal auto-regressive integrated moving average with exogenous factors (SARIMAX), together with a recurrent neural network (RNN) long short-term memory (LSTM) model. Real-world United States COVID-19 variant data from the Centers for Disease Control and Prevention (CDC) is used for prediction. The SARIMAX model, which factors in the seasonality of COVID-19 infections, showed higher prediction accuracy than the Holt-Winters model, which is heavily weighted towards the most recent trends. The LSTM model had the best prediction accuracy, 91%, and the lowest root mean square error (RMSE) values, owing to its ability to selectively remember patterns over long durations and to its forget gates, which mitigate the vanishing gradient problem and minimize error losses. This research demonstrates the promising application of neural network deep learning models to epidemic disease modeling and prediction, enabling timely assessment of different policy decisions to mitigate the impact of an epidemic.
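As a rough illustration of the kind of comparison the abstract describes, the sketch below implements additive Holt-Winters exponential smoothing and the RMSE metric in plain Python. It is not the study's implementation: the function names, the smoothing parameters `alpha`, `beta`, and `gamma`, the simple initialization scheme, and the synthetic weekly series are all assumptions made for this example, and no CDC data is used.

```python
import math


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    if len(actual) != len(predicted):
        raise ValueError("sequences must have the same length")
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


def holt_winters_additive(series, season_len, alpha, beta, gamma, horizon):
    """Forecast `horizon` steps ahead with additive Holt-Winters smoothing.

    Initialization is a simple heuristic (an assumption of this sketch):
    the trend is the average per-step change between the first two seasons,
    and the first season, with that trend removed, seeds the seasonal terms.
    """
    m = season_len
    if len(series) < 2 * m:
        raise ValueError("need at least two full seasons of data")
    first, second = series[:m], series[m:2 * m]
    mean_first = sum(first) / m
    trend = (sum(second) - sum(first)) / (m * m)
    # Level at the last point of the first season.
    level = mean_first + trend * (m - 1) / 2
    seasonal = [
        first[i] - (mean_first + trend * (i - (m - 1) / 2)) for i in range(m)
    ]
    # Update level, trend, and seasonal terms over the remaining observations.
    for t in range(m, len(series)):
        s = seasonal[t % m]
        prev_level = level
        level = alpha * (series[t] - s) + (1 - alpha) * (prev_level + trend)
        trend = beta * (level - prev_level) + (1 - beta) * trend
        seasonal[t % m] = gamma * (series[t] - level) + (1 - gamma) * s
    n = len(series)
    return [
        level + h * trend + seasonal[(n + h - 1) % m]
        for h in range(1, horizon + 1)
    ]


# Synthetic, noiseless series: linear growth plus a repeating 4-step pattern.
pattern = [-10, 0, 10, 0]
synthetic = [100 + 2 * t + pattern[t % 4] for t in range(16)]
train, holdout = synthetic[:12], synthetic[12:]

forecast = holt_winters_additive(
    train, season_len=4, alpha=0.5, beta=0.5, gamma=0.5, horizon=4
)
holdout_rmse = rmse(holdout, forecast)
```

Because the synthetic series is exactly linear plus periodic, the held-out RMSE here is essentially zero; real surveillance data would be noisy, and the same `rmse` function could then be used to score forecasts from other models, such as SARIMAX or an LSTM, against the same holdout.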
Copyright (c) 2024 Aahan Shah; Kieu Pham
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distribute & display this article.