COVID-19 Forecasting Using Recurrent Neural Network and Machine Learning

Authors

  • Aahan Shah Milpitas High School
  • Kieu Pham Milpitas High School

DOI:

https://doi.org/10.47611/jsrhs.v13i2.6799

Keywords:

Machine Learning, Recurrent Neural Networks, LSTM, COVID-19, SARIMAX, Deep Learning

Abstract

The COVID-19 variant’s complexity and the dire consequences of the variant spread are the inspirations behind the profound research on modeling and predicting new emerging variant surges. Multiple factors, including the variant characteristics, vaccination rate, the immune response of vaccinated individuals, and disease prevention health policies, impact the COVID-19 infection trend. The advancements in machine learning and neural network models, combined with the growth in computing, have demonstrated outstanding potential in modeling and predicting epidemic diseases. This research presents the modeling and prediction of the combined COVID-19 variant infection trends using the Holt-Winters exponential smoothing and seasonal auto-regressive integrated moving average with exogenous factors (SARIMAX) time-series machine learning models and recurrent neural network (RNN) long short-term memory (LSTM) model. Real-world United States COVID variant data from the Centers for Disease Control and Prevention (CDC) is used for prediction. The SARIMAX model factoring the seasonality of COVID-19 infections showed higher prediction accuracy than the Holt-Winters model, which is heavily weighted towards the most recent trends. The LSTM model had the best prediction accuracy of 91% with the lowest root mean square error (RMSE) values due to its property of selectively remembering patterns for long duration and the forget gates that correct the vanishing gradient problem, minimizing the error losses. This research demonstrates the promising application of the neural network deep learning models for epidemic disease modeling and prediction, enabling timely assessment of different policy decisions to mitigate the impact of an epidemic.

Downloads

Download data is not yet available.

References or Bibliography

Alali, Y., Harrou, F., & Sun, Y. (2022). A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models. Scientific Reports, 12(1), 2467. https://doi.org/10.1038/s41598-022-06218-3

Centers for Disease Control and Prevention. (n.d.). Weekly United States covid-19 cases and deaths by state - archived. Retrieved November 2023, from https://data.cdc.gov/Case-Surveillance/Weekly-United-States-COVID-19-Cases-and-Deaths-by-/pwn4-m3yp/about_data

Gold, J. A., Rossen, L. M., Ahmad, F. B., Sutton, P., Li, Z., Salvatore, P. P., ... & Jackson, B. R. (2020). Race, ethnicity, and age trends in persons who died from COVID-19—United States, May–August 2020. Morbidity and Mortality Weekly Report, 69(42), 1517. https://doi.org/10.15585%2Fmmwr.mm6942e1

Kannan, S., Shaik Syed Ali, P., & Sheeza, A. (2021). Omicron (B. 1.1. 529)-variant of concern-molecular profile and epidemiology: a mini review. Eur. Rev. Med. Pharmacol. Sci, 25(24), 8019-8022. https://doi.org/10.26355/eurrev_202112_27653

Narin, A., Kaya, C., & Pamuk, Z. (2021). Automatic detection of coronavirus disease (covid-19) using x-ray images and deep convolutional neural networks. Pattern Analysis and Applications, 24, 1207-1220. https://doi.org/10.1007/s10044-021-00984-y

Omane-Adjepong, M., Oduro, F. T., & Oduro, S. D. (2013). Determining the better approach for short-term forecasting of ghana’s inflation: Seasonal ARIMA Vs holt-winters. International Journal of Business, Humanities and Technology, 3(1), 69-79.

Polo, D., Quintela-Baluja, M., Corbishley, A., Jones, D. L., Singer, A. C., Graham, D. W., & Romalde, J. L. (2020). Making waves: Wastewater-based epidemiology for COVID-19–approaches and challenges for surveillance and prediction. Water research, 186, 116404. https://doi.org/10.1016/j.watres.2020.116404

Saigal, S., & Mehrotra, D. (2012). Performance comparison of time series data using predictive data mining techniques. Advances in Information Mining, 4(1), 57-66.

Shrestha, L. B., Foster, C., Rawlinson, W., Tedla, N., & Bull, R. A. (2022). Evolution of the SARS‐CoV‐2 omicron variants BA. 1 to BA. 5: implications for immune escape and transmission. Reviews in Medical Virology, 32(5), e2381. https://doi.org/10.1002/rmv.2381

Siami-Namini, S., Tavakoli, N., & Namin, A. S. (2019, December). The performance of LSTM and BiLSTM in forecasting time series. In 2019 IEEE International conference on big data (Big Data) (pp. 3285-3292). IEEE. https://doi.org/10.1109/BigData47090.2019.9005997

Siami-Namini, S., Tavakoli, N., & Namin, A. S. (2018, December). A comparison of ARIMA and LSTM in forecasting time series. In 2018 17th IEEE international conference on machine learning and applications (ICMLA) (pp. 1394-1401). IEEE. https://doi.org/10.1109/ICMLA.2018.00227

Sunny, M. A. I., Maswood, M. M. S., & Alharbi, A. G. (2020, October). Deep learning-based stock price prediction using LSTM and bi-directional LSTM model. In 2020, 2nd novel intelligent and leading emerging sciences conference (NILES) (pp. 87-92). IEEE. https://doi.org/10.1109/NILES50944.2020.9257950

World Health Organization (WHO) Coronavirus (COVID-19) dashboard. Retrieved November 2023, from https://covid19.who.int/

Wortham, J. M., Lee, J. T., Althomsons, S., Latash, J., Davidson, A., Guerra, K., ... & Reagan-Steiner, S. (2020). Characteristics of Persons Who Died with COVID-19-United States, February 12-May 18, 2020. MMWR. Morbidity and mortality weekly report, 69(28), 923-929. http://dx.doi.org/10.15585/mmwr.mm6928e1

Zou, D., Wang, L., Xu, P., Chen, J., Zhang, W., & Gu, Q. (2020). Epidemic model guided machine learning for COVID-19 forecasts in the United States. MedRxiv, 2020-05. https://doi.org/10.1101/2020.05.24.20111989

Published

05-31-2024

How to Cite

Shah, A., & Pham, K. (2024). COVID-19 Forecasting Using Recurrent Neural Network and Machine Learning. Journal of Student Research, 13(2). https://doi.org/10.47611/jsrhs.v13i2.6799

Issue

Section

HS Research Articles