Comparison of exponential smoothing and ARIMA time series models for forecasting COVID-19 cases: a secondary data analysis

Authors

  • Abhinav Bahuguna Department of Biostatistics, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Jolly Grant, Dehradun, Uttarakhand, India
  • Akanksha Uniyal Department of Biostatistics, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Jolly Grant, Dehradun, Uttarakhand, India https://orcid.org/0000-0003-1408-1772
  • Neha Sharma Department of Community Medicine, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Jolly Grant, Dehradun, Uttarakhand, India
  • Jayanti Semwal Department of Community Medicine, Himalayan Institute of Medical Sciences, Swami Rama Himalayan University, Jolly Grant, Dehradun, Uttarakhand, India

DOI:

https://doi.org/10.18203/2320-6012.ijrms20231344

Keywords:

ARIMA, COVID-19, Exponential smoothing, Forecast, Predictive models, Time series analysis

Abstract

Background: In order to manage outbreaks and plan resources, health systems must be capable of accurately projecting COVID-19 case patterns. Health systems can effectively predict future illness patterns by using mathematical and statistical modelling of infectious diseases. Different methods have been used with comparatively good accuracy for various prediction goals in medical sciences. Some illustrations are provided by statistical techniques intended to forecast epidemic cases. In order to increase healthcare systems readiness, this study aimed to identify the most accurate models for COVID-19 with a high global prevalence of positive cases.

Methods: Exponential smoothing model and ARIMA were employed on time series datasets to forecast confirmed cases in upcoming months and hence the effectiveness of these predictive models were compared on the basis of performance measures.

Results: It was seen that the ARIMA (0,0,2) model is best fitted with smaller values of performance measures (RMSE=4.46 and MAE=2.86) while employed on the recent dataset for short duration. Holt-Winters Exponential smoothing model was found to be more accurate to deal with a longer period of time series based data.

Conclusions: The study revealed that working with recent dataset is more accurate to forecast the number of confirmed cases as compared to the data collected for longer period. The early-stage warnings through these predictive models would be beneficial for governments and health professionals to be prepared with the strategies at different levels for public health prevention.

Metrics

Metrics Loading ...

References

Rothan HA, Byrareddy SN. The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak. J Autoimmun. 2020;109:102433.

Khan FM, Gupta R. ARIMA and NAR based prediction model for time series analysis of COVID-19 cases in India. J Safety Science Resilience. 2020;1(1):12-8.

Kurbalija V, Radovanovi´c M, Ivanovi´c M, Schmidt D, Trzebiatowski GL, Burkhard HD, Hinrichs C. Time-series analysis in the medical domain: a study of tacrolimus administration and influence on kidney graft function. Comput Biol Med. 2014;50:19-31.

Nsoesie E, Beckman R, Shashaani S, Nagaraj K, Marathe M. A simulation optimization approach to epidemic forecasting. PLoS ONE. 2013;8:e67164.

Orbann C, Sattenspiel L, Miller E, Dimka J. Defining epidemics in computer simulation models: How do definitions influence conclusions? Epidemics. 2017;19:24-32.

Thomson MC, Molesworth AM, Djingarey MH, Yameogo KR, Belanger F, Cuevas LE. Potential of environmental models to predict meningitis epidemics in Africa. Trop Med Int Health. 2006;11:781-8.

Liu Q, Li Z, Ji Y, Martinez L, Zia UH, Javaid A, Lu W, Wang J. Forecasting the seasonality and trend of pulmonary tuberculosis in Jiangsu province of China using advanced statistical time-series analyses. Infect Drug Resist. 2019;12:2311-22.

Zhang X, Liu Y, Yang M, Zhang T, Young A, Li X. Comparative study of four time series methods in forecasting typhoid fever incidence in China. PLoS ONE. 2013;8:e63116.

Wang Y, Shen Z, Jiang Y. Comparison of ARIMA and GM (1,1) models for prediction of hepatitis B in China. PLoS ONE. 2018;13:e0201987.

Zhang L, Wang L, Zheng Y, Wang K, Zhang X, Zheng Y. Time prediction models for echinococcosis based on gray system theory and epidemic dynamics. Int J Environ Res. Public Health. 2017;14:262.

Cao L, Liu H, Li J, Yin X, Duan Y, Wang J. Relationship of meteorological factors and human brucellosis in Hebei province, China. Sci Environ. 2020;703:135491.

Imai C, Hashizume M. A systematic review of methodology: Time series regression analysis for environmental factors and infectious diseases. Trop Med Health. 2015;43:1-9.

Benvenuto D, Giovanetti M, Vassallo L, Angeletti S, Ciccozzi M. Application of the ARIMA model on the COVID-2019 epidemic dataset. Data Brief. 2020;29:1-4.

Dehesh T, Mardani HA, Dehesh P. Forecasting of COVID-19 confirmed cases in different countries with ARIMA models. Preprints. 2020;10:1101-45.

Chintalapudi N, Battineni G, Amenta F. COVID-19 virus outbreak forecasting of registered and recovered cases after sixty day lockdown in Italy: A data driven model approach. J Microbiol Immunol Infect. 2020;53:396-403.

Singh S, Murali Sundram B, Rajendran K, Boon Law K, Aris T, Ibrahim H, Chandra Dass S, Singh Gill B. Forecasting daily confirmed COVID-19 cases in Malaysia using ARIMA models. J Infect Dev Ctries. 2020;14(9):971-6.

Singh S, Sundram BM, Rajendran K, Law KB, Aris T, Ibrahim H, Dass SC, Gill BS. Forecasting daily confirmed COVID-19 cases in Malaysia using ARIMA models. J Infection Developing Countries. 2020;14(09):971-6.

Bezerra AK, Santos ÉM. Prediction the daily number of confirmed cases of COVID-19 in Sudan with ARIMA and Holt Winter exponential smoothing. Int J Development Res. 2020;10(08):39408-13.

Daily COVID-19 Health Bulletin: Ministry of Health and Family Welfare (MoHFW), Government of Uttarakhand. Available at https:// health. uk.gov.in. Accessed on 12 January 2023

Singh S, Sundram BM, Rajendran K, Law KB, Aris T, Ibrahim H, Dass SC, Gill BS. Forecasting daily confirmed COVID-19 cases in Malaysia using ARIMA models. J Infection Developing Countries. 2020;14(09):971-6.

Semwal J, Bahuguna A, Sharma N, Dikshit RK, Bijalwan R, Augustine P. Time series analysis of COVID-19 data-a study from Northern India. Indian J Community Health. 2022;34(2):87-9.

Papastefanopoulos V, Linardatos P, Kotsiantis S. COVID-19: a comparison of time series methods to forecast percentage of active cases per population. Applied Sci. 2020;10(11):3880.

Djakaria I, Saleh SE. COVID-19 forecast using Holt-Winters exponential smoothing. J Physics. 2021;1882(1):012033.

Jain A, Sukhdeve T, Gadia H, Sahu SP, Verma S. COVID19 prediction using time series analysis. Int Artificial Intelligence Smart Systems. 2021;25:1599-606.

Semwal J, Bahuguna A, Uniyal A, Vyas S. A study to analyse COVID-19 outbreak using multiple linear regression: a supervised machine learning approach. National J Community Med. 2023;14(02):82-9.

Gupta A. COVID-19: time series analysis. Int J Scientific Development Res. 2023:2455-631.

Zrieq R, Kamel S, Boubaker S, Algahtani FD, Alzain MA, Alshammari F, et al. Time-Series analysis and healthcare implications of COVID-19 pandemic in Saudi Arabia. Healthcare. 2022;10(10):1874.

Sandoval BC, Ferreira G, Parra BK, Flores LP. Prediction of confirmed cases of and deaths caused by COVID-19 in Chile through time series techniques: a comparative study. PLoS One. 2021;16(4):e0245414.

Downloads

Published

2023-04-29

How to Cite

Abhinav Bahuguna, Uniyal, A., Sharma, N., & Semwal, J. (2023). Comparison of exponential smoothing and ARIMA time series models for forecasting COVID-19 cases: a secondary data analysis . International Journal of Research in Medical Sciences, 11(5), 1727–1734. https://doi.org/10.18203/2320-6012.ijrms20231344

Issue

Section

Original Research Articles