Machine learning for time series forecasting: A statistical perspective

Authors

  • Charanjit Singh 1Associate Professor, Department Of Applied Science And Humanities, Global Group Of Institutes, Amritsar, Punjab, India Author
  • Naimoonisa begum 2Assistant Professor, Department Of Computer Science And Science And Engineering, Muffakham Jah College Of Engineering And Technology, Hyderabad ,Telangana, India Author
  • Paladugu Harshitha AI ML Research Associate Intern, Department Of Department Of Information Technology, Chaitanya Bharathi Institute Of Technology Osmania University (OU) Hyderabad, Hyderabad, Telangana ,India Author
  • Mr. M.Saravanan Assistant Professor, Department Of Computer Technology And Information Technology, Kongu Arts And Science College (Autonomous) Erode, Erode, Tamilnadu, India Author

DOI:

https://doi.org/10.62647/

Keywords:

Time series forecasting; Machine learning; Statistical modeling; ARIMA; Random Forest; Synthetic data; Healthcare analytics

Abstract

Time series forecasting is a fundamental problem in many real-world applications, including healthcare, finance, and energy systems, where accurate predictions are essential for effective decision-making. Classical statistical models such as autoregressive integrated moving average (ARIMA) have been widely used due to their theoretical rigor and interpretability; however, their performance is often limited by assumptions of linearity, stationarity, and predefined error distributions. Recent advances in machine learning have introduced flexible, data-driven alternatives that demonstrate superior forecasting accuracy in complex environments. This study investigates the advantages of machine learning for time series forecasting from a statistical perspective using a controlled synthetic case study. A synthetic dataset representing daily hospital admissions is generated with trend, multiple seasonalities, and nonlinear dynamics to emulate real-world behavior. Forecasting performance of a classical ARIMA model is compared with a machine learning-based Random Forest regressor. Model evaluation is conducted using standard error metrics and residual diagnostics. The results show that the machine learning model consistently outperforms the statistical model, achieving lower prediction error and improved residual behavior. From a statistical viewpoint, this improvement is attributed to the ability of machine learning models to act as nonparametric estimators of conditional expectations while relaxing restrictive modeling assumptions. The findings highlight that machine learning complements rather than replaces traditional statistical approaches and provides a robust framework for forecasting complex time series data.

Downloads

Download data is not yet available.

Downloads

Published

08-02-2026

How to Cite

Machine learning for time series forecasting: A statistical perspective. (2026). International Journal of Information Technology and Computer Engineering, 14(1), 159-169. https://doi.org/10.62647/