Forecasting daily natural gas consumption with regression, time series and machine learning based methods

Yucesan M., Pekel E., Çelik E., Gul M., Serin F.

Energy Sources, Part A: Recovery, Utilization and Environmental Effects, 2021 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Publication Date: 2021
  • Doi Number: 10.1080/15567036.2021.1875082
  • Journal Name: Energy Sources, Part A: Recovery, Utilization and Environmental Effects
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, ABI/INFORM, Aerospace Database, Applied Science & Technology Source, CAB Abstracts, Communication Abstracts, Compendex, Computer & Applied Sciences, Environment Index, Greenfile, INSPEC, Metadex, Pollution Abstracts, Veterinary Science Database, Civil Engineering Abstracts
  • Keywords: Natural gas consumption, forecasting methods, regression, time series, machine learning, Turkey
  • Istanbul University Affiliated: Yes


© 2021 Taylor & Francis Group, LLC.An effective short-term natural gas forecasting method contributes to social contributions and allows industrial chain elements to function effectively and minimize economic losses. We dealt with a comparative framework on the applicability of different methods in daily natural gas service (NGS) consumption forecasting. In this context, time series, machine learning, evolutionary and population-based approaches, and their hybrid versions are applied to the NGS data. Hybridized approaches are tested in the scope of NGS consumption forecasting for the first time in the literature in this study. The case of Turkey is handled, and its NGS data is used to demonstrate the comparative framework’s applicability. The comparative study is assessed in the lights of common forecasting accuracy measures of mean absolute percentage error (MAPE), R-squared (R2), and mean squared error (MSE). According to each method’s results, the seasonal autoregressive integrated moving average with exogenous regressors (SARIMAX) and artificial neural network (ANN) hybrid model provides the most dominant performance with respect to MAPE. The lowest error was obtained with a MAPE value of 0.357 in this hybrid model constructed under seven neurons in its ANN structure. This model is followed by another hybrid model, autoregressive integrated moving average (ARIMA)-ANN, with a MAPE value of 0.5 under nine neurons in terms of accuracy performance. The worst performance value belongs to the Genetic algorithm-ANN hybrid model with a MAPE value of approximately 26%.