by Erickson Johny Galindo da Silva, Artur Paiva Coutinho, Jean Firmino Cardoso and Saulo de Tarso Marques Bezerra.
Agreste Campus, Federal University of Pernambuco, Av. Marielle Franco, Km 59, Caruaru 55014-900, Brazil
Abstract
The centuries-old history of dam construction, from the Saad el-Kafara Dam to global expansion in the 1950s, highlights the importance of these structures in water resource management. The Jucazinho Dam, built in 1998, emerged as a response to the scarcity of water in the Agreste region of Pernambuco, Brazil. After having less than 1% of its water storage capacity in 2016, the dam recovered in 2020 after interventions by the local water utility. In this context, the reliability of influent flow prediction models for dams becomes crucial for managers. This study proposed hydrological models based on artificial intelligence that aim to generate flow series, and we evaluated the adaptability of these models for the operation of the Jucazinho Dam. Data normalization between 0 and 1 was applied to avoid the predominance of variables with high values. The model was based on machine learning and employed support vector regression (SVM), random forest (RF) and artificial neural networks (ANNs), as provided by the Python Sklearn library. The selection of the monitoring stations took place via the Brazilian National Water and Sanitation Agency’s (ANA) HIDROWEB portal, and we used Spearman’s correlation to identify the relationship between precipitation and flow. The evaluation of the performance of the model involved graphical analyses and statistical criteria such as the Nash–Sutcliffe model efficiency coefficient (NSE), the percentage of bias (PBIAS), the coefficient of determination (R2) and the root mean standard deviation ratio (RSR). The results of the statistical coefficients for the test data indicated unsatisfactory performance for long-term predictions (8, 16 and 32 days ahead), revealing a downward trend in the quality of the fit with an increase in the forecast horizon. The SVM model stood out by obtaining the best indices of NSE, PBIAS, R2 and RSR. The graphical results of the SVM models showed underestimation of the flow values with an increase in the forecast horizon due to the sensitivity of the SVM to complex patterns in the time series. On the other hand, the RF and ANN models showed hyperestimation of the flow values as the number of forecast days increased, which was mainly attributed to overfitting. In summary, this study highlights the relevance of artificial intelligence in flow prediction for the efficient management of dams, especially in water scarcity and data-scarce scenarios. A proper choice of models and the ensuring of reliable input data are crucial for obtaining accurate forecasts and can contribute to water security and the effective operation of dams such as Jucazinho.
Keywords:
support vector machine; random forest; artificial neural network; hydrological modeling; rainfall; flow; forecasting