Next Article in Journal
Applying GIS and Text Mining Methods to Twitter Data to Explore the Spatiotemporal Patterns of Topics of Interest in Kuwait
Previous Article in Journal
Between Post-Flâneur and Smartphone Zombie: Smartphone Users’ Altering Visual Attention and Walking Behavior in Public Space
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Theoretical Approach for Forecasting Different Types of Drought Simultaneously, Using Entropy Theory and Machine-Learning Methods

by
Pouya Aghelpour
1,
Babak Mohammadi
2,*,
Seyed Mostafa Biazar
3,
Ozgur Kisi
4,5 and
Zohreh Sourmirinezhad
1
1
MSc Graduated of Agricultural Meteorology, Department of Water Engineering, Faculty of Agriculture, Bu-Ali Sina University, Hamedan 65178-38695, Iran
2
College of Hydrology and Water Resources, Hohai University, Nanjing 210098, China
3
Department of Water Engineering, Faculty of Agriculture, University of Tabriz, Tabriz 51666-16471, Iran
4
Department of Civil Engineering, Ilia State University, Tbilisi 0162, Georgia
5
Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2020, 9(12), 701; https://doi.org/10.3390/ijgi9120701
Submission received: 16 October 2020 / Revised: 15 November 2020 / Accepted: 23 November 2020 / Published: 25 November 2020

Abstract

:
Precipitation deficit can affect different natural resources such as water, soil, rivers and plants, and cause meteorological, hydrological and agricultural droughts. Multivariate drought indexes can theoretically show the severity and weakness of various drought types simultaneously. This study introduces an approach for forecasting joint deficit index (JDI) and multivariate standardized precipitation index (MSPI) by using machine–learning methods and entropy theory. JDI and MSPI were calculated for the 1–12 months’ time window (JDI1–12 and MSPI1–12), using monthly precipitation data. The methods implemented for forecasting are group method of data handling (GMDH), generalized regression neural network (GRNN), least squared support vector machine (LSSVM), adaptive neuro-fuzzy inference system (ANFIS) and ANFIS optimized with three heuristic optimization algorithms, differential evolution (DE), genetic algorithm (GA) and particle swarm optimization (PSO) as meta-innovative methods (ANFIS-DE, ANFIS-GA and ANFIS-PSO). Monthly precipitation, monthly temperature and previous amounts of the index’s values were used as inputs to the models. Data from 10 synoptic stations situated in the widest climatic zone of Iran (extra arid-cold climate) were employed. Optimal model inputs were selected by gamma test and entropy theory. The evaluation results, which were given using mean absolute error (MAE), root mean squared error (RMSE) and Willmott index (WI), show that the machine learning and meta-innovative models can present acceptable forecasts of general drought’s conditions. The algorithms DE, GA and PSO, could improve the ANFIS’s performance by 39.4%, 38.7% and 22.6%, respectively. Among all the applied models, the GMDH shows the best forecasting accuracy with MAE = 0.280, RMSE = 0.374 and WI = 0.955. In addition, the models could forecast MSPI better than JDI in the majority of cases (stations). Among the two methods used to select the optimal inputs, it is difficult to select one as a better input selector, but according to the results, more attention can be paid to entropy theory in drought studies.

1. Introduction

Drought forecasting is one of the major concerns to the water managers, agriculturalists and other water exploiters. Iran, especially its central desert, is situated on the arid and extra-arid belt of the world and the drought intensity variations directly influence the people’s life, agriculture, surface water and groundwater in the country. During the years 1998–2000, Iran experienced one of the worst and the costliest drought periods that occurred in the last 50 years. During this three-year dry period, water scarcity reached a crisis point in more than 270 cities, thousands of villages ran out of drinking water, the rate of surface water flow decreased to 55%, and Iran’s dam and reservoirs have had to operate at their minimum volume capacity to transfer water because of the low inflows and high temperatures [1,2]. Thus, the investigation of the meteorological, hydrological and agricultural droughts is of great importance to the country. Drought indices can show the weakness and intensity of different types of droughts separately, but in recent years, certain indices have been introduced for indicating the intensity and weakness of the various types of droughts together, in a single series [3]. Among the most important of these indices, the multivariate standard precipitation index (MSPI) and joint deficit index (JDI) can be pointed out. These indices combine some series of standard precipitation indices (SPI) that each of them indicates a certain type of drought [4]. The combination is based on the various statistical methods, and the result reflects the general status of the drought in a given month.
JDI was first introduced by Kao and Govindaraju [5] as a joint drought index. They introduced the modified standard precipitation index (SPImod) in a study and pooled 1–12 month time window to come up with JDI and asserted that the method could be implemented on all of the other hydrological variables, such as the discharge rates of the rivers plus the precipitation data. After that, Mirabbasi et al. [6] calculated JDI for 50 rain gauges in the north of Iran during the period from 1970 to 2007. They concluded that JDI could provide a comprehensive view of the short- and long-term status of the drought in Iran, and they also forecasted rainfall for one to three-month horizons to investigate the drought status by JDI for the forthcoming months. In a study, Hao and Singh [7] investigated the multivariate indices of drought, including JDI, and the applied characteristics and weak and strong points of these indices. Ma et al. [8] combined JDI and standardized Palmer Index to introduce SPDI-JDI (standardized Palmer drought index-based joint drought index) index for ten climatic zones in Texas.
In 2014, Bazrafshan et al. [9] introduced the MSPI index as a joint drought index. They examined SPI’s time windows by performing principal component analysis and applied them to four stations from various climates in Iran and showed that the first principal component (PC1) could account for 74% of the general drought variations. Bazrafshan et al. [2] have investigated MSPI and JDI in terms of multivariate drought monitoring for 42 meteorological stations all over Iran. They calculated both of the indices for 1–12 month time window (which monitors meteorological, hydrological and agricultural droughts simultaneously) and showed that MSPI outperforms JDI for illustrating the intensity and weakness of drought in Iran’s climates. Bateni et al. [10] used MSPI for introducing the agrometeorological drought indices (AMDI-SA). In addition, MSPI prediction has been reported for Iranian climates in Aghelpour et al. [11] study for agricultural drought forecasting.
Since the droughts begin slowly, it is possible to rely on real-time predictions so that measures can be taken and policies can be adopted for reducing the drought effects [12]. In recent years, many models have been employed for predicting the drought indices, amongst which machine learning-based models can be pointed out. Moreover, metaheuristic models are obtained by blending these models with optimization algorithms that are applied to predicting drought. These models have been widely used for predicting the SPI family, such as SPI as the meteorological drought index [13,14,15,16] and SPEI as the agricultural drought index [17,18,19,20,21], and their favorable performance has been reported under various climatic conditions. For example, Kisi et al. [22] used adaptive neuro-fuzzy inference system (ANFIS) and its hybrid forms with particle swarm optimization (PSO) and genetic algorithm (GA) for forecasting SPI as a meteorological drought index. Malik et al. used [23] a co-active neuro-fuzzy inference system (CANFIS) for forecasting streamflow drought index (SDI), which is a hydrological drought index of the SPI family. Soh et al. [17] used the wavelet ANFIS model and SPEI indicator for forecasting agricultural drought. In most of the drought studies, machine learning models had significantly acceptable performances in predicting different types of droughts separately.
According to the research done, however, no model has been evaluated to forecast different types of droughts simultaneously, as well as the selection of optimal model inputs is a challenge for nonlinear dynamic models. The questions as to which inputs should be used for model development have been a challenge in practice. Although much research has been done on drought forecasting issues, most of them do not consider effective input selection for drought modeling. As a result, the main challenge lies in the evaluation of existing data and assessing the adequacy of data. Although modeling improves performance by adding information, observations show that adding information improves model performance only slightly. Modeling accuracy can be reduced by increasing information. This is because additional information causes the model to be over-fitted. The over-fitted model also performs well in training but has a very poor performance in testing. Overfitting also occurs when multivariate models have too much input data. It is, therefore, important to know which inputs are effective in modeling and which are not [24,25,26,27]. Furthermore, MSPI has been forecasted by ANFIS and its hybrid-type models [11], but it has been investigated just from the perspective of agricultural drought. In fact, the present study is the first research trying to forecast multivariate drought in extra-arid climates, based on the indices JDI and MSPI; which can theoretically show the severity and weakness of meteorological, hydrological and agricultural droughts, simultaneously. Thus, the current research paper is intended to evaluate the performance of machine learning and metaheuristic models for forecasting these two multivariate drought indices. Furthermore, as another innovation in drought forecasting studies, entropy theory and gamma tests have been used for selecting the suitable predictor input variables.

2. Materials and Methods

2.1. Study Region

Based on the extended De Marton Climatic Zones Classification Method, the widest climatic class of Iran is the extra-arid cold climatic zone. This zone has an area of 352,738.4 km2 in Iran, which accounts for about 21.4% of the country’s total area [28]. Based on this method, ten synoptic stations are specified in this climactic breadth, as shown in Figure 1. In the present study, drought monitoring and prediction are carried out in these ten stations. The reason for choosing this climate is that drought’s damaging effects are more significant in extra arid areas.
The information used in this study includes monthly precipitation (P) and temperature (T) data whose positions and statistical specifications have been summarized in Table 1. As observed from the table, precipitation data have skewed distribution (skewness ranges from 1.81 (Kabootarabad) to 3.11 (Yazd)). The data of each of these stations were procured from the general office of the country’s meteorology organization for the period of time from their establishment years till 2017. The precipitation data have been utilized for calculating both of the indices, i.e., JDI and MSPI, and also used as input to the prediction models. The monthly temperature data were also used as input, and their effects on the models’ accuracy were also investigated.

2.2. Indices

2.2.1. Joint Deficit Index (JDI)

The index was designed for the investigation of the joint behavior of the precipitation in SPImod’s 1-month to 12-month time windows considering their seasonal changes and combines the time windows of SPImod, as one time window of JDI (JDI1–12). JDI is essentially based on Copula Functions. This index uses a 12-dimensional experimental copula for showing the associations between SPImod’s time windows [5]. The experimental copula is based on the empirical ranking scales of joint experimental likelihood. For example, the 12D experimental copula is computed by Equation (1) for an n-numbered sample:
C n ( u 1 . . u 12 ) = 1 n i = 1 n I ( R i 1 n + 1 u 1 . . R i 12 n + 1 u 12 )
where I (A) denotes the logical A term that is assigned a value equal to unity in case of holding true and zero in the otherwise case. (Ri1, …, Ri12) are the ranks of the observed ith datum that are, respectively, shown as u1, …, u12. For more information, please refer to Kao and Govindaraju [5].

2.2.2. Multivariate Standardized Precipitation Index

The index is capable of blending several SPI variables, each of which belongs to a time window in one single series. The method that was used by Bazrafshan et al. [9] for this blending is principal component analysis (PCA). By this method, a large part of the drought intensity variability in different time windows is summarized in 1 series [9]. MSPI is based on the first principal component (PC1) that is a linear combination of k-time windows capable of explaining the highest percentage of variability in k first variables. The notable point is that their values cannot be compared between various places and months due to the algebraic properties of PC1, meaning that, unlike SPI that takes a mean value between 0 and 1, PC1 lacks such a trait, and hence, it must be standardized in respect to the mean and standard deviation of the various months (Equation (2)):
Z 1 y m = P C 1 y m P C 1 m ¯ S D 1 m P C 1 y m S D 1 m
In this equation, Z1ym is the standardized value of PC1 in yth year and mth month; P C 1 m ¯ is the mean value of PC1 in mth month, and SD1m is the standard deviation of PC1 in the mth month. Here, Z1ym is the value of MSPI. For more information regarding the MSPI calculation method, please refer to the original article by Bazrafshan et al. [9]. The drought intensity categories of these two indices are like SPI, as presented in Table 2:
Since JDI is calculated for 1–12 months’ time window (JDI1–12), MSPI has also been calculated for the same time window (MSPI1–12). Moreover, because 1-month SPI (SPI1) indicates the drought status from meteorological perspectives, 3-month SPI (SPI3) shows drought status from soil moisture viewpoint, 6-month SPI (SPI6) reflects hydrological drought conditions (river streams and dam reservoirs), and 9-month to 12-month SPIs (SPI9, SPI10, SPI11 and SPI12) can express the agricultural drought status [4], so the 1–12 month time window can concomitantly exhibit all the meteorological, soil moisture, hydrological and agricultural drought conditions, together. In the current research paper, these two indices (JDI and MSPI) were calculated using MATLAB Software.

2.3. Input Determination Methods

After calculating the indices and before prediction by the models, two methods of entropy theory and gamma test were utilized for selecting the optimal inputs and accelerating the modeling task. These methods are described below.

2.3.1. Gamma Test

Gamma test shows the variance of the estimated error according to the direct data. Thus, the error estimation that is commonly termed gamma test can be considered as equivalent to the sum of nonlinear error squares [30]. To have a better understanding of the gamma test in case of assuming that two points x and x’ approaching one another in the input space corresponding to y and y’ as outputs, these two points should also be approaching one another in the output space, otherwise the non-approaching of these two points stems from the difference caused by errors (noises) [31].

2.3.2. Entropy Theory

Shannon proposed the main concept of information theory in 1948 [32]. The definition of entropy is derived from information theory that analyzes a series of statistical structures from numerical series, signs and symbols establishing relationships between the signals [33]. In general, four types of entropy were introduced for quantifying the information content: boundary entropy, joint entropy, conditional entropy and information transfer entropy. The present study uses information transfer entropy that is succinctly explained as follows:
Shannon specified the first boundary entropy, named H(x), from a discrete random variable, x:
H ( x ) = K i = 1 n P ( x i ) l o g P ( x i )
The parameter takes a value equal to unity (K = 1) on the condition that the amount of H(x) logarithm is expressed based on Napier’s constant (e); n expresses the number of events with a probability of P ( x i ) = ( i = 1 .   2 .   . n ) .
The entropy of two random and independent variables of x and y is obtained from the sum of the boundary entropy of the two variables:
H ( x . y ) = H ( x ) + H ( y )
When the two random variables of x and y are associated, the joint entropy takes a value smaller than the total entropy obtained from Equation (5):
T ( x . y ) = H ( x ) + H ( y ) H ( x . y )
Information transfer entropy is another type of entropy specifying the mutual relationship between x and y, and it is obtained as the differential of the total and joint entropies for the two dependent variables x and y [34]:
T ( x . y ) = H ( x ) H ( x | y ) = H ( y ) H ( x | y )
Here, H(x|y) expresses the conditional entropy. The conditional entropy indicates x if y expresses the residual uncertainty in x on the condition that y is clearly known and vice versa.
According to the above explanations, entropy calculations can be generalized to M variables. The total entropy for the independent variable, x m ( m = 1 .   2 .   . M ) , is computable from Equation (7):
H ( x 1 . x 2 .   .   x M ) = m = 1 M H ( x m )
If the variables are associated, their joint entropy is calculated from Equation (8):
H ( x 1 . x 2 .   .   x M ) = H ( x 1 ) + m = 1 M H ( x m | x 1 . x 2 .   .   x m 1 )
The complementary information for entropy theory can be found in [27].

2.4. Models

2.4.1. Adaptive Neuro-Fuzzy Inference System (ANFIS)

ANFIS is a model that uses neural network learning algorithms and fuzzy logic for designing nonlinear maps between the input and output spaces. The model is capable of learning by neural networks and defining and using the relationships between input–output variables by fuzzy rules, and subsequently creating the input structure of the system.
Three methods of grid partitioning, subtractive clustering and fuzzy c-means are utilized in the classical ANFIS model, and the present study uses fuzzy c-means [35]. For more information about the details of the ANFIS model and fuzzy c-means method, the reference to articles by Jang et al. [36] is suggested.
In the present study, three strong optimization algorithms have also been used for optimizing the ANFIS model, as briefly explained below.

2.4.2. Differential Evolution (DE) Algorithm

This algorithm is a method in the area of complementary calculations, and it was introduced by Storn and Price [32]. DE begins working by providing a series of suggested responses and tries to find the optimum response within a series of consecutive iterations. The most important feature of this algorithm is its high speed, simplicity and strength. DE sets three primary parameters of population size (NP), mutation weight (F) and crossing (Cr). For more information, please see the articles presented by Storn and Price [37] and Halabi et al. [38].

2.4.3. Genetic Algorithm (GA)

This algorithm was introduced by Holland [39]. The principal component in GA is a chromosome that is consisted of genes, and each gene explains a parameter of the problem [40]. As the most important stage in the genetic algorithm, the selection is the process through which the next generation’s chromosomes are determined based on the current generation’s survival law. In this algorithm, the search is commenced in an initial population of the strains and the unique strains are evaluated in each repetition corresponding to an efficiency condition and assigned a proportion value. For complementary information, refer to the article by Jaramillo et al. [41].

2.4.4. Particle Swarm Optimization (PSO) Algorithm

PSO was inspired by the collective movement of the fish or birds’ flight in groups [42], and it was subsequently employed successfully in various sciences. In this algorithm, each solution is just a particle in the search space [43]. All of the particles are assigned a fitness value evaluated by a fitness function that must be optimized. Further information on the PSO algorithm can be found in Eberhart and Kennedy [42].
The schematic form of the ANFIS model, as well as its combination with DE, GA and PSO algorithms, is illustrated in Figure 2, considering the studied inputs and outputs. In this prediction approach, the time lags of each index can only be used to predict the same index; so here, this procedure is used for predicting JDI and MSPI separately.

2.4.5. Group Method of Data Handling (GMDH)

This model was offered in 1970 by Ivakhnenko, and it is a multivariate nonlinear statistical analysis [39]. GMDH is based on the ability to distinguish nested second- or third-degree polynomials for simulating regression and performing classification. In GMDH, each composite layer is comprised of several nodes as the model’s inputs for the transfer function. The results are transferred to the next layer that is per se employed as an input layer for the next layer of the network [40]. For more information, refer to articles by Ivakhnenko [44], Aghelpour and Varshavian [45], Ashrafzadeh et al. [46] and Aghelpour et al. [47]

2.4.6. Generalized Regression Neural Network (GRNN)

This type of neural network is one type of multipurpose basic and approximating radial neural networks for smooth functions. GRNN is a three-layer neural network in which, like the other neural networks [48]. The number of neurons existent on the first and last layers is, respectively, equal to the dimensions of the input and output vectors, but, unlike the other networks, the number of the neurons in the hidden layer of GRNN is equal to the number of the observed data. This type of neural network uses normal efficiency function in each of the neurons of the hidden layer, and the data inputted into this function for each neuron include the Euclidean distance between the input and observed data related to that neuron. See the sources for complementary information on GRNN [45,48,49].

2.4.7. Least Squares Support Vector Machine (LSSVM)

The least-squares support vector machine (LSSVM) was proposed by Suykens and Vandewalle in 1999, and it is used for the prediction of the turbulent time series [50]. LSSVM is a type of support vector machine that uses a collection of linear equations for training, while SVM uses a second-degree optimization problem, and this is their major difference [51]. For complementary information about this model, refer to Moazenzadeh and Mohammadi [52], Mohammadi and Aghashariatmadri [53] and Guan et al. [54].

2.5. Performance Evaluation Scales

To ensure the accuracy of the models’ predictions, mean absolute error (MAE), root mean square error (RMSE), Willmott index (WI) and Pearson’s correlation coefficient (R) were employed, and their related equations are shown in Equations (9)–(12):
MAE = 1 n i = 1 n | y i f i |
RMSE = 1 n i = 1 n ( y i f i ) 2
WI = 1 i = 1 N ( y i f i ) 2 i = 1 N ( | f i y ¯ | + | y i y ¯ | ) 2
R = i = 1 n ( y i y ¯ ) ( f i f ¯ ) i = 1 n ( y i y ¯ ) 2 i = 1 n ( f i f ¯ ) 2
where y i and y ¯ are, respectively the observed data and their means; f i and f ¯ are the model-predicted data and their means, and n is the number of the data.

3. Results

3.1. Input Selection

At first, the models’ inputs, including temperature, precipitation and the values of the index itself, were backwardly arranged up to 12 steps. Since the periodicity of a hydrological process has a 1 year (12 months) return period, it is logical to consider up to 12-month time lags as inputs. Then, gamma tests and entropy theory were utilized for selecting the optimal inputs. The results of these evaluations are given in Table 3 and Table 4. In these tables, P and T, respectively, denote the total monthly precipitation and mean temperature. The subscripts ti, as well, indicate ith time lag for each variable and they were considered as the models’ input in the present study.
In the present study, the processes followed in the article by the references of Ahmadi et al. [18], Biazar et al. [55] and Biazar et al. [56], have also been implemented in such a way that the entropy of the entire data was first calculated in these tables based on the entropy theory and the variables were omitted one-by-one and the entropy values of each combination are calculated in a second stage. If by deleting a variable from the input combination, the entropy value exceeds the primary entropy, the variable should be removed from the input combination. As for the gamma test, the same two stages are executed so that the variable whose gamma value becomes smaller than the initial gamma is removed. This process is separately repeated for all the variables (P, T, JDI and MSPI), and the results of the input selection for JDI and MSPI are summarized in Table 3 and Table 4.
Next, the input and target data (JDIt/MSPIt) are inserted into the models. In the modeling processes of the present study, 75% of the input and target samples was used for training, and 25% of them was applied in testing. In all of the stations, the modeling and prediction of multivariate drought were carried out based on the inputs selected by both of the methods (gamma and entropy). According to the paper’s issue (predicting), in all of the models, the test part was shown in the evaluation tables and discussed.

3.2. Results of JDI Prediction

Predicting JDI was implemented by two input scenarios and seven models in 10 stations. The results of JDI prediction are shown in Table 5.
This includes 140 evaluations for JDI. In these evaluations, there are 70 comparisons between entropy theory and gamma test, which shows the superiority of entropy theory in 43 cases (61.4%). Between the models, GMDH could have the best JDI predictions in all of the stations. The most accurate prediction of JDI is reported for Boshrouyeh station by inputs of entropy theory and GMDH model, with MAE = 0.338, RMSE = 0.456, and WI = 0.864. Among ANFIS and its hybrid models, the bio-inspired algorithms DE, GA and PSO could improve the performance of ANFIS by an average of 32.8%, 32.0% and 20.9%, respectively. The weakest performance belongs to the simple ANFIS model in East Isfahan station, which is given by inputs of gamma test with MAE = 0.626, RMSE = 1.207, and WI = 0.674. The predictions of JDI for each station are shown in time-series plots (Figure 3).
In this figure, the best prediction of each station in the test period (bold cases of Table 5) is presented beside its observation. At the first look at these plots (Figure 3), there is not significant overlapping of the outputs for the stations in JDI prediction. However, in some of the cases, the overlapping can be acceptable. For example, in Boshrouyeh station, the moderate and severe droughts that occurred in Augusts of 2011, 2012, 2013 and 2014, are predicted accurately. Or the moderate drought classes for Yazd station in September of 2002, 2003, 2005, 2007, 2011, 2013, 2014 and 2016, were acceptably predicted. However, in the wet classes of the stations, the overlapping is weak and unacceptable.

3.3. Results of MSPI Prediction

MSPI prediction was also implemented by the inputs of the two aforementioned methods, entropy theory and gamma test, and the evaluation results are shown in Table 6.
Similarly, for this index, there are 140 evaluations and 70 comparisons between the input selectors too. In 40 cases (about 57.1%), entropy theory was superior in comparison to gamma tests. In MSPI prediction like JDI, GMDH is evaluated as the most accurate model, which had the best predictions in all of the investigated stations. The most accurate prediction of MSPI with MAE = 0.280, RMSE = 0.374 and WI = 0.955, is made for Marvast station by GMDH model and inputs of entropy theory. In evaluating ANFIS and its hybrid forms, the results showed that DE, GA and PSO algorithms could cause on average 45.9%, 45.4% and 24.2% improvement for ANFIS, respectively. The weakest MSPI prediction is reported for the ANFIS model in Boshrouyeh station, with MAE = 1.235, RMSE = 1.086 and WI = 0.772, which is given by the inputs of gamma tests. In Figure 4, the overlapping of MSPI predictions and their observations are shown in time series plots.
In these graphs, the best predictions of each station (bold cases of Table 5) are drawn. At first look at Figure 4, it is obvious that there is stronger overlapping in MSPI prediction compared to JDI in all of the stations. The predictions of stations East Isfahan, Isfahan, Kashan and Yazd, have significant overlapping with their observations, even in severe and extreme drought events. The weakest MSPI predictions can be reported for Biarjmand and Boshrouyeh stations, but in comparison with JDI, they are significantly more accurate. In addition, in wet classes, the predictions of MSPI are acceptable in all of the investigated stations. In addition, the scatter plots for comparing the predicted, and observed values are provided in Figure 5.

3.4. Models’ Comparisons

In this section, a Taylor diagram was used for comparing the overall performance of the models in such a way that the entire outputs of each model for the entire stations were evaluated against the real values of the indices (Figure 6).
The diagram exhibits the error rate, correlation coefficient and the standard deviation of the estimated values in comparison to the observed amounts [57,58]. Taylor diagrams (Figure 6) show that ANFIS and GRNN have had the lowest accuracy in contrast to the other models in terms of both of the suggested inputs (entropy theory and gamma test), and GMDH was found with the best predictions. It can be stated, according to the Taylor diagram, that the models’ predictions are more correlated and less erroneous when applying the inputs of entropy theory. Furthermore, the standard deviation of the outputs also approaches the real values to some extent, with the use of entropy theory’s inputs. In diagrams related to JDI, the maximum value of R is about 0.7, and the minimum value of RMSE is about 0.65 that is pertinent to the GMDH model and entropy theory. MSPI diagrams also confirm that GMDH (especially with entropy theory’s inputs) has the best performance with RMSE below 0.5 and R found to be about 0.9. In this case (MSPI with entropy theory’s inputs), the standard deviation values of the predicted data are closest to that of the observed data.
The models’ error distributions are shown in violin plots (Figure 7). In this violin plot, the entire errors of each model (for all the stations) were delineated for each index. It can be seminally observed that the violin’s curvature is softer in the violin plots drawn for ANFIS and GRNN models. This is reflective of the idea that these two models’ errors have relatively more uniform distribution than the other models; or, in other words, the errors’ frequencies slightly differ between the larger and smaller values, and the errors with larger values are relatively more frequent in these two models. In the ANFIS-PSO model and, after this, in LSSVM, ANFIS-GA and ANFIS-DE models, the violins’ curvatures are reduced and their frequency is gradually increased about zero error. In the GMDH model, this issue reaches its peak point in such a way that the GMDH model’s errors feature the highest frequency about zero in all the four diagrams of Figure 7. The issue can be clearly seen in the MSPI’s outputs.

3.5. Evaluation of the Models’ Accuracy in Drought Classes

In this part of the present study, the models’ errors are evaluated with regard to the various drought classes. Something that draws attention at first glance in Figure 8 is that JDI is one class shorter than MSPI.
The reason for this issue is JDI’s weakness in showing the very severe droughts, and it had been previously examined by Bazrafshan et al. [2]. Considering Figure 8, MSPI takes a smaller RMSE value in comparison to JDI for all of the drought classes. The lowest prediction error pertains to the normal class, and the RMSE value is increased after this with the increase in the intensity of drought or wetness. It can also be observed that drought is predicted with smaller errors in contrast to wetness for identical intensities. For instance, in MSPI predictions based on the inputs suggested by entropy theory, RMSE value was smaller in severe drought periods than severe wetness periods and/or more accurate predictions were offered by the models for the very severe drought periods as compared to very intensive wetness periods. In addition, in predicting moderate and severe droughts (for both MSPI and JDI) as well as predicting very severe droughts (MSPI), GMDH followed by ANFIS-DE and ANFIS-GA can be trusted to give more reliable results compared to other models.

4. Discussion

Forecasting JDI has not been found among the previous studies up to now. MSPI forecasting was found only in one study [11] that is getting discussed. Aghelpour et al. [11] used the GA & PSO algorithms to optimize ANFIS, and there are also similar stations (Isfahan & Yazd). Comparing these two prediction studies shows that the similar models, including ANFIS, ANFIS-GA & ANFIS-PSO, had better performances for these two stations, in the research of Aghelpour et al. [11]. There are similarities between the models and stations; so, the difference can only be related to the time windows. In Aghelpour et al. [11], the prediction was implemented for the time windows MSPI9-12, but in the current study, MSPI1–12 was investigated. MSPI9–12 is an index of agricultural drought and deal with about one type of drought. However, MSPI1–12 is an index for three types of droughts, including meteorological, hydrological and agricultural drought simultaneously. In predicting three types of droughts simultaneously, more error is completely normal, compared to predicting just one type of drought.
In comparing the two multivariate drought indices, these prediction results can complement the study by Bazrafshan et al. [2]. They found MSPI is more appropriate than JDI for monitoring multivariate drought status in Iran. The present study adds that MSPI predictions are also more accurate than JDI’s. Hence, it can be stated that the MSPI is more appropriate for both monitoring and predicting multivariate droughts compared to JDI in Iran.

5. Conclusions

Machine learning models have good accuracies for predicting multivariate drought based on JDI and MSPI and can be considered good alternatives among the models that can provide short-term predictions.
Among the studied models, GMDH was found to have the best accuracy and is suggested to be employed for multivariate drought predictions. The reason for that the number of parameters in this model is more than the other models. The number of layers and neurons in GMDH is changeable and creates a large range of network make-up, whereas GRNN has only spread parameters. The optimizers of ANFIS, including DE, GA and PSO, could improve the ANFIS’s accuracy by 39.4%, 38.7% and 22.6%, respectively. Hence, in multivariate drought forecasting, the use of DE and GA is suggested.
On the other hand, according to the results of the gamma test and entropy theory, precipitation and temperature values of the previous months (in addition to the past amounts of the indices) are also influential in forecasting multivariate drought. Hence, the use of these variables as the models’ inputs is suggested for predicting the general status of the drought. None of the two methods of input selection for short-term prediction of the indices proposed in the current research can be decisively considered superior to the other, but entropy theory seems to be more promising according to the general results. Generally, forecasting some different types of drought together (e.g., meteorological, hydrological and agricultural droughts simultaneously) by the current method is a “theoretical study” and to be changed to an “applied study”, it is suggested to validate the indices by the actual simultaneous events of these different drought types.

Author Contributions

Conceptualization, Pouya Aghelpour; methodology, Pouya Aghelpour; software, Pouya Aghelpour, Seyed Mostafa Biazar, and Babak Mohammadi; validation, Pouya Aghelpour, Seyed Mostafa Biazar, and Babak Mohammadi; investigation, Pouya Aghelpour, and Zohreh Sourmirinezhad; resources, Babak Mohammadi; data curation, Pouya Aghelpour; writing—original draft preparation, Babak Mohammadi, Pouya Aghelpour, and Zohreh Sourmirinezhad; writing—review and editing, Babak Mohammadi, and Ozgur Kisi; visualization, Pouya Aghelpour; supervision, Ozgur Kisi. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The authors thank the Iran Meteorological Organization for providing the data used in this study, and the reviewers and the editors who gave valuable time to review the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Abbaspour, M.; Sabetraftar, A. Review of cycles and indices of drought and their effect on water resources, ecological, biological, agricultural, social and economical issues in Iran. Int. J. Environ. Stud. 2005, 62, 709–724. [Google Scholar] [CrossRef]
  2. Bazrafshan, J.; Nadi, M.; Ghorbani, K. Comparison of Empirical Copula-Based Joint Deficit Index (JDI) and Multivariate Standardized Precipitation Index (MSPI) for Drought Monitoring in Iran. Water Resour. Manag. 2015, 29, 2027–2044. [Google Scholar] [CrossRef]
  3. Bai, X.; Wang, Y.; Jin, J.; Ning, S.; Wang, Y.; Wu, C. Spatio-Temporal Evolution Analysis of Drought Based on Cloud Transformation Algorithm over Northern Anhui Province. Entropy 2020, 22, 106. [Google Scholar] [CrossRef] [Green Version]
  4. Svoboda, M.; Hayes, M.; Wood, D. Standradized Precipitation Index User Guide; World Meteorological Organization: Geneva, Switzerland, 2012; Volume 21, pp. 1333–1348. [Google Scholar] [CrossRef] [Green Version]
  5. Kao, S.C.; Govindaraju, R.S. A copula-based joint deficit index for droughts. J. Hydrol. 2010, 380, 121–134. [Google Scholar] [CrossRef]
  6. Mirabbasi, R.; Anagnostou, E.N.; Fakheri-Fard, A.; Dinpashoh, Y.; Eslamian, S. Analysis of meteorological drought in northwest Iran using the Joint Deficit Index. J. Hydrol. 2013, 492, 35–48. [Google Scholar] [CrossRef]
  7. Hao, Z.; Singh, V.P. Drought characterization from a multivariate perspective: A review. J. Hydrol. 2015, 527, 668–678. [Google Scholar] [CrossRef]
  8. Ma, M.; Ren, L.; Singh, V.P.; Tu, X.; Jiang, S.; Liu, Y. Evaluation and application of the SPDI-JDI for droughts in Texas, USA. J. Hydrol. 2015, 521, 34–45. [Google Scholar] [CrossRef]
  9. Bazrafshan, J.; Hejabi, S.; Rahimi, J. Drought Monitoring Using the Multivariate Standardized Precipitation Index (MSPI). Water Resour. Manag. 2014, 28, 1045–1060. [Google Scholar] [CrossRef]
  10. Bateni, M.M.; Behmanesh, J.; De Michele, C.; Bazrafshan, J.; Rezaie, H. Composite agrometeorological drought index accounting for seasonality and autocorrelation. J. Hydrol. Eng. 2018, 23. [Google Scholar] [CrossRef] [Green Version]
  11. Aghelpour, P.; Bahrami-Pichaghchi, H.; Kisi, O. Comparison of three different bio-inspired algorithms to improve ability of neuro fuzzy approach in prediction of agricultural drought, based on three different indexes. Comput. Electron. Agric. 2020, 170, 105279. [Google Scholar] [CrossRef]
  12. Moreira, E.E.; Pires, C.L.; Pereira, L.S. SPI drought class predictions driven by the North Atlantic Oscillation index using log-linear modeling. Water 2016, 8, 43. [Google Scholar] [CrossRef] [Green Version]
  13. Belayneh, A.; Adamowski, J.; Khalil, B.; Ozga-Zielinski, B. Long-term SPI drought forecasting in the Awash River Basin in Ethiopia using wavelet neural networks and wavelet support vector regression models. J. Hydrol. 2014, 508, 418–429. [Google Scholar] [CrossRef]
  14. Deo, R.C.; Kisi, O.; Singh, V.P. Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and M5Tree model. Atmos. Res. 2017, 184, 149–175. [Google Scholar] [CrossRef] [Green Version]
  15. Mokhtarzad, M.; Eskandari, F.; Jamshidi Vanjani, N.; Arabasadi, A. Drought forecasting by ANN, ANFIS, and SVM and comparison of the models. Environ. Earth Sci. 2017, 76, 729. [Google Scholar] [CrossRef]
  16. Zahraie, B.; Nasseri, M.; Nematizadeh, F. Exploring spatiotemporal meteorological correlations for basin scale meteorological drought forecasting using data mining methods. Arab. J. Geosci. 2017, 10, 419. [Google Scholar] [CrossRef]
  17. Chen, M.; Ning, S.; Cui, Y.; Jin, J.; Zhou, Y.; Wu, C. Quantitative assessment and diagnosis for regional agricultural drought resilience based on set pair analysis and connection entropy. Entropy 2019, 21, 373. [Google Scholar] [CrossRef] [Green Version]
  18. Deo, R.C.; Şahin, M. Application of the Artificial Neural Network model for prediction of monthly Standardized Precipitation and Evapotranspiration Index using hydrometeorological parameters and climate indices in eastern Australia. Atmos. Res. 2015, 161, 65–81. [Google Scholar] [CrossRef]
  19. Maca, P.; Pech, P. Forecasting SPEI and SPI drought indices using the integrated artificial neural networks. Comput. Intell. Neurosci. 2016, 2016, 3868519. [Google Scholar] [CrossRef] [Green Version]
  20. Soh, Y.W.; Koo, C.H.; Huang, Y.F.; Fung, K.F. Application of artificial intelligence models for the prediction of standardized precipitation evapotranspiration index (SPEI) at Langat River Basin, Malaysia. Comput. Electron. Agric. 2018, 144, 164–173. [Google Scholar] [CrossRef]
  21. Tian, Y.; Xu, Y.P.; Wang, G. Agricultural drought prediction using climate indices based on Support Vector Regression in Xiangjiang River basin. Sci. Total Environ. 2018, 622, 710–720. [Google Scholar] [CrossRef]
  22. Mohammadi, B.; Guan, Y.; Aghelpour, P.; Emamgholizadeh, S.; Pillco Zolá, R.; Zhang, D. Simulation of Titicaca Lake Water Level Fluctuations Using Hybrid Machine Learning Technique Integrated with Grey Wolf Optimizer Algorithm. Water 2020, 12, 3015. [Google Scholar] [CrossRef]
  23. Kisi, O.; Gorgij, A.D.; Zounemat-Kermani, M.; Mahdavi-Meymand, A.; Kim, S. Drought forecasting using novel heuristic methods in a semi-arid environment. J. Hydrol. 2019, 578, 124053. [Google Scholar] [CrossRef]
  24. Malik, A.; Kumar, A.; Singh, R.P. Application of heuristic approaches for prediction of hydrological drought using multi-scalar streamflow drought index. Water Resour. Manag. 2019, 33, 3985–4006. [Google Scholar] [CrossRef]
  25. Ahmadi, A.; Han, D.; Karamouz, M.; Remesan, R. Input data selection for solar radiation estimation. Hydrol. Process. 2009, 23, 2754–2764. [Google Scholar] [CrossRef]
  26. Remesan, R.; Shamim, M.A.; Han, D. Model data selection using gamma test for daily solar radiation estimation. Hydrol. Process. 2008, 22, 4301–4309. [Google Scholar] [CrossRef]
  27. Ashrafzadeh, A.; Ghorbani, M.A.; Biazar, S.M.; Yaseen, Z.M. Evaporation process modelling over northern Iran: Application of an integrative data-intelligence model with the krill herd optimization algorithm. Hydrol. Sci. J. 2019, 64, 1843–1856. [Google Scholar] [CrossRef]
  28. Khaledian, M.R.; Isazadeh, M.; Biazar, S.M.; Pham, Q.B. Simulating Caspian Sea surface water level by artificial neural network and support vector machine models. Acta Geophys. 2020, 68, 553–563. [Google Scholar] [CrossRef]
  29. Rahimi, J.; Ebrahimpour, M.; Khalili, A. Spatial changes of Extended De Martonne climatic zones affected by climate change in Iran. Theor. Appl. Climatol. 2013, 112, 409–418. [Google Scholar] [CrossRef]
  30. McKee, T.B.; Doesken, N.J.; Kleist, J. The relationship of drought frequency and duration to time scales. In Proceedings of the 8th Conference on Applied Climatology, Boston, MA, USA, 17–22 January 1993; Volume 17, pp. 179–183. [Google Scholar]
  31. Evans, D.; Jones, A.J. A proof of the Gamma test. Proc. R. Soc. A Math. Phys. Eng. Sci. 2002, 458, 2759–2799. [Google Scholar] [CrossRef]
  32. Moghaddamnia, A.; Ghafari Gousheh, M.; Piri, J.; Amin, S.; Han, D. Evaporation estimation using artificial neural networks and adaptive neuro-fuzzy inference system techniques. Adv. Water Resour. 2009, 32, 88–97. [Google Scholar] [CrossRef]
  33. Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
  34. Cheng, L.; Niu, J.; Liao, D. Entropy-based investigation on the precipitation variability over the Hexi Corridor in China. Entropy 2017, 19, 660. [Google Scholar] [CrossRef] [Green Version]
  35. Reza, F.M. An Introduction to Information Theory; Courier Corporation: North Chelmsford, MA, USA, 1994; ISBN 0486682102. [Google Scholar]
  36. Mohammadi, B.; Linh, N.T.T.; Pham, Q.B.; Ahmed, A.N.; Vojteková, J.; Guan, Y.; Abba, S.I.; El-Shafie, A. Adaptive neuro-fuzzy inference system coupled with shuffled frog leaping algorithm for predicting river streamflow time series. Hydrol. Sci. J. 2020, 65. [Google Scholar] [CrossRef]
  37. Jang, J.S.R. ANFIS: Adaptive-Network-Based Fuzzy Inference System. IEEE Trans. Syst. Man Cybern. 1993, 23, 665–685. [Google Scholar] [CrossRef]
  38. Storn, R.; Price, K. Differential Evolution—A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  39. Halabi, L.M.; Mekhilef, S.; Hossain, M. Performance evaluation of hybrid adaptive neuro-fuzzy inference system models for predicting monthly global solar radiation. Appl. Energy 2018, 213, 247–261. [Google Scholar] [CrossRef]
  40. Holland, J.H. Adaptation in Natural and Artificial Systems; MT Press: Berkeley, CA, USA, 2019. [Google Scholar]
  41. Jahani, B.; Mohammadi, B. A comparison between the application of empirical and ANN methods for estimation of daily global solar radiation in Iran. Theor. Appl. Climatol. 2019, 137, 1257–1269. [Google Scholar] [CrossRef]
  42. Jaramillo, J.H.; Bhadury, J.; Batta, R. On the use of genetic algorithms to solve location problems. Comput. Oper. Res. 2002, 29, 761–779. [Google Scholar] [CrossRef]
  43. Eberhart, R.; Kennedy, J. New optimizer using particle swarm theory. In Proceedings of the International Symposium on Micro Machine and Human Science, Nagoya, Japan, 4–6 October 1995. [Google Scholar]
  44. Mohammadi, B.; Ahmadi, F.; Mehdizadeh, S.; Guan, Y.; Pham, Q.B.; Linh, N.T.T.; Tri, D.Q. Developing Novel Robust Models to Improve the Accuracy of Daily Streamflow Modeling. Water Resour. Manag. 2020, 34, 3387–3409. [Google Scholar] [CrossRef]
  45. Ivakhnenko, A.G. Heuristic self-organization in problems of engineering cybernetics. Automatica 1970, 6, 207–219. [Google Scholar] [CrossRef]
  46. Aghelpour, P.; Varshavian, V. Evaluation of stochastic and artificial intelligence models in modeling and predicting of river daily flow time series. Stoch. Environ. Res. Risk Assess. 2020, 34, 33–50. [Google Scholar] [CrossRef]
  47. Ashrafzadeh, A.; Kişi, O.; Aghelpour, P.; Biazar, S.M.; Masouleh, M.A. Comparative study of time series models, support vector machines, and GMDH in forecasting long-term evapotranspiration rates in northern Iran. J. Irrig. Drain. Eng. 2020, 146, 4020010. [Google Scholar] [CrossRef]
  48. Aghelpour, P.; Guan, Y.; Bahrami-Pichaghchi, H.; Mohammadi, B.; Kisi, O.; Zhang, D. Using the MODIS Sensor for Snow Cover Modeling and the Assessment of Drought Effects on Snow Cover in a Mountainous Area. Remote Sens. 2020, 12, 3437. [Google Scholar] [CrossRef]
  49. Araghinejad, S. Data-Driven Modeling: Using MATLAB® in Water Resources and Environmental Engineering; Springer Science and Business Media: Berlin, Germany, 2014; Volume 67. [Google Scholar]
  50. Houichi, L.; Dechemi, N.; Heddam, S.; Achour, B. An evaluation of ANN methods for estimating the lengths of hydraulic jumps in U-shaped channel. J. Hydroinform. 2013, 15, 147–154. [Google Scholar] [CrossRef]
  51. Suykens, J.A.K.; Vandewalle, J. Least squares support vector machine classifiers. Neural Process. Lett. 1999, 9, 293–300. [Google Scholar] [CrossRef]
  52. Mohammadi, B.; Mehdizadeh, S. Modeling daily reference evapotranspiration via a novel approach based on support vector regression coupled with whale optimization algorithm. Agric. Water Manag. 2020, 237, 106145. [Google Scholar] [CrossRef]
  53. Moazenzadeh, R.; Mohammadi, B. Assessment of bio-inspired metaheuristic optimisation algorithms for estimating soil temperature. Geoderma 2019, 353, 152–171. [Google Scholar] [CrossRef]
  54. Mohammadi, B.; Aghashariatmadari, Z. Estimation of solar radiation using neighboring stations through hybrid support vector regression boosted by Krill Herd algorithm. Arab. J. Geosci. 2020, 13, 1–16. [Google Scholar] [CrossRef]
  55. Guan, Y.; Mohammadi, B.; Pham, Q.B.; Adarsh, S.; Balkhair, K.S.; Rahman, K.U.; Linh, N.T.T.; Tri, D.Q. A novel approach for predicting daily pan evaporation in the coastal regions of Iran using support vector regression coupled with krill herd algorithm model. Theor. Appl. Climatol. 2020, 142, 349–367. [Google Scholar] [CrossRef]
  56. Biazar, S.M.; Fard, A.F.; Singh, V.P.; Dinpashoh, Y.; Majnooni-Heris, A. Estimation of Evaporation from Saline-Water with More Efficient Input Variables. Pure Appl. Geophys. 2020, 177, 5599–5619. [Google Scholar] [CrossRef]
  57. Biazar, S.M.; Fard, A.F.; Singh, V.P.; Dinpashoh, Y.; Majnooni-Heris, A. Estimation of evaporation from saline water. Environ. Monit. Assess. 2020, 192, 1–17. [Google Scholar] [CrossRef] [PubMed]
  58. Aghelpour, P.; Mohammadi, B.; Biazar, S.M. Long-term monthly average temperature forecasting in some climate types of Iran, using the models SARIMA, SVR, and SVR-FA. Theor. Appl. Climatol. 2019, 138, 1471–1480. [Google Scholar] [CrossRef]
Figure 1. Situation of the intended stations in Iran.
Figure 1. Situation of the intended stations in Iran.
Ijgi 09 00701 g001
Figure 2. Schematic form of adaptive neuro-fuzzy inference system (ANFIS) merging with the algorithms differential evolution (DE), genetic algorithm (GA) and particle swarm optimization (PSO).
Figure 2. Schematic form of adaptive neuro-fuzzy inference system (ANFIS) merging with the algorithms differential evolution (DE), genetic algorithm (GA) and particle swarm optimization (PSO).
Ijgi 09 00701 g002
Figure 3. The variation of forecasts and observations for the JDI prediction in all of the stations; The numbers of the subplots refer to the stations including Anar (1), Biarjmand (2), Boshrouyeh (3), East Isfahan (4), Isfahan (5), Kabootarabad (6), Kashan (7), Marvast (8), Naein (9) and Yazd (10).
Figure 3. The variation of forecasts and observations for the JDI prediction in all of the stations; The numbers of the subplots refer to the stations including Anar (1), Biarjmand (2), Boshrouyeh (3), East Isfahan (4), Isfahan (5), Kabootarabad (6), Kashan (7), Marvast (8), Naein (9) and Yazd (10).
Ijgi 09 00701 g003
Figure 4. The variation of forecasts and observations for the MSPI prediction in all of the stations; The numbers of the subplots refer to the stations including Anar (1), Biarjmand (2), Boshrouyeh (3), East Isfahan (4), Isfahan (5), Kabootarabad (6), Kashan (7), Marvast (8), Naein (9) and Yazd (10).
Figure 4. The variation of forecasts and observations for the MSPI prediction in all of the stations; The numbers of the subplots refer to the stations including Anar (1), Biarjmand (2), Boshrouyeh (3), East Isfahan (4), Isfahan (5), Kabootarabad (6), Kashan (7), Marvast (8), Naein (9) and Yazd (10).
Ijgi 09 00701 g004
Figure 5. Scatter plots in comparing the performances of the gamma test and entropy theory for MSPI and JDI prediction (Marvast station).
Figure 5. Scatter plots in comparing the performances of the gamma test and entropy theory for MSPI and JDI prediction (Marvast station).
Ijgi 09 00701 g005
Figure 6. Taylor diagrams of the model prediction; Total evaluation of JDI’s predictions by inputs of gamma test (1) and entropy theory (2) and total evaluation of MSPI’s predictions by inputs of gamma test (3) and entropy theory (4).
Figure 6. Taylor diagrams of the model prediction; Total evaluation of JDI’s predictions by inputs of gamma test (1) and entropy theory (2) and total evaluation of MSPI’s predictions by inputs of gamma test (3) and entropy theory (4).
Ijgi 09 00701 g006
Figure 7. Violin plots of the errors; Total evaluation of JDI’s prediction error resulted by inputs of gamma test (1) and entropy theory (2) and total evaluation of MSPI’s prediction error resulted by inputs of gamma test (3) and entropy theory (4).
Figure 7. Violin plots of the errors; Total evaluation of JDI’s prediction error resulted by inputs of gamma test (1) and entropy theory (2) and total evaluation of MSPI’s prediction error resulted by inputs of gamma test (3) and entropy theory (4).
Ijgi 09 00701 g007aIjgi 09 00701 g007b
Figure 8. Root mean squared error (RMSE) values in drought classes; JDI’s prediction resulted by inputs of gamma test (1) and entropy theory (2) and MSPI’s prediction resulted by inputs of gamma test (3) and entropy theory (4).
Figure 8. Root mean squared error (RMSE) values in drought classes; JDI’s prediction resulted by inputs of gamma test (1) and entropy theory (2) and MSPI’s prediction resulted by inputs of gamma test (3) and entropy theory (4).
Ijgi 09 00701 g008
Table 1. Synoptic stations’ position and data specification.
Table 1. Synoptic stations’ position and data specification.
StationXYZPeriodMeanSt. Dev. (a)Max. (b)Min. (c)Skew. (d)Annual Mean
P (e)
(mm)
T (f)
(°C)
P
(mm)
T
(°C)
P
(mm)
T
(°C)
P
(mm)
T
(°C)
P
(mm)
T
(°C)
P
(mm)
T
(°C)
Anar30.8855.251408.801986–20175.8820.3210.698.9392.1035.500.000.702.86−0.0870.5620.32
Biarjmand36.0555.831106.201992–201710.1416.3613.779.7185.3032.100.00−4.402.48−0.11121.6816.36
Boshrouyeh33.9057.45885.001988–20177.1721.0911.3910.3378.7036.600.00−5.802.21−0.1586.0421.09
East Isfahan32.6751.871543.001980–20178.4415.4212.569.5586.0031.700.00−2.602.20−0.03101.2815.42
Isfahan32.6251.671550.401951–201710.4116.4515.599.26148.2032.000.00−3.402.51−0.05124.9216.45
Kabootarabad32.5251.851545.001992–20179.4117.9313.549.5869.0033.200.00−2.101.81−0.04112.9217.93
Kashan33.9851.45982.301967–201711.1019.8216.3810.29124.1037.500.00−5.102.51−0.08133.2019.82
Marvast30.5054.251546.601997–20175.3419.8510.148.9364.8034.200.003.602.69−0.0364.0819.85
Naein32.8553.081549.001993–20177.8718.6612.449.3585.6034.300.00−3.402.55−0.1094.4418.66
Yazd31.9054.281237.201953–20174.7219.598.769.5669.2036.100.00−1.103.11−0.0856.6419.59
(a) St. Dev. = standard deviation; (b) Max. = maximum; (c) Min. = minimum; (d) Skew. = skewness; (e) P = precipitation; (f) T = temperature.
Table 2. Classification of the SPI’s values and corresponding probability limits [9,29].
Table 2. Classification of the SPI’s values and corresponding probability limits [9,29].
SPI ClassesProbability LimitsDescription
SPI ≥ 2≥97%Extremely wet
2 > SPI ≥ 1.593.3–97.7%Severely wet
1.5 > SPI ≥ 184.1–93.3%Moderately wet
1 > SPI > −115.9–84.1%Normal
−1 ≥ SPI > −1.56.7–15.9%Moderately dry
−1.5 ≥ SPI > −22.3–6.7%Severely dry
−2 ≥ SPI2.3% ≥Extremely dry
Table 3. Chosen inputs by gamma test (●) and entropy theory (∆) for joint deficit index (JDI).
Table 3. Chosen inputs by gamma test (●) and entropy theory (∆) for joint deficit index (JDI).
StationVariableTime Lags (i)
123456789101112
AnarP(a)t-i●∆●∆●∆●∆●∆●∆ ●∆
T(b)t-i●∆●∆●∆●∆●∆●∆●∆●∆●∆
JDIt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
BiarjmandPt-i●∆ ●∆●∆
Tt-i●∆●∆●∆●∆●∆●∆
JDIt-i●∆●∆●∆●∆
BoshrouyehPt-i●∆●∆●∆ ●∆ ●∆
Tt-i
JDIt-i
East of IsfahanPt-i●∆●∆●∆ ●∆
Tt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆
JDIt-i●∆●∆●∆●∆●∆●∆●∆
IsfahanPt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i●∆●∆●∆
JDIt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
KabootarabadPt-i●∆●∆●∆●∆
Tt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆
JDIt-i●∆●∆●∆●∆●∆
KashanPt-i●∆●∆ ●∆ ●∆●∆
Tt-i●∆
JDIt-i●∆●∆●∆●∆●∆●∆
MarvastPt-i●∆●∆●∆●∆●∆ ●∆
Tt-i●∆●∆●∆●∆●∆
JDIt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
NaeinPt-i●∆●∆●∆●∆●∆ ●∆●∆●∆●∆●∆
Tt-i
JDIt-i
YazdPt-i●∆●∆●∆●∆●∆ ●∆
Tt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
JDIt-i●∆●∆●∆●∆●∆●∆●∆●∆
(a) P = precipitation; (b) T = temperature.
Table 4. Chosen inputs by gamma test (●) and entropy theory (∆) for multivariate standardized precipitation index (MSPI).
Table 4. Chosen inputs by gamma test (●) and entropy theory (∆) for multivariate standardized precipitation index (MSPI).
StationVariableTime Lags (i)
123456789101112
AnarP(a)t-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆ ●∆
T(b)t-i ●∆●∆●∆●∆●∆
MSPIt-i●∆●∆●∆●∆●∆●∆●∆
BiarjmandPt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i ●∆●∆●∆●∆●∆●∆●∆
MSPIt-i●∆●∆●∆●∆●∆●∆
BoshrouyehPt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i ●∆ ●∆
MSPIt-i●∆●∆●∆
East of IsfahanPt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i ●∆●∆●∆●∆●∆
MSPIt-i●∆●∆●∆●∆●∆●∆●∆●∆
IsfahanPt-i●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i●∆●∆
MSPIt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
KabootarabadPt-i●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i●∆●∆●∆●∆
MSPIt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
KashanPt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i
MSPIt-i
MarvastPt-i●∆●∆●∆●∆●∆●∆●∆ ●∆
Tt-i ●∆
MSPIt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
NaeinPt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
Tt-i
MSPIt-i●∆●∆●∆●∆
YazdPt-i●∆●∆●∆●∆●∆●∆●∆●∆ ●∆●∆
Tt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
MSPIt-i●∆●∆●∆●∆●∆●∆●∆●∆●∆●∆
(a) P = precipitation; (b) T = temperature.
Table 5. Evaluation criteria for the test period of the models in predicting JDI.
Table 5. Evaluation criteria for the test period of the models in predicting JDI.
StationInput
Definer
ANFISANFIS-DEANFIS-GAANFIS-PSOGMDHGRNNLSSVM
MAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWI
AnarGamma0.5620.7600.7600.5250.6540.7780.5430.6760.7620.5760.7430.7760.4150.5340.8550.5590.6900.6780.4610.6050.803
Entropy0.5760.9210.6780.5120.6640.7760.5130.6670.7780.5030.6960.7840.3980.5230.8620.5440.6750.6830.4680.6180.797
BiarjmandGamma0.6730.9000.6810.5910.7410.7250.5740.7350.7270.6100.7980.7380.5430.6930.7930.7100.8500.5480.6150.7490.697
Entropy0.6820.8670.7310.5820.7650.7280.5750.7460.7300.6240.8250.7120.5310.6770.7900.7500.8900.4970.6070.7430.713
BoshrouyehGamma0.6751.1660.5010.6520.8020.5020.6930.8180.5010.7290.8710.4500.6070.7220.5050.6740.8180.4340.7210.8500.520
Entropy0.7841.1490.4840.4710.5930.7310.5020.6420.7190.5840.7490.6870.3380.4560.8640.6610.7770.5640.4910.6440.739
East IsfahanGamma0.6261.2070.6740.5170.7230.6990.5240.7370.6940.5940.8260.6440.4660.6530.7800.6490.8510.5360.5310.7310.670
Entropy0.6961.1980.5670.5590.7580.6850.5550.7470.6960.6090.8150.6640.4530.6450.7920.6610.8240.5530.5410.7340.684
IsfahanGamma0.7730.9240.7700.5730.7710.7880.5660.7680.7840.6090.8230.7640.5340.7210.8030.7060.9040.6300.5720.7700.777
Entropy0.8231.0720.6960.5700.7770.7860.5880.7990.7820.6290.8390.7560.5340.7340.7940.7070.9010.6180.5740.7740.772
KabootarabadGamma0.6480.8870.6380.5990.7700.6110.5900.7640.6140.6010.8080.6220.4910.6560.7390.6300.7460.6110.5730.7400.620
Entropy0.6980.8540.7270.5250.6840.7120.5510.7240.7040.5270.7060.7550.3910.5550.8230.5830.7330.5990.5110.6690.703
KashanGamma0.6040.8290.6500.4890.6580.6980.4860.6560.7170.5210.7240.6640.4400.6190.7610.6340.7860.2360.4740.6430.716
Entropy0.6520.8400.6520.4910.6760.7010.4950.6960.6900.5740.7920.6380.4550.6150.7450.5850.7200.5430.4900.6560.691
MarvastGamma0.7941.1340.7020.6480.8140.7850.6110.7660.8050.7550.9570.7120.4610.5760.8840.8451.0210.5980.6610.8120.711
Entropy0.8721.0920.6990.6280.7680.7820.6230.7800.7830.6550.8560.7490.4210.5420.8990.8491.0030.6150.6820.8430.689
NaeinGamma0.7640.9850.6690.6940.8740.5330.6890.8580.5390.6970.8900.5990.5570.7110.7320.7480.9070.4690.6550.8310.620
Entropy0.8770.9580.6150.5730.7370.7340.6070.8030.6790.7871.1210.5070.4440.5820.8370.7460.8810.3680.5580.7230.706
YazdGamma0.7480.8150.7210.5480.6890.6090.4760.6150.7350.4450.5950.8080.4610.6110.7700.5220.6570.6930.4510.6010.777
Entropy0.8510.7240.7340.5070.6510.7130.4780.6270.7540.4750.6280.7600.3990.5510.8350.5280.6690.6870.4310.5780.798
The bold numbers are the best performance of each station, on its own row.
Table 6. Evaluation criteria for the test period of the models in predicting MSPI.
Table 6. Evaluation criteria for the test period of the models in predicting MSPI.
StationInput
Definer
ANFISANFIS-DEANFIS-GAANFIS-PSOGMDHGRNNLSSVM
MAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWIMAERMSEWI
AnarGamma0.5820.7420.8860.4660.6270.8940.4870.6770.8690.5300.7680.8480.3810.4980.9430.6710.9100.6740.5740.8340.768
Entropy0.5960.7930.8330.4170.5620.9190.4340.5900.9100.5620.8530.7870.2710.4090.9640.6680.8840.7120.5100.7670.816
BiarjmandGamma0.6840.8820.8290.4670.6390.8790.4740.6560.8700.5840.8180.8200.4200.5640.9170.6990.8990.6360.4810.6390.873
Entropy0.6940.8240.8590.4870.6560.8760.5010.6550.8760.5700.7370.8430.3980.5680.9110.7310.9420.5310.4870.6480.872
BoshrouyehGamma1.2351.0860.7720.3950.5350.9100.3860.5280.9120.4590.6490.8690.3250.4760.9380.7721.0680.4650.4360.6000.873
Entropy1.3571.0800.7790.4480.6090.8750.4460.6060.8760.5350.8210.8070.3970.5710.8970.7521.0420.4710.5370.7340.799
East IsfahanGamma0.5860.9050.7810.3790.5520.8980.3770.5490.9000.4160.6150.8750.3220.4980.9160.5260.7010.7740.3650.5300.901
Entropy0.6540.8810.7700.3760.5630.8940.3650.5390.9020.4530.6480.8690.3270.4990.9100.5320.6740.8230.3720.5410.899
IsfahanGamma0.6370.6860.8790.3540.5050.9290.3520.5070.9290.3840.5440.9190.3330.4920.9320.5500.7110.8290.3570.5070.928
Entropy0.6830.7630.8640.3520.5080.9280.3570.5110.9280.3830.5510.9150.3300.4920.9350.5600.7230.8210.3640.5140.924
KabootarabadGamma0.6230.7760.8210.4270.5750.8630.4570.6390.8300.5290.7900.7260.3470.4740.9080.5020.7040.7360.4290.5960.841
Entropy0.6660.6650.8590.4210.5730.8620.4470.5770.8660.4240.5520.8880.3350.4540.9190.4860.6770.7610.4300.5910.843
KashanGamma0.7070.6130.7990.4340.5860.7420.4320.6000.7370.4560.6080.7580.4790.6200.7070.4520.6030.7470.4250.5740.788
Entropy0.5880.6880.7580.3470.5020.8420.3420.5010.8440.3800.5370.8260.3260.4640.8660.4250.5510.7820.3330.4940.844
MarvastGamma0.5900.8820.7830.3820.4900.9160.3680.4630.9240.4280.6340.8770.2960.3890.9510.8020.9530.6330.3950.4920.910
Entropy0.6400.8190.7960.3820.4770.9200.3820.4770.9200.4160.5470.9080.2800.3740.9550.7410.9060.6800.3790.4790.913
NaeinGamma0.6430.8880.8050.5900.7700.7700.5760.7650.7750.6340.8270.7610.4560.6370.8550.6790.8380.6680.6130.8020.743
Entropy0.7240.8920.7670.3620.5280.9100.3630.5180.9150.5210.7730.8290.2900.4580.9380.6920.8470.6510.3820.5500.897
YazdGamma0.5920.6660.8050.3060.4620.8970.3060.4620.8970.3000.5020.8810.2530.3890.9290.4650.6030.7640.3060.4580.896
Entropy0.6330.6270.8440.3070.4630.8970.3050.4610.9000.2930.4600.9010.2570.4190.9170.4700.6100.7640.3050.4580.897
The bold numbers are the best performance of each station, on its own row.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Aghelpour, P.; Mohammadi, B.; Biazar, S.M.; Kisi, O.; Sourmirinezhad, Z. A Theoretical Approach for Forecasting Different Types of Drought Simultaneously, Using Entropy Theory and Machine-Learning Methods. ISPRS Int. J. Geo-Inf. 2020, 9, 701. https://doi.org/10.3390/ijgi9120701

AMA Style

Aghelpour P, Mohammadi B, Biazar SM, Kisi O, Sourmirinezhad Z. A Theoretical Approach for Forecasting Different Types of Drought Simultaneously, Using Entropy Theory and Machine-Learning Methods. ISPRS International Journal of Geo-Information. 2020; 9(12):701. https://doi.org/10.3390/ijgi9120701

Chicago/Turabian Style

Aghelpour, Pouya, Babak Mohammadi, Seyed Mostafa Biazar, Ozgur Kisi, and Zohreh Sourmirinezhad. 2020. "A Theoretical Approach for Forecasting Different Types of Drought Simultaneously, Using Entropy Theory and Machine-Learning Methods" ISPRS International Journal of Geo-Information 9, no. 12: 701. https://doi.org/10.3390/ijgi9120701

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop