1. Introduction
In the wake of what has happened in the past two years, spring and summer of 2024 were again warm and dry in many European regions [
1]. Global warming and increased rainfall extremes are two of the main hallmarks of climate change. At the same time, future projections indicate that precipitation will likely decrease in many areas, enhancing the risk of drought [
2].
Drought exhibits a high variability at both temporal and spatial scales; therefore, there is a great uncertainty in capturing trends compared to other natural hazards. Results of recent studies are quite contradictory at the global scale, while at the pan-European scale, a slightly increasing tendency has been identified in the last decades [
3]. At the continental scale, trend studies have shown a decreasing trend in Northern Europe and an increasing trend in Southern Europe. In particular, in the Mediterranean coastal area, the synergy between reduced precipitation and increased evapotranspiration fostered by higher temperature will affect the hydrological balance, leading to decline in water availability [
2]. Droughts are projected to become more severe, more frequent and longer under moderate emission scenarios and strongly enhanced under severe emission scenarios ([
2] and references therein). Therefore, as spatial scale is one of the main sources of uncertainty, the reconstruction of climate evolution at high spatial resolution is crucial for drought variability assessment and consequently for risk mitigation, preparedness and response [
2,
4].
In recent research, droughts are usually subdivided into four types according to their duration and impact on different systems: meteorological, agricultural, hydrological and socio-economic drought [
5]. This study deals mainly with hydrological type because only meteorological variables are used as input (precipitation and temperature), and data analysis is focused on a 12-month accumulation period, a compromise independent of individual seasons and regimes, which allows for the representation of water shortages caused by a lack of precipitation and/or high temperatures over an entire year. Besides the spatial scale, the selected time period is another important variable, together with the drought index formulation.
In the past, several indices were proposed to objectively quantify and compare droughts in different climatic and hydrological regimes. A detailed review can be found in Mishra and Singh [
6]. Among the existing indices, the standard precipitation index (SPI) [
7] and the standardized precipitation evapotranspiration index (SPEI) [
8] are the most widely used, in particular in the case of datasets from the early instrumental period when precipitation and temperature were the only variables available. While SPI considers precipitation only, SPEI is based on water balance (BAL), i.e., the difference between precipitation and potential evapotranspiration (PET). Precipitation is certainly the main driver of drought, but the role of temperature cannot be neglected, in particular in the Mediterranean region, which can be considered a global warming hotspot [
9].
Both SPI and SPEI are probability-based indices that consider the distribution of precipitation and water balance, respectively. The choice of the most appropriate probability distribution function for precipitation and water balance is crucial, as an improper distribution may impart bias on the index values, exaggerating or minimizing the severity of drought events. Several probability distribution functions (PDFs) have been tested in the literature, but a general consensus has not been reached, as the best-performing distribution depends on location, spatial scale and accumulation period. When working in an area not previously explored or with a new dataset, extensive and rigorous statistical testing is recommended.
Stagge et al. [
10] compared a suite of candidate PDFs and used a new method based on the Shapiro–Wilk (SW) test to evaluate the goodness of fit. Relative distribution ranking was determined by the Akaike information criterion (AIC). For regional studies in Europe, they recommended the two-parameter Gamma distribution when calculating SPI, in agreement with previous studies [
7,
11]. Instead, when computing SPEI, the use of the generalized extreme value (GEV) was a better alternative specifically for the European domain than the three-parameter log-logistic (Log-log3) distribution mostly adopted in the literature [
8,
12]. Angelidis et al. [
13] analyzed a case study in Portugal and found different PDFs depending on the accumulation period, i.e., gamma for time scales up to 6 months and log-normal for 12 months. According to Pieper et al. [
14], the exponentiated Weibull distribution for SPI performs proficiently in simulations and observations for every common accumulation period and virtually everywhere around the globe. The AIC was used to penalize unnecessary complexity of the candidate PDFs. Concerning SPEI, Monish et al. [
15] found that Pearson type 3 (PE3) outperforms the other distributions, especially for shorter timescales, while GEV performs well for longer timescales for most parts of India. The findings of Wang et al. [
16] confirm the choice of the Log-log3 distribution for around 500 stations in China. Ramezani et al. [
17] examined 65 PDFs for the estimation of SPEI in eastern Iran, and the results of Anderson–Darling (AD), Kolmogorov–Smirnov (KS) and chi-square tests indicated the four-parameter Burr distribution as better performing than the Log-log3 distribution. The generalized logistic distribution (Genlog) was the most appropriate for a case study in Ethiopia that included 125 stations and a 30-year dataset [
18].
Passing to the national scale, in Italy, recent studies have focused on the analysis of drought frequency and intensity variations without addressing the issue of the distribution function of drought indices and instead simply adopting the standard approach (SA) in the literature, i.e., gamma and Log-log3 distributions for SPI and SPEI, respectively. Moreover, these studies deal mainly with drought characterization for central and southern regions [
19,
20,
21] and less so for northern ones [
22,
23]. Moccia et al. [
24] tested different PDFs on a 50-year dataset containing records from all over Italy. The log-normal distribution was the best-fitting model to describe almost all the monthly precipitation samples, followed by Weibull (for 1-month scale) and gamma. The fitting performance was evaluated with a modified mean square error normalized (MSEN) and the normality with the SW test.
Concerning drought trends, the two studies focused on northern Italy [
22,
23] are in accordance with the Europe-wide analysis, which shows increasing drought frequencies in Southern Europe [
3]. Nevertheless, drought characteristics, mainly intensity, strongly depend on the geographic area, as well as the spanned period and temporal resolution. Both studies cover the period from around the mid-20th century to the early decades of the 21st century and are based on both SPI and SPEI. Crespi et al. [
23] analyzed a portion of the Po Plain in the Lombardy region and found drying tendencies, in particular in summer, in the southern and western parts of the domain. The more negative trends of SPEI than of SPI were ascribed to the increasing role of evapotranspiration over recent decades triggered by rising temperature. According to Baronetti et al. [
22], since 2001, drought episodes in the Po Valley have increased in terms of frequency and duration, and they seem to be mostly related to changes in the intra-annual precipitation distribution.
The present study fits into this context with the general aim to improve the understanding of drought variability in Padua, Italy, over three centuries. The Padua precipitation and temperature series are the longest ones available in Italy and are used for the first time for drought assessment.
The specific aims are as follows:
To assess the best-performing PDFs for use in the SPI and SPEI normalization of the multi-secular series of temperature and precipitation in Padua;
To investigate the impact of the best-fit approach (BFA) compared to the standard approach (SA) on identifying drought characteristics, i.e., duration, severity and intensity;
To analyze the trends in drought patterns, extreme events and periodicity.
The paper is organized as follows:
Section 2 is devoted to a description of the study site, the datasets and the methodology used for data analysis, divided into several steps, each with a specific purpose; results are discussed in
Section 3, following the general lines of the methodology. Finally, conclusions are drawn in
Section 4.
2. Materials and Methods
2.1. Study Site and Climatic Data
Padua is a city located on the eastern side of the Po Valley, about 40 km west of Venice and the Adriatic Sea (
Figure 1). The municipal territory covers 93 km
2 entirely, with a mean altitude of 12 m a.s.l.
The climate in Padua is classified as humid subtropical (Köppen climate classification Cfa) with cold winters and hot summers, frequently associated with air stagnation [
25].
Although Padua is an urban center, agriculture is intensively practiced in the suburbs that surround the historic center. In terms of production, a prevalence of arable farming is observed, and most of the orchards and vineyards have a family character. As in other parts of the Po Valley, agriculture in Padua is largely dependent on irrigation and, therefore, is strongly influenced by droughts.
The datasets used to assess drought variability in Padua consist of the unbroken series of daily temperature and precipitation, the longest in Italy, covering nearly 300 years, from 1725 to the present day. The original data were recovered, corrected and homogenized with the help of a careful interpretation of the metadata during a demanding study that started around 40 years ago and has been continued up to now. In particular, the bias due to changes in instruments, exposure location, height above the ground and observation protocols were corrected. Details can be found in the rich literary production [
26,
27,
28,
29,
30,
31,
32,
33] and references therein. The final composition of the temperature series was presented by Stefanini et al. [
33]. The comparison with independent datasets, such as data collected at other Italian meteorological stations (Milan, Bologna, Turin, from 100 to 300 km away from Padua) and a reanalysis dataset (ModE-RA), confirmed the general trend and anomalies, assessing the reliability of the Padua temperature series. The completion of the precipitation series involved further work, described here for the first time (
Section 3.1). The assessment of the accuracy of the precipitation series is much more difficult than that for temperature. Given the high variability in precipitation, only stations very close to Padua can be considered, and unfortunately, no station near Padua provides a dataset covering nearly three centuries.
2.2. Methodology for Calculation of Drought Indices
As monthly data are required for the calculation of SPI and SPEI, daily precipitation amounts were summed, and monthly and daily temperature values were averaged month by month. The potential evapotranspiration (PET) was calculated using Thornthwaite’s equation [
34]. It was the only possible solution considering the available data, since it requires only mean temperature values. Other methods require observations of other variables, such as daily minimum and maximum temperatures, vapor pressure and wind speed, which are not available for the early instrumental period.
The evaluation of the best-performing distribution function is the key point in the methodology followed to calculate the drought indices, as the subsequent climatic analysis is based on the index values. The overall methodology is visualized in
Figure 2 and includes the main steps described hereunder. The input datasets of the process are the monthly precipitation totals for SPI and the monthly water balance (BAL) for SPEI.
Application of a fitting distribution to the input dataset;
Calculation of the drought index;
If the index results are normally distributed, go to the next step; otherwise, try another distribution function;
Application of goodness-of-fit tests to the normally distributed index values.
SPI and SPEI are calculated at a monthly temporal resolution, and the accumulation periods considered are 3, 6 and 12 months. The same nomenclature is used as in previous studies, i.e., the index acronym is followed by a number indicating the accumulation period, e.g., SPI3 of a certain month is calculated cumulating the precipitation over the 3-month period ending in that month. Similarly, SPI12 of a certain month is calculated cumulating the precipitation over the 1-year period ending in that month.
The end of each run of the process is the dataset of the values of one index, calculated with the PDF that (i) provides a normally distributed index and (ii) exhibits the best-fitting parameters. The process was applied separately to each index (i.e., SPI and SPEI) for every accumulation period k, first considering the whole dataset (i.e., SPIk and SPEIk, where k = 3, 6, 12) and then separating each month (i.e., SPIj,k, SPEIj,k, where k = 3, 6, 12 and j = 1,…,12).
2.3. Goodness-of-Fit and Normality Tests
In addition to the PDFs recommended in previous SPI/SPEI studies (see
Section 1), new distributions were also considered, limiting the number of parameters to two for SPI and three for SPEI. Some functions were directly available in the R extraDist package [
35], while new scripts were created to calculate the others. Due to the length of the input series, the whole period from 1725 to 2023 was chosen as the reference period for both precipitation and temperature.
The distribution that fit the data best was assessed using the modified mean square error normalized function (MSEN) [
24,
36,
37,
38]. This method has the main advantage that it allows for the simultaneous estimation of the unknown parameters and the identification of the best-performing distribution between the candidates. The AIC was also applied to provide robustness to the results. Once the best PDF was identified for each input series, the SW normality test [
39] was applied to the calculated values of SPI and SPEI. The distributions that yielded values of SPI and SPEI meeting the requirements described by Wu et al. [
40] were considered non-normal and consequently rejected.
2.4. Standard and Best-Fit Approaches
At this point, it is possible to distinguish between two types of approach for calculating the drought indices, i.e., the standard approach (SA) and the best-fit approach (BFA). The SA foresees the use the same PDF for the whole input series, which is gamma for SPI and log-logistic 3 for SPEI. The BFA instead considers the best-fit distribution. This approach can be further split into two: one case considers the same function for all the months, called “general” in the text and indicated as BFAg; the other uses a different PDF for each month, named “monthly” and indicated as BFAm.
Drought and wet events are classified based on SPI/SPEI values [
7] as reported in
Table 1.
The impact of the different approaches on the main drought characteristics, i.e., duration, severity, intensity and frequency, was then evaluated. Drought events were identified following the definition of McKee et al. [
7], i.e., a drought starts in the month (included) when the index value falls below −1 and ends in the month (not included) when its value becomes positive for at least two consecutive months. Duration (D) refers to the period from the beginning to the end of a drought event and is expressed in months. Drought severity (S) is the sum of SPI/SPEI values within the period D, and intensity (I) is the average of SPI/SPEI values, i.e., the ratio of severity and duration. Finally, the number of drought events in a period is defined as the drought frequency (F).
2.5. Trend and Periodicity
One of the most commonly used trend tests is the Mann–Kendall (MK) test [
41,
42], the power of which depends on the preassigned significance level, magnitude of the trend, sample size and amount of variation within a time series [
43]. Moreover, its application is limited by a set of restrictive assumptions, e.g., independent structure of the time series, normal distribution and data length. Sen [
44] introduced a new simple and immediate method to investigate trends in time series, free from the abovementioned assumptions. This innovative trend analysis (ITA) has been already applied to drought trend assessment [
45,
46]. The procedure is quite simple: the time series is divided into two equal parts, and each one sorted in ascending order separately. Then, a scatter plot of the two parts is created, with the first half on the
X-axis and the second half on the
Y-axis. Data points plotted on the 1:1 (45°) ideal line indicate no trend in the time series. There is an increasing or decreasing trend if data are plotted above or below the ideal line, respectively. In addition, the distance from the 1:1 line indicates trend acceleration or strength. Both the MK and ITA tests were applied to assess the presence of trends in the Padua drought series, and the results were compared.
Before performing ITA, the presence of change points in the SPI and SPEI series was investigated using the standard normal homogeneity test (SNH) [
47], a test sensitive to the beginning and end of the series. The SNH test was successfully applied to the temperature and precipitation time series for Padua [
26,
32].
Finally, a continuous wavelet transform (CWT) analysis was performed to examine the periodicities in drought index time series [
48,
49]. The wavelet spectrum was evaluated using the R package biwavelet [
50], while the red-noise spectrum was analyzed using the R package dplR [
51].
2.6. Critical Drought Intensity–Duration–Frequency (IDF) Curves
Drought IDF curves are helpful in evaluating the relation between intensity and frequency for several drought durations, providing useful information about drought events using a single graph that is easily understandable by end users for practical purposes. Aksoy et al. [
52] developed a procedure for estimating IDF curves for critical drought and applied it to the SPI. The same method was recently applied to an SPI-based Italian case study by Moccia et al. [
24] and extended to the SPEI by Arra et al. [
53]. According to Aksoy et al. [
52], a drought of duration D is defined as critical for a certain year when it is the most severe drought in terms of the total deficit over that duration. It is likely that no drought was observed in some years; therefore, a zero value is taken for the critical drought severity in a no-drought year. Critical drought IDF curves were implemented following the procedure described in detail by Aksoy et al. [
52], which was developed through the following steps:
Identify the critical droughts of each duration D for each year;
Compose the time series of the intensities of the critical droughts for each duration D; if a drought of a certain duration was not critical for a given year, the intensity for that year is set equal to zero;
Identify the best-fit probability distribution for each time series using the MSEN method;
Calculate the intensity of critical droughts of a given duration for a fixed return period using the probability distribution identified in (3);
Perform a linear regression for each return period between the intensity and duration: , where D is the duration in months (it varies between 1 and the maximum observed), and and are the regression parameters.
Then, from the critical drought IDF curves, it was possible to find a relation between drought intensity and return period (RP) for each drought duration and thus calculate RP.
4. Conclusions
In this paper, the drought variability in Padua over nearly 300 years was investigated through the study of two well-established indices used to evaluate meteorological drought worldwide. SPI and SPEI have the advantages of being compatible with series starting in the early instrumental period, when only daily mean temperature and precipitation amount are the available variables. As a part of this study, the daily precipitation series of Padua from 1951 to 2023 was completed.
For calculating the drought indices, two alternative approaches to the standard one were considered, both based on the identification of the best-fitting distribution: one is “general”, i.e., it considers the same probability distribution function (PDF) for all the months (BFAg); the other uses a different PDF for each month (BFAm). Concerning SPI, the best PDF according to BFAg and SA was the same, i.e., gamma, for each accumulation period. For SPEI3 and SPEI6, BFAg indicated GEV, while SPEI12 indicated skew-normal instead of the standard log-logistic 3. When passing to BFAm, the results depended on month and accumulation period. In the case of SPI, in addition to gamma, log-normal and Weibull distributions appeared, respectively, for SPI3 and SPI12. Skew-normal was the best PDF for SPEI6 and SPEI12 with few exceptions, while the results for SPEI3 were heterogeneous.
Comparing the standard to the best-fit approaches, the mean bias error was significant only for SPEI. In particular, in case of SPEI3 and SPEI6, BFAg tended to estimate lower values than SA; the opposite was seen for BFAm, with the difference increasing with the accumulation period. For SPEI12, BFAg matched with BFAm. The results for the relative error in associating an index value with a particular class were the same for BFAg and BFAm. With respect to SA, the most significant errors were related to the extremes: the BFAs detected more drought events belonging to the extreme class than the SA. Coherently with the previous findings, the main statistics for SPI were similar, regardless of the approach type. In contrast, the duration and severity of droughts estimated using SPEI3 were higher for BFAm than for the other approaches. Moreover, the drought frequency according to SPEI3 and SPEI6 followed the scale of SA > BFAg > BFAm, while for SPEI12, values were similar.
The comparison between the two drought indices was performed considering both the best-fit approach applied month by month, i.e., BFAm, and the 3- and 12-month accumulation periods as representative of seasonal and annual time scales. The difference in SPI and SPEI time series showed an increase since the mid-20th century, in particular in spring and summer, and can be related to ongoing global warming. Moreover, the longest and most severe drought over the 1725–2023 period identified by the two indices was not the same: according to SPI12, it was from 1737 to 1742; according to SPEI12, it was from 2019 to 2023. This last event continued until the first months of 2024.
The innovative trend analysis indicated a general increase in drought trend, which according to SPI12, was significant only for severe events. Summer and fall were the most affected seasons.
Drought intensity–duration-frequency (IDF) curves provide a straightforward graphical representation of the relationship between intensity, duration and frequency that stakeholders, decision-makers and water resource managers can easily understand. In particular, for each duration, it was possible to calculate the return period of the critical droughts, i.e., the most severe ones. In general, the return period provided by SPEI12 was higher than the one calculated using SPI12 for the same duration. Moreover, SPI12 indicated 1922 as the year in which critical droughts of almost all durations occurred, while SPEI12 indicated the year 2022.
As a general remark, it is recognized that the use of the best-fitting distribution approach (BFA) may add a level of complexity and complicate comparisons across space and time. Nevertheless, in the case of a specific location for which a long, high-resolution meteorological series is available, BFA is preferred to the standard approach, in particular when investigating critical events. In addition, in the context of ongoing climate change, even if precipitation is the main driver of droughts, the role of temperature cannot be neglected, especially in the Mediterranean area. The results of the present study confirm that SPEI deserves adequate research attention as a reasonable and useful drought index.
The evaluation of a drought index for such a long series, which starts in the early instrumental period, i.e., at the beginning of the 18th century, has inherent limitations due to the nature of the input data, which may affect the results. First, the accuracy of the meteorological observations is a crucial point: the further back in time one goes, the more careful reconstruction work and critical analysis must be to compose a homogeneous series and correct biases. Second, the lack of observations for several meteorological variables, in particular in the early instrumental period, when generally only mean temperature and precipitation were recorded, limits the possibility of using methods to calculate PET and thus to estimate SPEI. Last but not least, a long series such as that for Padua is an exception, so it is not possible to find series from neighboring locations to use for comparison.
Finally, this study provides interesting insight into drought characterization in northern Italy over a long-term perspective. The availability of information on drought evolution over the centuries can support risk assessment and mitigation plans, helping to cope with one of the most damaging climate events of the last decades.