1. Introduction
With the rapid development of sensing and signal processing techniques, a variety of sensors are deployed in complicated systems to monitor their operational states. Each sensor provides a set of measured values that reflects one aspect of the running system, for example, temperature, humidity or pressure. However, those measured quantities are often strongly or weakly related to one another. Moreover, they are affected by noise and interference, so it is difficult to judge the operating state directly from the simultaneously measured signals. At the same time, engineers and researchers are no longer satisfied with real-time monitoring of a complex system; they want to predict its future operational trend from the current and previous states. Although each sensor records a real signal independently, for simplicity and practicality it is better to synthesize a single comprehensive signal, called the indicator function, that takes all measured signals into account. Before the indicator function is synthesized, random factors and interference in the measured signals must be suppressed effectively. Based on the synthesized indicator function, the future states of the complex system can then be forecast reliably.
To forecast the future operational behaviors or states, the autoregressive moving average (ARMA) model can be directly used [
1]. ARMA is a high-precision short-term prediction method for time series analysis and offers a simple description for correlated linear, random processes [
2,
3]. For a linear time-invariant system, the observed data can be expressed by time-series containing historical observations and measurement noise [
4]. When ARMA is used to fit or predict a signal, an important issue is to estimate the order of the ARMA model [
5]. ARMA can be considered as the combination of an autoregressive (AR) model and a moving average (MA) model [
3,
6]. It is more popular than either of them because it combines the advantages of both, although it is also more complex [
7]. In time series prediction, ARMA provides a more effective linear model with the fewest parameters [
8,
9,
10]. Hence, it is adopted in this paper to forecast the future trend from the indicator function.
Prior to synthesizing the indicator function, the random noise in the recorded signals should be eliminated [
11,
12]. Many signal processing techniques have been used, including fast Fourier transform (FFT)-based methods, wavelet transform (WT), empirical mode decomposition (EMD), empirical wavelet transform (EWT) and some improved methods [
13,
14]. FFT is the most widely used method for extracting features from stationary signals [
13]. However, it is not suitable for nonlinear and non-stationary signals [
14]. To overcome the shortcomings of FFT, the wavelet transform was proposed. However, it requires careful selection of the mother wavelet and the decomposition level in order to obtain good time-frequency resolution [
15,
16]. In addition, synchrosqueezed wavelet transform (SWT) can also be used to denoise the acquired dynamic signals reliably [
16]. Recently, a methodology based on the SWT, the Hilbert transform and the Kalman filter was proposed for parameter identification [
15]. Empirical mode decomposition (EMD) is an adaptive data processing method used to extract the mode information of nonlinear and non-stationary time series [
17,
18,
19,
20,
21]. Its major drawbacks are the lack of an explicit mathematical theory and the presence of the mode-mixing effect [
15,
16,
19]. In particular, EMD may produce too many modes that are difficult to interpret [
22].
The empirical wavelet transform (EWT) was proposed recently. Compared with EMD, it gives a more consistent decomposition and has a rigorous theoretical foundation [
22,
23,
24]. It combines the advantages of Fourier analysis and wavelet theory to extract the different modes in a complicated signal [
23,
25,
26]. Yuan et al. applied EWT to modal identification, combining it with second-order blind identification (SOBI) to improve the identification of modal frequencies [
22]. Amezquita-Sanchez et al. proposed a new multiple signal classification-empirical wavelet transform (MUSIC-EWT) methodology [
27]. It can obtain the individual mono-components according to the actual frequency information [
27,
28]. In addition, Amezquita-Sanchez et al. used this method to estimate the natural frequencies (NF) and damping ratios (DR) of large structures [
28]. Dong et al. also proposed a modified EWT method based on local window maxima (LWM) [
29]. It obtains meaningful modes by searching for the local maxima of the Fourier spectrum within a proper window and automatically determining the boundaries of the spectrum segments. In this paper, the modified EWT is used to extract the significant modes of the signals measured from a complex system, in order to synthesize the indicator function for future trend forecasts.
The remainder of this paper is organized as follows: the main idea and principle of the proposed method are presented and discussed in detail in
Section 2. The numerical experiments are performed and their results are discussed briefly in
Section 3. Finally, conclusions are drawn in
Section 4.
2. The Trend Forecast Method
In engineering and technical fields, there is a special class of signals that are captured from the same complicated system and are strongly or weakly related to one another. While the system is running, each signal reflects only one aspect of the operational state, for example, temperature, humidity or pressure. If we extract the significant modes of all measured signals, we can synthesize an indicator function, which can be applied to forecast the future operational states of the whole system. Before the significant mode of each measured signal is extracted, the random noise and interference must be carefully reduced so that the extracted modes are reliable. To accomplish these tasks, we propose a novel method based on the empirical wavelet transform (EWT) and the autoregressive moving average (ARMA) model. The block diagram of the approach is shown in
Figure 1.
Suppose
are
K signals captured from a complex engineering system. Each of them is composed of a set of frequency components. The
ith signal
not only includes its significant mode but also contains other frequency components, including various kinds of noise. To extract the significant mode in
, empirical wavelet transform (EWT) is adopted and performed on
, owing to its robustness to interference and its computational efficiency. Then, the extracted significant mode
from
is normalized into the interval [0,1] to remove the effect of the different numerical ranges of the measured signals. The normalized result of
can be denoted as
, for
. After that, all significant modes
are weighted and summed as the indicator function
l, which reflects the overall operational state of the monitored complex system. Finally, ARMA, which provides an effective linear model with a small number of coefficients, is applied to the slowly varying
l to predict the future operational states of the complex system. The following subsections discuss in detail the main ideas summarized in
Figure 1.
2.1. Extraction of Significant Modes
In general, the signals captured from a complicated system are inevitably affected by random factors and various kinds of noise. If a measured signal consists of multiple disjoint narrow-band components and wide-band noise, the narrow-band component with the maximum relative energy can be considered the significant mode of the original signal. The significant modes can be extracted effectively from all original signals by the empirical wavelet transform (EWT), even in the presence of random interference. EWT essentially designs a set of suitable wavelet filters that split a signal into several bands, each of which corresponds to one mode. The mode with the largest energy is selected as the significant one, while the others are regarded as interference components and discarded. In EWT, spectrum segmentation is the most important step in obtaining the different modes [
25,
26]. It depends on the reliable detection of the local spectrum peaks of the original signal. In classical EWT, the midpoint frequency between two consecutive spectrum peaks is taken as the boundary between them [
20,
26]. Suppose the spectrum interval
of each digital signal is divided into
N segments and their boundaries are denoted by
(
and
). Hence, the empirical wavelets can be constructed from the empirical scale function and the empirical wavelet function, which are expressed by
and
where
and
are defined by
and
When the
ith signal
is processed by EWT, the approximate coefficients are obtained as the inner product of the signal and the empirical scale function:
where
represents the approximate coefficient,
represents empirical scale function, and
represents the object signal.
represents the inner product of the object signal and the empirical scale function, and
and
represent the Fourier transform results, respectively. The symbol
represents the inverse Fourier transform. Similarly, the detail coefficients of EWT are given by the inner products with the empirical wavelets:
where
represents the
n-th detail coefficient, and
represents the
n-th empirical wavelet function.
represents the inner product of the object signal and the empirical wavelet function,
and
represent the Fourier transform results, respectively.
The
i-th signal
can be reconstructed by
where
. Therefore, the
n-th empirical mode of each signal is given by
where
denotes the
i-th measured signal and
denotes the
n-th empirical mode.
In empirical wavelet transform, FFT is applied to calculate the spectrum of the object signal and then the spectrum peaks are employed to determine the boundaries of the different modes. For a noise-contaminated signal, it may result in incorrect boundaries because the spectrum peaks are sensitive to noise and interference. To improve reliability of spectrum segmentation, the modified EWT, which is based on local window maxima (LWM) [
29], is adopted in this paper to extract the significant modes of the measured signals. It detects the local spectrum peaks reliably at the cost of additional computation. The main idea is to find all local maxima of the spectrum and to take the global maximum as the first peak. Then, all spectrum values within a window around that peak are set to zero. The remaining peaks are found successively in the same way until the number of peaks meets the predetermined requirement. The modified EWT (LWM-EWT) is insensitive to noise and can avoid incorrect spectrum segmentation.
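The peak search described above can be sketched in a few lines of Python. This is only an illustrative sketch of the masking idea, not the implementation of [29]: the function name `lwm_peaks` and the parameter `half_window` (the number of bins zeroed on each side of a detected peak) are assumptions introduced here.

```python
def lwm_peaks(spectrum, num_peaks, half_window):
    """Pick spectrum peaks by repeated global-maximum search with masking.

    spectrum    : non-negative magnitude values, one per frequency bin
    num_peaks   : number of peaks requested (the predetermined requirement)
    half_window : bins set to zero on each side of a detected peak (assumed name)
    """
    work = list(spectrum)              # copy so the caller's data is untouched
    peaks = []
    while len(peaks) < num_peaks:
        k = max(range(len(work)), key=work.__getitem__)
        if work[k] <= 0.0:             # nothing left above zero: stop early
            break
        peaks.append(k)
        lo = max(0, k - half_window)
        hi = min(len(work), k + half_window + 1)
        for j in range(lo, hi):        # mask the neighbourhood of this peak
            work[j] = 0.0
    return sorted(peaks)


# Toy magnitude spectrum: bin 3 is a spurious spike next to the true peak at bin 2.
spectrum = [0.1, 0.2, 5.0, 4.8, 0.3, 0.1, 3.0, 0.2, 0.1, 1.0]
print(lwm_peaks(spectrum, num_peaks=3, half_window=1))  # [2, 6, 9]
```

In the toy spectrum, the spike at bin 3 is masked together with the peak at bin 2, so it is not reported as a separate component. A plain search for the three largest values would have returned it.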
The anti-interference ability of the modified EWT (LWM-EWT) can be verified by the following example. The simulated signal is expressed as:
where
is the
k-th value in the amplitude vector
.
is the
k-th value of the vector
, to tune the
k-th harmonic frequency.
Hz,
,
. The sampling frequency
Hz and sampling time
s. The spectrum peaks obtained by the classical EWT and LWM-EWT [
29] are shown in
Figure 2.
As shown in
Figure 2, LWM-EWT effectively detects the five significant components contained in the simulated signal and avoids excessive spectrum segmentation. Thus, LWM-EWT is adopted in this paper to extract the significant modes from the multi-component signals captured from a complex system, in order to obtain more reliable results.
Supposing that
N modes are obtained by performing EWT on the
i-th signal
(
), they can be denoted as
, for
. The relative energy values of
N modes are calculated by the following equation:
In general, the mode corresponding to the maximum value of relative energy can be considered as the significant one. For a real signal, if five modes are obtained by EWT, their relative energy values are denoted as a vector , then the second mode is considered as the significant one, due to its maximum value.
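The selection rule can be illustrated with a short Python sketch. It assumes that the relative energy of a mode is its sum of squared samples divided by the total over all modes; this is one common definition and may differ in detail from the equation used in the paper. The function names are hypothetical.

```python
def relative_energies(modes):
    """Energy share of each mode; `modes` is a list of equal-length sequences."""
    energies = [sum(v * v for v in mode) for mode in modes]
    total = sum(energies)
    if total == 0.0:
        raise ValueError("all modes have zero energy")
    return [e / total for e in energies]


def significant_mode_index(modes):
    """Index of the mode with the largest relative energy."""
    shares = relative_energies(modes)
    return max(range(len(shares)), key=shares.__getitem__)


# Three toy modes with energies 2, 18 and 4: the second one dominates.
modes = [[1.0, -1.0], [3.0, 3.0], [0.0, 2.0]]
print(significant_mode_index(modes))  # 1
```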
Suppose
K signals are measured from a complex system, for the
i-th signal
, the significant mode
can be extracted by EWT from
, according to the relative energy equation expressed in Equation (
11), for
. Therefore,
K significant modes can be obtained and denoted as
. The major process of significant mode extraction can be expressed by the following algorithm:
Load all K original signals measured from a complex system.
For the i-th signal, perform the empirical wavelet transform and determine its significant mode:
- a. Set the maximum mode number and the window length according to prior knowledge.
- b. Calculate the magnitude spectrum of the i-th signal by the fast Fourier transform (FFT).
- c. Use the LWM algorithm to find the spectrum peaks, using a mask window of the chosen length.
- d. Calculate the boundary values for dividing the spectrum according to the peaks.
- e. Construct the empirical wavelets from the boundaries, including the empirical scale function and the empirical wavelet functions.
- f. Apply the constructed wavelets to the i-th signal to obtain its modes.
- g. For all modes, calculate the relative energy values.
- h. Select the mode with the largest relative energy as the significant one.
Repeat the above operations until all K significant modes have been output.
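Step d of the procedure can be sketched as follows, assuming the classical EWT rule mentioned earlier, in which each boundary lies midway between two consecutive spectrum peaks. The function names and the parameter `f_max` (the upper edge of the analysed band, for example the Nyquist frequency) are assumptions made for illustration.

```python
def spectrum_boundaries(peak_freqs, f_max):
    """Midpoint boundaries between consecutive peaks, plus the outer edges 0 and f_max."""
    peaks = sorted(peak_freqs)
    inner = [(a + b) / 2.0 for a, b in zip(peaks, peaks[1:])]
    return [0.0] + inner + [f_max]


def segments(boundaries):
    """Consecutive (low, high) frequency pairs, one per mode."""
    return list(zip(boundaries, boundaries[1:]))


# Three detected peaks (in Hz) inside a 0-100 Hz band give three segments.
bounds = spectrum_boundaries([10.0, 30.0, 70.0], f_max=100.0)
print(bounds)            # [0.0, 20.0, 50.0, 100.0]
print(segments(bounds))  # [(0.0, 20.0), (20.0, 50.0), (50.0, 100.0)]
```

Each segment contains exactly one detected peak, so the number of modes equals the number of peaks found in step c.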
2.2. Synthesize Indicator Function
For a complicated system, suppose
K signals
from different sensors (temperature, humidity, pressure, etc.) are recorded simultaneously. Each measured signal reflects only one aspect of the operational state. To reveal the overall operating state, all measured signals must be considered. For simplicity, the significant modes are used instead of the original signals. To minimize the effect of their different numerical ranges, the values of the significant modes
must be normalized into [0,1]. The normalization process can be represented by
where
. The values of
and
denote the maximum and minimum of the significant mode
, respectively.
denotes the normalized result of
.
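A minimal Python sketch of this min-max normalization is given below. The function name is hypothetical, and the handling of a constant mode (mapped to all zeros) is a convention assumed here, since the text does not address that case.

```python
def normalize_unit_interval(mode):
    """Map the values of a significant mode linearly onto [0, 1]."""
    lo, hi = min(mode), max(mode)
    if hi == lo:
        # Constant mode: the ratio is undefined, so return zeros (assumed convention).
        return [0.0 for _ in mode]
    span = hi - lo
    return [(v - lo) / span for v in mode]


print(normalize_unit_interval([2.0, 4.0, 6.0]))  # [0.0, 0.5, 1.0]
```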
Owing to the complex correlation or coupling relationships among the measured signals
, or their significant modes
, we prefer to look for a comprehensive indicator to reveal the operational state of the complex system rather than consider multiple signals simultaneously. For simplicity, the normalized significant modes
are weighted and summed to form an indicator function:
where
represents the
i-th weight coefficient corresponding to the normalized significant modes
, for
. In particular, they meet the condition of
.
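The weighted sum can be sketched in Python as follows. The sketch assumes that the weights sum to 1 and uses equal weights when none are supplied; both choices, as well as the function name, are assumptions made for illustration.

```python
def indicator_function(normalized_modes, weights=None):
    """Weighted sum of K normalized significant modes, sample by sample."""
    k = len(normalized_modes)
    if weights is None:
        weights = [1.0 / k] * k                    # equal weights by default
    if len(weights) != k:
        raise ValueError("one weight per mode is required")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights are assumed to sum to 1")
    length = len(normalized_modes[0])
    if any(len(mode) != length for mode in normalized_modes):
        raise ValueError("all modes must have the same length")
    return [
        sum(w * mode[t] for w, mode in zip(weights, normalized_modes))
        for t in range(length)
    ]


modes = [[0.0, 1.0], [1.0, 0.0]]
print(indicator_function(modes, [0.25, 0.75]))  # [0.75, 0.25]
print(indicator_function(modes))                # [0.5, 0.5]
```

Because every normalized mode lies in [0,1] and the weights sum to 1, the resulting indicator also stays within [0,1].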
In general, there are two ways to obtain the weight coefficients
in Equation (
13): one is to derive the weight coefficients from an accurate physical model of the complex system; the other is to use a data-driven method to set up a related empirical formula. However, it is very difficult to determine the weight coefficients by either method in practice, because there are usually no explicit mathematical models or available empirical formulas that express the complicated relationships among the measured signals. For a specific problem, field engineers may adjust the weight coefficients based on their prior knowledge. When no such knowledge is available, the weight coefficients can all be set to the same value, i.e.
. Equal weights cannot accurately describe the relationships among the measured signals, which may lead to inaccuracies in the indicator function and in the predicted results. If the relative importance of each signal is known, the weight coefficients must be adjusted accordingly.
2.3. Forecast Future Trend
On the basis of the indicator function
l obtained by (
13), we can accurately forecast the trend which reflects the future operational states of the complex system. The autoregressive moving average (ARMA) model is applied to implement this task in this paper. In the theory of ARMA, a measured signal is considered to be a set of random variables that depend on time
t. Although each individual value of the signal is uncertain, the evolution of the entire signal follows a certain rule that can be approximately described by a mathematical model [
3,
30]. The core idea of future trend forecasting is to use the extrapolation mechanism constructed by ARMA to obtain a better prediction result. The process of
can be represented by
where
denotes the indicator function
l depending on time
t.
denotes the autoregressive (AR) coefficients at lags
i.
and
denote the residual or error terms.
denotes the moving average (MA) coefficients at lags
j.
p and
q denote the number of AR and MA coefficients, respectively.
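For reference, a standard textbook form of an ARMA(p, q) process that matches the definitions above is written out below. The symbol names for the AR coefficients, MA coefficients and error terms are assumed here, and the plus sign in front of the moving-average sum is one common convention; some references use a minus sign instead.

```latex
l_t = \sum_{i=1}^{p} \varphi_i \, l_{t-i} + \varepsilon_t + \sum_{j=1}^{q} \theta_j \, \varepsilon_{t-j}
```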
From (
14), the model of
is a memory system that includes the past states and various kinds of noise. That is, the value of the sequence at a given moment can be represented by a linear combination of
p historical observations and
q moving average values of a white noise sequence [
4,
5]. The AR coefficient
determines the effect of the historical observations, while the MA coefficient
determines the effect of random factors [
3,
30]. The one-step-ahead prediction can be represented as
Similarly,
steps ahead forecast is expressed as
Before using the
model, we need to test the stationarity of the indicator function using the augmented Dickey–Fuller (ADF) test. If it is not stationary, differencing is applied until the transformed series is stationary [
1]. Then, we determine the orders of the ARMA model by minimizing a selected criterion function. Candidate criterion functions include the Final Prediction Error (FPE), the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC).
Another important step is parameter estimation for the ARMA model. In general, maximum likelihood estimation or least squares estimation can be used to estimate the coefficients of the ARMA model.
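Once the coefficients have been estimated, the one-step and multi-step forecasts described earlier follow from a simple recursion. The Python sketch below assumes a zero-mean series, a plus sign on the moving-average terms, and future noise terms replaced by their expected value of zero. The function and argument names are hypothetical, and the coefficients in the example are made up rather than fitted.

```python
def arma_forecast(history, residuals, ar, ma, steps):
    """Recursive ARMA forecasts from known coefficients.

    history   : observed values, oldest first (at least len(ar) values)
    residuals : one-step residuals aligned with history (at least len(ma) values)
    ar, ma    : coefficients for lags 1..p and 1..q
    steps     : forecast horizon
    """
    if len(history) < len(ar) or len(residuals) < len(ma):
        raise ValueError("not enough past values for the requested orders")
    values = list(history)
    errors = list(residuals)
    forecasts = []
    for _ in range(steps):
        ar_part = sum(c * values[-i] for i, c in enumerate(ar, start=1))
        ma_part = sum(c * errors[-j] for j, c in enumerate(ma, start=1))
        nxt = ar_part + ma_part
        forecasts.append(nxt)
        values.append(nxt)      # the forecast stands in for the unseen observation
        errors.append(0.0)      # unseen noise is replaced by its zero expectation
    return forecasts


# ARMA(1, 1) with made-up coefficients: the first step uses the last residual,
# later steps decay through the AR term only.
print(arma_forecast([1.0, 2.0], [0.0, 0.5], ar=[0.5], ma=[0.4], steps=2))
```

The moving-average contribution vanishes after q steps, so long-horizon forecasts are driven only by the autoregressive part.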
In this paper, we determine the parameter ranges of
using the autocorrelation function and the partial autocorrelation function of the indicator function. Then, we test the
model in the valid ranges of
, successively from low orders to high ones. The appropriate model is finally selected according to the AIC proposed by Akaike [
31]. In this way, the forecast values can be calculated by (
15) and (
16), using the indicator function
l and the selected
model. The resulting values are taken as the predicted results, which reflect the future operational states of the monitored complex system. In general, the predicted results are evaluated by the relative error expressed as a percentage:
where
is the true state value of the complex system.
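The evaluation measure can be sketched as below. The sketch assumes the usual definition, the absolute difference between the predicted and true values divided by the magnitude of the true value and multiplied by 100. Whether the paper takes the absolute value is an assumption, and the function name is hypothetical.

```python
def relative_error_percent(predicted, true):
    """Relative error of a prediction, in percent of the true value."""
    if true == 0:
        raise ValueError("relative error is undefined for a true value of zero")
    return abs(predicted - true) / abs(true) * 100.0


# A prediction of 110 against a true value of 100 is off by about 10 percent.
print(relative_error_percent(110.0, 100.0))
```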
4. Conclusions
For any complex system in engineering and technical fields, operational state prediction is a very important technique for guaranteeing safe operation. On the basis of the empirical wavelet transform and the autoregressive moving average model, an effective forecast method is proposed and discussed in this paper. For the multiple signals measured from a complex system, the significant modes are extracted reliably by means of the empirical wavelet transform and the relative energy of the modes. These significant modes are highly consistent with the original signals, but they have smoother waveforms or higher signal-to-noise ratios (SNR). To suppress the negative effect of differing numerical ranges, the significant modes are normalized, weighted and summed into an indicator function, which reflects the overall operational state over time. From this simple indicator function, the future running trend of the complex system is reliably predicted by the autoregressive moving average technique. The effectiveness and practicability of the presented method have been verified by a set of experiments whose multi-channel signals were recorded from actual complex systems. The experimental results show that the proposed method has been applied successfully in engineering and financial fields. The proposed approach can also be easily extended to scientific, technical, social and other fields.
To obtain better forecast results for complex systems, future work related to this paper may focus on (1) improving the robustness of significant component extraction and (2) enhancing the accuracy of the prediction methods.