Article

Convolutional Neural Network-Based Transformer Fault Diagnosis Using Vibration Signals

1 School of Electrical Engineering, Beijing Jiaotong University, Beijing 100044, China
2 China Institute of Marine Technology and Economy, Beijing 100081, China
3 Beijing Rail Transit Electrical Engineering Technology Research Center, Beijing 100044, China
4 AAU Energy, Aalborg University, 9220 Aalborg, Denmark
* Authors to whom correspondence should be addressed.
Submission received: 18 April 2023 / Revised: 10 May 2023 / Accepted: 14 May 2023 / Published: 16 May 2023

Abstract

Fast and accurate fault diagnosis is crucial to transformer safety and cost-effectiveness. Vibration analysis has recently attracted increasing attention for transformer fault diagnosis because it is easy to implement and inexpensive, although the complex operating environment and varying loads of transformers pose challenges. This study proposes a novel deep-learning-enabled method for fault diagnosis of dry-type transformers using vibration signals. An experimental setup is designed to simulate different faults and collect the corresponding vibration signals. To reveal the fault information hidden in the vibration signals, the continuous wavelet transform (CWT) is applied for feature extraction, converting the vibration signals into red-green-blue (RGB) images that capture the time–frequency relationship. Then, an improved convolutional neural network (CNN) model is proposed to perform the image recognition task of transformer fault diagnosis. Finally, the proposed CNN model is trained and tested with the collected data, and its optimal structure and hyperparameters are determined. The results show that the proposed intelligent diagnosis method achieves an overall accuracy of 99.95%, which is superior to the other machine learning methods compared.

1. Introduction

As one of the most important and expensive pieces of equipment in a power system, the power transformer plays a vital role in power conversion and delivery [1]. Power transformers are generally designed for a lifetime of 20 to 35 years and can last up to 60 years with proper maintenance [2]. However, occasional in-service faults of a transformer can have catastrophic consequences for the power system and even endanger personal safety; moreover, repairing or replacing a transformer is very costly. As operating time increases, the long-term influence of mechanical stress, thermal stress, and similar factors causes more and more transformers to deteriorate, which poses a serious potential threat to the power system and places higher demands on fault diagnosis technology. In general, transformer faults can be classified as electrical, mechanical, and thermal; preventing these faults and keeping the transformer in a healthy working condition is a significant topic. Traditionally, scheduled maintenance plans inspection and testing based on experience, trying to balance low risk against low cost, which can easily result in over-maintenance or under-maintenance. Alternatively, by monitoring the characteristic parameters of a transformer in real time, condition-based maintenance (CBM) can detect an abnormal state of the equipment and diagnose it as soon as it appears, which minimizes the damage caused by a failure [3]. Thus, transformer condition monitoring and fault diagnosis techniques have recently attracted extensive attention from researchers and engineers.
Generally, transformer fault diagnosis methods can be classified as offline or online according to the working state of the transformer. Because of their simple principles and accurate results, offline methods are commonly used for annual maintenance and fault analysis. For instance, frequency response analysis (FRA) can determine the condition of the winding by measuring its impedance or admittance [4,5,6]. Short-circuit impedance (SCI) measurement can be used to evaluate the transformer operating condition [7]. Similarly, the winding resistance measurement is used to evaluate the contact condition of the winding conductors and the tap changer, and the winding ratio test can determine whether there are shorted turns or open winding circuits. However, all of these methods require the transformer to be shut down while they are carried out.
By contrast, online methods can be implemented while the transformer is in operation. Dissolved gas analysis (DGA) can be used to diagnose latent transformer faults by continuously detecting and analyzing the components of different gases dissolved in the insulating oil [8,9]. Similarly, insulating oil quality (IOQ) tests can be used to analyze the condition of the transformer insulating oil [10]. However, these two approaches apply only to oil-immersed transformers, not to dry-type transformers. Recently, with the rapid development of sensor technology and signal processing, some non-traditional diagnostic methods have evolved rapidly. Partial discharge (PD) testing is used to detect whether partial discharge is occurring in the transformer [11,12]. Ultra-wideband (UWB) signals are used to diagnose mechanical faults in the transformer winding [13]. In addition, thermal imaging can detect thermal faults in a transformer [14]. Nevertheless, some of these methods are expensive or not accurate enough.
Alternatively, vibration analysis provides an online diagnosis method for transformers that is easy and inexpensive to implement, and it has attracted increasing attention in recent years. By investigating the distribution characteristics of vibration signals, the authors of [15] showed that the vibration intensity of a transformer is related to the measurement location and the load current. Different short-circuited turn conditions of the transformer can be recognized by classifying indicators extracted from vibration signals using support vector machines (SVM), as reported in [16]. Similarly, using the total harmonic distortion (THD) of vibration signals as a fault feature, ref. [17] effectively diagnosed transformer short-circuit faults. Based on vibration and reactance information, the loose state and deformation of the transformer winding can be monitored, as reported in [18]. An effective feature extraction method for transformer vibration signals was introduced in [19]: the vibrations were decomposed into multiple modes using variational mode decomposition (VMD), and the feature vector was then extracted from those modes by wavelet transform. However, most of the above methods require detailed parameters or information about the transformer, which makes them highly dependent on expertise and limits their development.
Recent research has shown that fault diagnosis methods based on deep learning (DL) can overcome this dependence on expertise [20]; furthermore, they can also achieve higher accuracy [21]. Three main types of DL model are commonly used: the deep belief network (DBN), the recurrent neural network (RNN), and the CNN. Since the vanishing gradient problem has been addressed and the performance of the graphics processing unit (GPU) has improved, DL has made remarkable progress, especially in the fields of speech recognition [22], image recognition [23], and autonomous driving [24]. Meanwhile, some progress has also been made in transformer fault diagnosis with DL. For instance, an RNN was adopted in [3] to capture the hidden patterns of vibration time series directly, which makes it possible to diagnose abnormal excitation voltage and turn-to-turn short-circuit faults of the transformer. The authors of [25] used a CNN to recognize images converted from vibration signals and thereby identify three working conditions of transformers. Similarly, a multi-scale fusion feature extraction model based on a CNN with an attention mechanism was designed in [26], which can recognize the operating conditions of the transformer under different voltages and loads. However, the types of faults these methods can identify are relatively limited; in addition, most current research has focused on oil-immersed transformers, while little research has been done on dry-type transformers. Therefore, further research is needed on how to implement online diagnosis of multiple faults for dry-type transformers quickly and effectively.
The main contributions of this study are summarized as follows.
(1) An intelligent fault diagnosis method for dry-type transformers using vibration signals is proposed, which can quickly identify different faults under various loads of the transformer with high accuracy.
(2) A CWT method is adopted to convert the raw vibration signals of the transformer to RGB images, which can adequately extract fault features from the different conditions.
(3) An improved CNN model is designed to accurately classify the RGB images for transformer fault diagnosis, and its optimal structure and parameters are determined.
The rest of this article is organized as follows. Section 2 introduces the theoretical background. Section 3 describes the experimental setup and data. Section 4 presents the proposed method in detail, including the feature extraction and proposed CNN structure. In Section 5, experimental and test results are presented to validate the performance of the proposed method. Finally, the conclusion is drawn in Section 6.

2. Theoretical Background

2.1. Mechanism of Transformer Vibration

The transformer vibrates continuously in service, with or without load, and the vibrations are mainly caused by core vibration and winding vibration. Core vibrations are mainly generated by magnetostriction: the geometry of a magnetic material changes slightly when it is placed in a magnetic field, so vibration occurs as the strength of the magnetic field varies [16]. The fundamental frequency of the core vibration is twice the supply frequency. It should be noted that the core vibration also contains high-frequency harmonics because of the nonlinear property of magnetostriction. The amplitude of core vibrations is approximately proportional to the voltage squared, which can be represented by
$$\alpha_{\text{core}} \propto U^{2},$$
where $\alpha_{\text{core}}$ is the amplitude of core vibrations and $U$ is the voltage.
The winding vibrations are mainly generated by electromagnetic forces due to the interaction between the current in the winding and the leakage flux field. Those electromagnetic forces are proportional to the current squared [15]; since the current waveform is practically sinusoidal, the fundamental frequency of the winding vibration is 100 Hz (in the case of a 50 Hz grid). The amplitude of winding vibration is approximately proportional to the current squared, which can be represented by
$$\alpha_{\text{winding}} \propto I^{2},$$
where $\alpha_{\text{winding}}$ is the amplitude of winding vibrations and $I$ is the current.
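The factor of two between the supply frequency and the vibration frequency follows from squaring a sinusoid. As a brief worked step, assume an ideal sinusoidal current $I(t) = I_m \cos(\omega t)$; the symbols $I_m$, $\omega$, and $F$ are introduced here for illustration only and do not appear in the original text:

```latex
F(t) \propto I^{2}(t) = I_m^{2} \cos^{2}(\omega t)
     = \frac{I_m^{2}}{2}\left[ 1 + \cos(2\omega t) \right]
```

The time-varying term oscillates at $2\omega$, so a 50 Hz supply gives a 100 Hz fundamental vibration component, while the amplitude scales with $I_m^{2}$.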
The vibration of a transformer is highly correlated with its condition [27]; therefore, the vibration is employed in transformer fault diagnosis as a fault feature in this study.

2.2. Wavelet Transform

The wavelet transform is a popular tool for extracting time–frequency information from time-domain signals [28]. It inherits and develops the localization idea of the short-time Fourier transform (STFT) and overcomes the STFT's limitation of a window size that does not change with frequency [29]. The wavelet transform provides a "time–frequency" window that changes with frequency, so that fine time resolution is obtained at high frequencies and fine frequency resolution at low frequencies. There are two main types of wavelet transform: the CWT [30] and the discrete wavelet transform (DWT) [31]. The difference between them is that the CWT operates on all possible combinations of shifting and compression, while the DWT operates only on a specific subset of them.
The CWT is defined by the wavelet coefficients, which are produced by the convolution of the original signal $x(t)$ with the mother wavelet function $\psi(t)$. Through the translation (shift in time) and dilation (compression in time) of the mother wavelet function $\psi(t)$, a multi-scale refinement of the original signal $x(t)$ is gradually carried out. The transformation process can be described by
$$W_C(a, b) = \frac{1}{\sqrt{|a|}} \int_{-\infty}^{+\infty} x(t)\, \psi^{*}\!\left(\frac{t - b}{a}\right) dt,$$
where $W_C$ is the wavelet coefficient, $a$ is the scale of the mother wavelet, and $b$ is the translation of the mother wavelet. The DWT can transform the discrete input data sequence $f = \{f_n\} = \{f_0, f_1, \ldots, f_{N-1}\}$ into matrix-vector form as
$$\alpha = W f,$$
where $\alpha$ is composed of $N$ wavelet coefficients, and $W$ is an orthogonal matrix.
Wavelet decomposition is implemented through two filters: the low-pass filter (scaling filter) and the high-pass filter (wavelet filter) [32]. They share the same set of filter coefficients, but in reversed order and with alternating signs, which means they complement each other. At each decomposition level, filtering is followed by a down-sampling operation. Signal reconstruction inverts the decomposition process, and each reconstruction level involves an up-sampling operation. This procedure is known as the Mallat algorithm and is illustrated in Figure 1.
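The filter-and-resample idea can be illustrated with a minimal sketch of one decomposition level and its inverse. The Haar wavelet is chosen here only because its two-tap filters are the simplest case; the text does not say which wavelet is used for the DWT, and the function names are hypothetical. An even-length input is assumed.

```python
import math

def haar_dwt(signal):
    """One decomposition level: low-/high-pass filtering followed by down-sampling by 2."""
    s = 1 / math.sqrt(2)
    # Low-pass (scaling) filter [s, s] gives the approximation coefficients.
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    # High-pass (wavelet) filter [s, -s] gives the detail coefficients.
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail

def haar_idwt(approx, detail):
    """One reconstruction level: up-sample and combine the two coefficient sets."""
    s = 1 / math.sqrt(2)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) * s)
        out.append((a - d) * s)
    return out

signal = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
approx, detail = haar_dwt(signal)
restored = haar_idwt(approx, detail)
```

Each level halves the number of samples per branch, and because the transform is orthogonal the reconstruction recovers the input up to floating-point rounding.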

2.3. CNN

CNN is a typical deep learning algorithm, inspired by the concept of the visual nervous system [33], which can reduce image dimensionality and improve the efficiency and accuracy of image processing. It has made great achievements in computer vision [34], natural language processing [35], etc.
The typical CNN structure consists of three types of layers: the convolutional layer, the pooling layer, and the fully connected layer. The pooling operation is illustrated in Figure 2. According to task requirements, these layers are combined in different ways to form different CNN models, such as LeNet-5 [36], ResNet [37], EfficientNet [38], and 1-D CNN [39].
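As a small illustration of the pooling layer, the sketch below applies max pooling to a single feature map. Max pooling is an assumption made here for illustration (average pooling is the other common choice), and the function name and example values are hypothetical.

```python
def max_pool2d(feature_map, kernel=2, stride=2):
    """Slide a kernel x kernel window over the map and keep the largest value in each window."""
    rows = len(feature_map)
    cols = len(feature_map[0])
    pooled = []
    for r in range(0, rows - kernel + 1, stride):
        row_out = []
        for c in range(0, cols - kernel + 1, stride):
            window = [feature_map[r + i][c + j]
                      for i in range(kernel) for j in range(kernel)]
            row_out.append(max(window))
        pooled.append(row_out)
    return pooled

feature_map = [
    [1, 3, 2, 0],
    [4, 6, 5, 1],
    [7, 2, 9, 8],
    [0, 1, 3, 4],
]
pooled = max_pool2d(feature_map)  # 4 x 4 map reduced to 2 x 2
```

With a 2 × 2 kernel and a stride of 2, each spatial dimension is halved while the strongest response in every window is retained.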

3. Experimental Setup and Data

3.1. Experimental Setup

The transformer under study is a customized 50 kVA dry-type transformer with two terminals A and B, which can easily simulate turn-to-turn short circuit faults. Its main parameters are shown in Table 1. The output terminal of the transformer was connected to an adjustable load cabinet, whose power ranges from 0 to 200 kW.
Two CA-YD-188T accelerometers with a sensitivity of 500 mV/g were used to collect vibration signals of the transformer. The collected raw signals were then processed by the SIRIUSm-4xACC data acquisition instrument at a sampling rate of 8000 Hz and saved by the Dewesoft X3 software. Considering the structural characteristics and insulation safety of the studied transformer, as shown in Figure 3, the accelerometers were fixed in the vertical direction (CH1) and horizontal direction (CH2) of the core clamp, respectively. The whole experimental system is shown in Figure 4.
The loosening faults of the core, winding, and connection bar were simulated by adjusting the tightness of the clamp bolts from 50 to 80 Nm using a torque wrench, and the turn-to-turn short circuit fault was simulated by connecting a resistor between terminals A and B. It is worth mentioning that all fault types were tested at multiple load levels to represent changing loads.

3.2. Data Description and Preprocessing

As shown in Table 2, four different transformer faults were simulated in this study: core clamp looseness (CC), winding clamp looseness (WC), connection bar looseness (CB), and turn-to-turn short circuit (TT). Two different load levels were applied for each fault as well as for the normal state (NO), which gives a total of 10 different working conditions.
In order to train the proposed diagnosis model, 400 segments of the vibration signal were collected for each working condition, which eventually constituted a total dataset of 4000 samples, of which 70% were selected as the training dataset, 20% as the validation dataset, and the remaining 10% as the test dataset. It should be noted that each sample can only be assigned to one dataset, which means that the samples of the testing dataset are completely different from the training dataset and validation dataset.
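A split of this kind can be sketched as follows. The text gives only the overall 70/20/10 proportions; splitting within each working condition (a stratified split), the fixed seed, and the function name are assumptions made for illustration.

```python
import random

def stratified_split(labels, fractions=(0.7, 0.2, 0.1), seed=0):
    """Return disjoint train/validation/test index lists, split within each class."""
    rng = random.Random(seed)
    by_class = {}
    for index, label in enumerate(labels):
        by_class.setdefault(label, []).append(index)
    train, val, test = [], [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_train = round(fractions[0] * len(indices))
        n_val = round(fractions[1] * len(indices))
        train += indices[:n_train]
        val += indices[n_train:n_train + n_val]
        test += indices[n_train + n_val:]  # the remainder goes to the test set
    return train, val, test

# 10 working conditions with 400 segments each, as described above.
labels = [condition for condition in range(10) for _ in range(400)]
train_idx, val_idx, test_idx = stratified_split(labels)
```

Because every index lands in exactly one list, no sample can appear in more than one dataset, and the 4000 samples divide into 2800, 800, and 400.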
Figure 5 illustrates the converted RGB image of the normal state with load of 20 kW (NO20), and the remaining 9 cases are shown in Figure 6. It is obvious that the RGB pictures of different conditions have unique features in both the time domain and frequency domain, which demonstrates that the proposed feature extraction method works effectively.

4. Proposed Fault Diagnosis Method

The proposed transformer fault diagnosis method is presented in this section. After the vibration signals are acquired from the transformer, they are converted into RGB images by the CWT method described in Section 2.2. Then, the RGB images are classified by the proposed diagnosis model.

4.1. Feature Extraction

Vibration signals are collected by the high-frequency accelerometers. In order to fully collect transformer vibration characteristics, the sampling rate is usually around 10 kHz. The collected time-domain signals contain rich characteristic information; however, it can hardly be used directly for fault diagnosis. Therefore, a proper feature extraction method is essential.
To extract sufficient feature information from the original vibration signal, CWT is used to process the vibration signal in this study. The length of the selected raw signal segment is 1280 samples (i.e., 160 ms), and cmor3-3 (a complex Morlet wavelet) is employed as the mother wavelet with a total of 256 scales. It is worth mentioning that the sampling rate is set to 8000 Hz since the vibration frequency of the transformer in this case is essentially below 4000 Hz. As shown in Figure 7, the time-domain vibration signals are converted to RGB images after translation and dilation by the mother wavelet. The images are then labeled and proportionally divided into training, validation, and testing datasets.
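The sketch below evaluates single CWT coefficients of a synthetic segment with a complex Morlet wavelet, using the segment length and sampling rate given above. It is an illustration under stated assumptions, not the authors' implementation: the wavelet formula and the reading of "cmor3-3" as bandwidth 3 and centre frequency 3 follow the common naming convention, the normalization may differ from that of a wavelet library, the 100 Hz test tone is synthetic, and all function names are hypothetical.

```python
import cmath
import math

FS = 8000        # sampling rate in Hz (from the text)
N = 1280         # samples per segment (from the text)
B, C = 3.0, 3.0  # assumed bandwidth and centre frequency for "cmor3-3"

def cmor(t):
    """Complex Morlet mother wavelet with bandwidth B and centre frequency C."""
    return math.exp(-t * t / B) * cmath.exp(2j * math.pi * C * t) / math.sqrt(math.pi * B)

def cwt_coefficient(x, scale, shift):
    """Discrete approximation of W(a, b) = |a|^(-1/2) * sum_n x[n] * conj(psi((n - b) / a))."""
    total = 0j
    for n, value in enumerate(x):
        total += value * cmor((n - shift) / scale).conjugate()
    return total / math.sqrt(abs(scale))

def scale_for_frequency(freq_hz):
    """Scale (in samples) at which the wavelet's centre frequency equals freq_hz."""
    return C * FS / freq_hz

# Synthetic 100 Hz tone standing in for a vibration segment.
segment = [math.sin(2 * math.pi * 100 * n / FS) for n in range(N)]
duration_ms = 1000 * N / FS

# Coefficient magnitudes at the segment centre for two analysis frequencies.
mag_100 = abs(cwt_coefficient(segment, scale_for_frequency(100), N // 2))
mag_400 = abs(cwt_coefficient(segment, scale_for_frequency(400), N // 2))
```

The coefficient is large at the scale matching the 100 Hz tone and negligible at the 400 Hz scale. Evaluating the magnitude over all scales and shifts gives the time–frequency map that is then rendered as an image.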

4.2. Proposed CNN Structure

After converting the raw signals to RGB images, there are n classes of images corresponding to n transformer working conditions. Each RGB image can be divided into 3 monochrome layers to meet the requirements of the input format. In order to improve the accuracy of image recognition, the input size of the proposed model is set to 64 × 64 in this study.
Based on experience and comparison, the proposed CNN structure was finally determined as shown in Figure 8. The structure contains two convolutional layers, each followed by a pooling layer. The convolution kernels (filters) in the first and second convolutional layers are 6@5 × 5 and 16@5 × 5, respectively, which determines the number and dimensionality of the feature maps. The pooling operation reduces the size of the image by selecting the dominant pixels on the feature map, and the kernel size of both pooling layers is 2 × 2. Meanwhile, to fully capture the features of the images and control the size of feature maps, the strides of the convolutional kernels and pooling kernels are set to 1 and 2, respectively. In addition, three successive fully connected layers are designed to calculate the final feature information after the pooled feature maps are flattened into a 1-D vector. Finally, the image classification is implemented by a softmax layer.
Some other initial hyperparameters of the structure are set as follows: learning rate = 0.015, batch size = 12. The optimal combination of these parameters is discussed in Section 5.2. Finally, the flowchart of the proposed method is shown in Figure 9.
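The layer sizes above can be checked with a short calculation of the feature-map dimensions. The sketch assumes no padding in either convolution, which the text does not state; the helper name is hypothetical.

```python
def conv_output(size, kernel, stride=1, padding=0):
    """Spatial output size of a convolution or pooling layer along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1

sizes = [64]                                                  # input image: 64 x 64
sizes.append(conv_output(sizes[-1], kernel=5, stride=1))      # conv1, 6 kernels of 5 x 5
sizes.append(conv_output(sizes[-1], kernel=2, stride=2))      # pool1, 2 x 2
sizes.append(conv_output(sizes[-1], kernel=5, stride=1))      # conv2, 16 kernels of 5 x 5
sizes.append(conv_output(sizes[-1], kernel=2, stride=2))      # pool2, 2 x 2

# 16 feature maps of the final size are flattened into the 1-D vector.
flattened = 16 * sizes[-1] * sizes[-1]
```

Under this assumption the feature maps shrink from 64 to 60, 30, 26, and finally 13 pixels per side, so the flattened vector has 16 × 13 × 13 = 2704 elements, which agrees with the first fully connected layer size in the CNN-2704-y-z naming used in Section 5.1.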

5. Experimental Verification and Discussion

In this section, the vibration signals collected from the experimental setup under the different simulated faults are used to train and test the proposed diagnosis model. Moreover, the performances of different parameters in the proposed model are compared to select the optimal combination. The CNN model is written in Python 3.7 with PyTorch and runs on Windows 10 with two Nvidia RTX 2080Ti GPUs.

5.1. Comparison of Different Structures

The structure of the proposed model has a crucial impact on diagnosis accuracy. In order to find the best combination of structures, the performances of different structures were compared, and the results are shown in Table 3, where CNN-x-y-z means that there are x, y, and z neurons in the first, second, and third fully connected layer, respectively. For example, CNN-2704-126 means that there are 2704 neurons in the first layer, 126 neurons in the second layer, and there is no third layer in this structure.
Each model was run ten times, and the maximum, minimum, mean, and standard deviation (SD) of the testing accuracy were employed as criteria to evaluate the performance of diagnostic models. From the results shown in Table 3, it can be concluded that the model of CNN-2704-126-64 achieves the best performance on CH2. Its maximum, minimum, mean, and SD of testing accuracy are 100%, 97.5%, 98%, and 1.96%, respectively. All of those criteria are superior to the other structures compared. It should be noted that all six models performed better on CH2 than CH1, which indicates that the horizontal component of the transformer vibration signal contains richer fault characteristics than the vertical component in this study.
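The four criteria can be computed as in the sketch below. The accuracy values are hypothetical placeholders, not results from this study, and the use of the sample standard deviation is an assumption, since the text does not say whether the sample or population form was used.

```python
import statistics

def summarize(accuracies):
    """Return the maximum, minimum, mean, and sample SD of a list of accuracies (in %)."""
    return {
        "max": max(accuracies),
        "min": min(accuracies),
        "mean": statistics.mean(accuracies),
        "sd": statistics.stdev(accuracies),  # sample SD, divides by n - 1
    }

# Hypothetical testing accuracies from ten runs of one model (illustrative values only).
runs = [99.0, 98.5, 100.0, 97.5, 98.0, 99.5, 98.0, 97.5, 98.5, 99.0]
summary = summarize(runs)
```

A high mean combined with a low SD indicates a structure that is both accurate and stable across repeated runs.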
Figure 10 shows the training process of CNN-2704-126-64. It can be seen that at around epoch 70 the accuracy on the training dataset is close to 100% and the training loss is correspondingly close to its minimum, which indicates that the structure has good fitting performance.

5.2. Comparison of Different Hyperparameters

The batch size (BS) is one of the most important hyperparameters in deep learning and represents the number of samples used in one training iteration. It affects both how well and how fast the model is optimized, and it also determines the GPU memory usage. In order to select the most suitable BS, the diagnosis performances of different BS values are compared in Figure 11. The results show that the model achieves the best performance when BS = 20; its maximum, minimum, mean, and SD of testing accuracy are 100%, 97%, 99.2%, and 0.95%, respectively.
The learning rate (LR) determines whether and when the objective function can converge to a local minimum. A suitable LR can make the objective function converge fast and efficiently. To this end, the diagnostic performances of different LR are compared, and the results are shown in Figure 12, from which it can be seen that the best performance with a mean accuracy of 99.95% is achieved when LR = 0.02. In addition, it has a low SD of 0.32%, which indicates that the proposed parameter combination has very stable performance.
Based on the above comparison and analysis, the hyperparameters of the proposed diagnosis model are finally determined as BS = 20 and LR = 0.02. The confusion matrix of the diagnosis results is illustrated in Figure 13, where the columns represent predicted labels and the rows represent actual labels, so the entries on the diagonal count the samples whose predicted condition is consistent with the actual condition. As shown in Figure 13, all 400 testing samples, divided into 10 conditions, are classified correctly, i.e., with an accuracy of 100%, which demonstrates that the proposed method is quite effective in transformer fault diagnosis.
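A confusion matrix with this layout (rows for actual labels, columns for predicted labels) can be built as in the following sketch. The three-class labels are toy values chosen for illustration, not data from this study, and the function names are hypothetical.

```python
def confusion_matrix(actual, predicted, n_classes):
    """Count samples per (actual, predicted) pair; rows are actual, columns are predicted."""
    matrix = [[0] * n_classes for _ in range(n_classes)]
    for a, p in zip(actual, predicted):
        matrix[a][p] += 1
    return matrix

def overall_accuracy(matrix):
    """Fraction of samples on the diagonal, i.e., predicted label equals actual label."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total

# Toy example with three classes and one misclassified sample.
actual = [0, 0, 1, 1, 2, 2]
predicted = [0, 0, 1, 2, 2, 2]
matrix = confusion_matrix(actual, predicted, n_classes=3)
accuracy = overall_accuracy(matrix)
```

An accuracy of 100% corresponds to a matrix whose off-diagonal entries are all zero.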

5.3. Verification of Superiority

To verify the superiority of the proposed diagnosis method in this study, the performances of different methods are compared, including ANN [40], DBN [41], 1D-CNN, Hilbert–Huang Transform (HHT)-CNN, short-time Fourier transform (STFT)-CNN, and CWT-CNN. It is worth mentioning that the vibration signals used in all methods are collected by CH2, and each method was run ten times. The results are shown in Table 4. It can be seen that the proposed CWT-CNN method achieves the best performance, and the maximum, minimum, mean, and SD of its prediction accuracy are 100%, 99.5%, 99.95%, and 0.32%, respectively. Compared with other methods, CWT-CNN can perform better feature extraction and identification from the raw vibration signal in this study.

6. Conclusions

This study proposed a deep learning-based fault diagnosis method for transformers, which converts vibration signals into RGB images using CWT to extract the corresponding fault features and then achieves fault diagnosis through an improved CNN model. In order to train and validate the proposed model, an experimental setup was designed to simulate transformer faults, including core clamp looseness, winding clamp looseness, connection bar looseness, and turn-to-turn short circuit. The optimal structure and hyperparameters of the proposed model were determined by comparing their diagnostic performances. Compared with other methods, the proposed diagnosis method achieves the highest mean accuracy of 99.95% and the lowest SD of 0.32%. Moreover, because the model is trained offline, the feature extraction and diagnosis process takes less than 7 s, which makes fast and accurate online fault diagnosis of the transformer possible. This study can expand the field of transformer fault diagnosis and offer technical support for condition-based maintenance of operating transformers.

Author Contributions

Conceptualization, C.L. and J.C.; methodology, C.L. and P.D.; software, C.L. and J.Y.; validation, C.L. and J.Y.; formal analysis, C.L.; investigation, C.L.; resources, C.L.; data curation, C.L. and C.Y.; writing—original draft preparation, C.L.; writing—review and editing, C.L. and P.D.; visualization, C.L.; supervision, J.C. and P.D.; project administration, C.L. and J.C.; funding acquisition, Z.L. and J.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Fundamental Research Funds for the Central Universities (2018JBZ004).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tightiz, L.; Nasab, M.A.; Yang, H.; Addeh, A. An intelligent system based on optimized ANFIS and association rules for power transformer fault diagnosis. ISA Trans. 2020, 103, 63–74.
  2. Wang, M.; Vandermaar, A.J.; Srivastava, K.D. Review of condition assessment of power transformers in service. IEEE Electr. Insul. Mag. 2002, 18, 12–25.
  3. Zollanvari, A.; Kunanbayev, K.; Akhavan Bitaghsir, S.; Bagheri, M. Transformer Fault Prognosis Using Deep Recurrent Neural Network over Vibration Signals. IEEE Trans. Instrum. Meas. 2020, 70, 1–11.
  4. Akhmetov, Y.; Nurmanova, V.; Bagheri, M.; Zollanvari, A.; Gharehpetian, G.B. A new diagnostic technique for reliable decision-making on transformer FRA data in interturn short-circuit condition. IEEE Trans. Ind. Inform. 2020, 17, 3020–3031.
  5. Wu, Z.; Zhou, L.; Wang, D.; Zhou, M.; Jiang, F.; Yu, X.; Tang, H.; Zhao, H. Feature Analysis of Oscillating Wave Signal for Axial Displacement in Autotransformer. IEEE Trans. Instrum. Meas. 2021, 70, 1–13.
  6. Abbasi, A.R.; Mahmoudi, M.R.; Arefi, M.M. Transformer Winding Faults Detection Based on Time Series Analysis. IEEE Trans. Instrum. Meas. 2021, 70, 1–10.
  7. Ye, Z.; Yu, W.; Gou, J.; Tan, K.; Zeng, W.; An, B.; Li, Y. A Calculation Method to Adjust the Short-Circuit Impedance of a Transformer. IEEE Access 2020, 8, 223848–223858.
  8. Wang, L.; Littler, T.; Liu, X. Gaussian Process Multi-Class Classification for Transformer Fault Diagnosis Using Dissolved Gas Analysis. IEEE Trans. Dielectr. Electr. Insul. 2021, 28, 1703–1712.
  9. Ma, X.; Hu, H.; Shang, Y. A New Method for Transformer Fault Prediction Based on Multifeature Enhancement and Refined Long Short-Term Memory. IEEE Trans. Instrum. Meas. 2021, 70, 1–11.
  10. Soni, R.; Chakrabarti, P.; Leonowicz, Z.; Jasiński, M.; Wieczorek, K.; Bolshev, V. Estimation of Life Cycle of Distribution Transformer in Context to Furan Content Formation, Pollution Index, and Dielectric Strength. IEEE Access 2021, 9, 37456–37465.
  11. Gao, C.; Yu, L.; Xu, Y.; Wang, W.; Wang, S.; Wang, P. Partial discharge localization inside transformer windings via fiber-optic acoustic sensor array. IEEE Trans. Power Deliv. 2019, 34, 1251–1260.
  12. Sharifinia, S.; Allahbakhshi, M.; Ghanbari, T.; Akbari, A.; Mirzaei, H.R. A New Application of Rogowski Coil Sensor for Partial Discharge Localization in Power Transformers. IEEE Sens. J. 2021, 21, 10743–10751.
  13. Alehosseini, A.; Hejazi, M.A.; Mokhtari, G.; Gharehpetian, G.B.; Mohammadi, M. Detection and classification of transformer winding mechanical faults using UWB sensors and Bayesian classifier. Int. J. Emerg. Electr. Power Syst. 2015, 16, 207–215.
  14. Mariprasath, T.; Kirubakaran, V. A real time study on condition monitoring of distribution transformer using thermal imager. Infrared Phys. Technol. 2018, 90, 78–86.
  15. Jiang, P.; Zhang, Z.; Dong, Z.; Wu, Y.; Xiao, R.; Deng, J.; Pan, Z. Research on distribution characteristics of vibration signals of ±500 kV HVDC converter transformer winding based on load test. Int. J. Electr. Power Energy Syst. 2021, 132, 107200–107210.
  16. Huerta-Rosales, J.R.; Granados-Lieberman, D.; Garcia-Perez, A.; Camarena-Martinez, D.; Amezquita-Sanchez, J.P.; Valtierra-Rodriguez, M. Short-circuited turn fault diagnosis in transformers by using vibration signals, statistical time features, and support vector machines on fpga. Sensors 2021, 21, 3598.
  17. Bagheri, M.; Nezhivenko, S.; Naderi, M.S.; Zollanvari, A. A new vibration analysis approach for transformer fault prognosis over cloud environment. Int. J. Electr. Power Energy Syst. 2018, 100, 104–116.
  18. Cao, C.; Xu, B.; Li, X. Monitoring Method on Loosened State and Deformational Fault of Transformer Winding Based on Vibration and Reactance Information. IEEE Access 2020, 8, 215479–215492.
  19. Hong, K.; Wang, L.; Xu, S. A Variational Mode Decomposition Approach for Degradation Assessment of Power Transformer Windings. IEEE Trans. Instrum. Meas. 2019, 68, 1221–1229.
  20. Xie, T.; Huang, X.; Choi, S.K. Intelligent Mechanical Fault Diagnosis Using Multisensor Fusion and Convolution Neural Network. IEEE Trans. Ind. Inform. 2022, 18, 3213–3223.
  21. Saufi, S.R.; Ahmad, Z.A.B.; Leong, M.S.; Lim, M.H. Gearbox Fault Diagnosis Using a Deep Learning Model with Limited Data Sample. IEEE Trans. Ind. Inform. 2020, 16, 6263–6271.
  22. Zhang, Z.; Geiger, J.; Pohjalainen, J.; Mousa, A.E.D.; Jin, W.; Schuller, B. Deep learning for environmentally robust speech recognition: An overview of recent developments. ACM Trans. Intell. Syst. Technol. 2018, 9, 1–28.
  23. Jiang, F.; Lu, Y.; Chen, Y.; Cai, D.; Li, G. Image recognition of four rice leaf diseases based on deep learning and support vector machine. Comput. Electron. Agric. 2020, 179, 105824–105832.
  24. Rastgoo, M.N.; Nakisa, B.; Maire, F.; Rakotonirainy, A.; Chandran, V. Automatic driver stress level classification using multimodal deep learning. Expert Syst. Appl. 2019, 138, 112793–112803.
  25. Hong, K.; Jin, M.; Huang, H. Transformer winding fault diagnosis using vibration image and deep learning. IEEE Trans. Power Deliv. 2021, 36, 676–685.
  26. Xiao, R.; Zhang, Z.; Wu, Y.; Jiang, P.; Deng, J. Multi-scale information fusion model for feature extraction of converter transformer vibration signal. Meas. J. Int. Meas. Confed. 2021, 180, 109555–109566.
  27. Arroyo, A.; Martinez, R.; Manana, M.; Pigazo, A.; Minguez, R. Detection of ferroresonance occurrence in inductive voltage transformers through vibration analysis. Int. J. Electr. Power Energy Syst. 2019, 106, 294–300.
  28. Chen, B.; Shen, B.; Chen, F.; Tian, H.; Xiao, W.; Zhang, F.; Zhao, C. Fault diagnosis method based on integration of RSSD and wavelet transform to rolling bearing. Meas. J. Int. Meas. Confed. 2019, 131, 400–411.
  29. Gao, J.; Wang, B.; Wang, Z.; Wang, Y.; Kong, F. A wavelet transform-based image segmentation method. Optik 2020, 208, 164123–164130.
  30. Mojahed, A.; Bergman, L.A.; Vakakis, A.F. New inverse wavelet transform method with broad application in dynamics. Mech. Syst. Signal Process. 2021, 156, 107691–107712.
  31. Chen, R.; Huang, X.; Yang, L.; Xu, X.; Zhang, X.; Zhang, Y. Intelligent fault diagnosis method of planetary gearboxes based on convolution neural network and discrete wavelet transform. Comput. Ind. 2019, 106, 48–59.
  32. Guo, M.F.; Yang, N.C.; You, L.X. Wavelet-transform based early detection method for short-circuit faults in power distribution networks. Int. J. Electr. Power Energy Syst. 2018, 99, 706–721.
  33. Li, Z.; Liu, F.; Yang, W.; Peng, S.; Zhou, J. A Survey of Convolutional Neural Networks: Analysis, Applications, and Prospects. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 6999–7019.
  34. Yang, R.; Singh, S.K.; Tavakkoli, M.; Amiri, N.; Yang, Y.; Karami, M.A.; Rai, R. CNN-LSTM deep learning architecture for computer vision-based modal frequency detection. Mech. Syst. Signal Process. 2020, 144, 106885–106902. [Google Scholar] [CrossRef]
  35. Liu, J.; Yang, Y.; Lv, S.; Wang, J.; Chen, H. Attention-based BiGRU-CNN for Chinese question classification. J. Ambient Intell. Humaniz. Comput. 2019, 1–12. [Google Scholar] [CrossRef]
  36. LeCun, Y. LeNet-5, Convolutional Neural Networks. 2015; Volume 20, p. 14. Available online: http://yann.lecun.com/exdb/lenet (accessed on 17 April 2023).
  37. He, K.; Zhang, X.; Ren, S.; Sun, J. Identity mappings in deep residual networks. In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Proceedings, Part IV 14. Springer: Berlin/Heidelberg, Germany, 2016; pp. 630–645.
  38. Tan, M.; Le, Q. Efficientnet: Rethinking model scaling for convolutional neural networks. In Proceedings of the 36th International Conference on Machine Learning (ICML), Long Beach, CA, USA, 10–15 June 2019; pp. 6105–6114.
  39. Zhao, B.; Zhang, X.; Li, H.; Yang, Z. Intelligent fault diagnosis of rolling bearings based on normalized CNN considering data imbalance and variable working conditions. Knowl.-Based Syst. 2020, 199, 105971–105986.
  40. Ben Ali, J.; Fnaiech, N.; Saidi, L.; Chebel-Morello, B.; Fnaiech, F. Application of empirical mode decomposition and artificial neural network for automatic bearing fault diagnosis based on vibration signals. Appl. Acoust. 2015, 89, 16–27.
  41. Shao, H.; Jiang, H.; Zhang, X.; Niu, M. Rolling bearing fault diagnosis using an optimization deep belief network. Meas. Sci. Technol. 2015, 26, 115002.
Figure 1. Mallat algorithm of wavelet decomposition and reconstruction.
Figure 2. Process of the pooling operation.
Figure 3. Position of the accelerometer on the studied transformer.
Figure 4. Experimental system of transformer fault diagnosis.
Figure 5. CWT conversion image of the normal state.
Figure 6. Converted RGB images of nine conditions.
Figure 7. Feature extraction procedure.
Figure 8. The structure of the proposed diagnosis model.
Figure 9. Flowchart of the proposed diagnosis method.
Figure 10. Training process of the proposed structure.
Figure 11. Diagnosis result of different batch sizes.
Figure 12. Diagnosis result of different learning rates.
Figure 13. Confusion matrix of the proposed method.
Table 1. Main parameters of the studied transformer.

| Categories | Parameters |
|---|---|
| Rated power | 50 kVA |
| Rated frequency | 50 Hz |
| Type of cooling | Air natural cooling |
| Service condition | Indoor |
| Host weight | 330 kg |
| Shape size | 740 × 460 × 790 mm |
| Rated voltage (primary) | 10 kV |
| Rated voltage (secondary) | 0.4 kV |
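Table 2. Working states of the studied transformer.

| Working States | Loads (kW) | Categories |
|---|---|---|
| Normal state | 20 | NO20 |
| Normal state | 40 | NO40 |
| Core clamp looseness | 20 | CC20 |
| Core clamp looseness | 40 | CC40 |
| Winding clamp looseness | 20 | WC20 |
| Winding clamp looseness | 40 | WC40 |
| Connection bar looseness | 20 | CB20 |
| Connection bar looseness | 40 | CB40 |
| Turn-to-turn short circuit | 20 | TT20 |
| Turn-to-turn short circuit | 40 | TT40 |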
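
The category codes in Table 2 combine a two-letter state abbreviation with the load level. A minimal Python sketch of that naming scheme follows. The names `STATE_CODES`, `LOADS_KW`, `build_categories`, and `LABEL_TO_INDEX` are hypothetical, and the integer class encoding is an assumption for illustration, not something the article specifies.

```python
from itertools import product

# State abbreviations and load levels as listed in Table 2.
STATE_CODES = {
    "Normal state": "NO",
    "Core clamp looseness": "CC",
    "Winding clamp looseness": "WC",
    "Connection bar looseness": "CB",
    "Turn-to-turn short circuit": "TT",
}
LOADS_KW = (20, 40)


def build_categories(state_codes=STATE_CODES, loads_kw=LOADS_KW):
    """Return category labels such as 'NO20', in the row order of Table 2."""
    return [
        f"{code}{load}"
        for code, load in product(state_codes.values(), loads_kw)
    ]


CATEGORIES = build_categories()

# Hypothetical integer encoding of each label for use as a classifier target.
LABEL_TO_INDEX = {label: index for index, label in enumerate(CATEGORIES)}
```

With this sketch, `CATEGORIES` holds the ten labels in table order and `LABEL_TO_INDEX["NO20"]` is `0`.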
Table 3. Result of CNN models with different structures.

Testing accuracy (%):

| Structures | Max CH1 | Max CH2 | Min CH1 | Min CH2 | Mean CH1 | Mean CH2 | SD CH1 | SD CH2 |
|---|---|---|---|---|---|---|---|---|
| CNN-2704-126 | 96.5 | 97.5 | 58.5 | 63 | 93.95 | 95.3 | 12.31 | 6.30 |
| CNN-2704-256 | 95 | 98 | 65.5 | 87 | 92.3 | 94.15 | 14.92 | 9.11 |
| CNN-2704-126-32 | 100 | 99.5 | 84 | 79.5 | 94.55 | 96.35 | 4.81 | 4.39 |
| CNN-2704-126-64 | 99 | 100 | 95.5 | 97.5 | 95.15 | 98 | 2.94 | 1.96 |
| CNN-2704-126-128 | 100 | 100 | 87.5 | 93.5 | 93.85 | 95.3 | 5.19 | 3.03 |
Table 4. Diagnosis performance of different methods.

Testing accuracy (%):

| Methods | Max | Min | Mean | SD |
|---|---|---|---|---|
| ANN | 84.5 | 55.5 | 71.73 | 9.25 |
| DBN | 87.5 | 68 | 82.1 | 8.9 |
| 1D-CNN | 92.5 | 84.5 | 91.52 | 5.47 |
| HHT-CNN | 95.5 | 89 | 93.25 | 2.84 |
| STFT-CNN | 95 | 87.5 | 94.14 | 3.93 |
| CWT-CNN | 100 | 99.5 | 99.95 | 0.32 |
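
Table 4 reports each method by the maximum, minimum, mean, and standard deviation (SD) of its testing accuracy. The sketch below shows one way such a summary could be computed from repeated test runs. It assumes the SD is the sample standard deviation, which the table does not state, and the function name `summarize_accuracy` and the `runs` values are hypothetical, not data from the article.

```python
import statistics


def summarize_accuracy(accuracies):
    """Summarize per-run testing accuracies (%) as max, min, mean and SD."""
    return {
        "max": max(accuracies),
        "min": min(accuracies),
        "mean": statistics.fmean(accuracies),
        # Sample standard deviation (n - 1 denominator); this is an assumption.
        "sd": statistics.stdev(accuracies),
    }


# Hypothetical accuracies from five repeated runs (illustrative values only).
runs = [100.0, 99.5, 100.0, 100.0, 100.0]
summary = summarize_accuracy(runs)
```

If the population standard deviation is wanted instead, `statistics.pstdev` can replace `statistics.stdev`.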

Li, C.; Chen, J.; Yang, C.; Yang, J.; Liu, Z.; Davari, P. Convolutional Neural Network-Based Transformer Fault Diagnosis Using Vibration Signals. Sensors 2023, 23, 4781. https://doi.org/10.3390/s23104781
