Next Article in Journal
What Is Beyond Hyperbola Detection and Characterization in Ground-Penetrating Radar Data?—Implications from the Archaeological Site of Goting, Germany
Previous Article in Journal
A Novel Beam-Domain Direction-of-Arrival Tracking Algorithm for an Underwater Target
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Automatic Method for Detecting Deformation Cracks in Landslides Based on Multidimensional Information Fusion

1
State Key Laboratory of Geo-Hazard Prevention and Geo-Environmental Protection, Chengdu University of Technology, Chengdu 610059, China
2
Zhejiang Huadong Construction Engineering Co., Ltd., Hangzhou 310000, China
*
Author to whom correspondence should be addressed.
Submission received: 3 October 2024 / Revised: 27 October 2024 / Accepted: 28 October 2024 / Published: 31 October 2024
(This article belongs to the Topic Landslides and Natural Resources)

Abstract

:
As cracks are a precursor landslide deformation feature, they can provide forecasting information that is useful for the early identification of landslides and determining motion instability characteristics. However, it is difficult to solve the size effect and noise-filtering problems associated with the currently available automatic crack detection methods under complex conditions using single remote sensing data sources. This article uses multidimensional target scene images obtained by UAV photogrammetry as the data source. Firstly, under the premise of fully considering the multidimensional image characteristics of different crack types, this article accomplishes the initial identification of landslide cracks by using six algorithm models with indicators including the roughness, slope, eigenvalue rate of the point cloud and pixel gradient, gray value, and RGB value of the images. Secondly, the initial extraction results are processed through a morphological repair task using three filtering algorithms (calculating the crack orientation, length, and frequency) to address background noise. Finally, this article proposes a multi-dimensional information fusion method, the Bayesian probability of minimum risk methods, to fuse the identification results derived from different models at the decision level. The results show that the six tested algorithm models can be used to effectively extract landslide cracks, providing Area Under the Curve (AUC) values between 0.6 and 0.85. After the repairing and filtering steps, the proposed method removes complex noise and minimizes the loss of real cracks, thus increasing the accuracy of each model by 7.5–55.3%. Multidimensional data fusion methods solve issues associated with the spatial scale effect during crack identification, and the F-score of the fusion model is 0.901.

1. Introduction

Rockfall, debris flows, and other landslide disasters are major threats to humans [1]. Landslides generally undergo long deformation development and evolution processes before a large-scale instability event occurs. As the ground surface can be affected by different types of stress, ground surfaces also exhibit different crack stages and matching characteristics [2]. Therefore, surface cracks, one of the main features of slope deformation, can provide effective forecasting information that is useful for the early identification of landslides, analyzing landslide instability models, and inferring the deformation direction [3,4]. However, as a result of the terrain conditions in mountainous areas, it is difficult to efficiently achieve accurate large-scale inspection results and refined surface crack measurements in these areas using manual, on-site inspection methods [5].
In recent years, the use of a variety of remote sensing data sources, such as Digital Orthophoto Maps (DOMs), point clouds, and Digital Surface Models (DSM) obtained by aerial remote sensing technology, has become a hot research topic for scholars involved in identifying cracks and landslides [6,7,8]. At present, the methods used to detect surface cracks based on remote sensing data mainly rely on manual visual interpretations and manual measurements [9]. However, these cracks are widely distributed, slender, and multibranched, and the background noise is often complex, causing these manual operations to exhibit certain errors and low efficiencies.
With the development of machine learning and information technologies, crack detection based on digital images has become the primary method for automatically extracting cracks. Effective results have been obtained from many studies aiming to identify cracks in concrete pavement, tunnel linings, building walls, and other single background types [10,11,12]. These studies use traditional image processing algorithms such as edge detection and threshold segmentation to obtain crack edge or texture features to achieve the purpose of crack identification [13]. However, when automatically identifying landslide deformation cracks under complex and changeable background conditions, the use of only image data containing 2D color information is particularly weak [14]. The main defect of these data involves the difficulty in solving the problem of distinguishing surface cracks from background noise (such as shrubs, rock accumulations, and light and shadow differences) [15], resulting in low model recognition accuracies. However, few studies on crack identification and extraction methods based on 3D morphological features have been conducted worldwide [16,17]. Among the existing studies, the main representative method is the use of point cloud data obtained by UAVs, as proposed by Al-Rawabdeh et al. [16], to detect landslide scarps. The study obtained results for bare surface data without vegetation, and no noise-filtering methods were performed with regard to shrubs, boulders, or tree stumps. In addition to traditional target detection methods, using deep learning methods such as the Convolutional Neural Network (CNN) to identify cracks is also an important means, and the recognition accuracy is higher in a certain type of specific scene [18]. However, due to the variability in landslide site environments and the expense of data collection, the model may suffer from underfitting due to insufficient sample size [6,19]. In summary, most current studies focus on crack feature detection in 2D digital images, ignoring the 3D morphological changes caused by different landslide crack scale effects. At the same time, there is no specific filtering algorithm for noise points such as shrubs and shadows in the scene of landslide cracks, which ultimately makes it difficult to comprehensively detect landslide cracks.
In this research, the main purpose is to use point clouds and digital images obtained by UAV-based photography and to achieve automatic identification of landslide cracks under small-scale sample training through image processing, filtering denoising, data fusion, etc. It is necessary to consider the crack image characteristics under different scale conditions and the coupling combination between different characterization indicators. The main innovation is to propose a set of automatic landslide crack detection workflows in which multidimensional remote sensing information is fused, taking into full consideration the recognition effect of landslide cracks under different dimensional data. In this process, a noise filtering method based on crack direction, frequency, and length is proposed to improve the accuracy of crack recognition. The rest of this article is organized as follows: Section 2 describes the basic situation of the study area, the data collection process, and the authenticity of the on-site investigation. Section 3 contains the technical methodological principles of this research, including initial crack extraction, noise filtering, and data fusion. Section 4 provides the crack identification results derived from the above methods in the demonstration area and discusses the different models. Section 5 presents the main conclusions of this research.

2. Description of the Datasets

2.1. Study Area

The mountainous areas in Southwest China feature well-developed valleys and multi-directional rivers, making them key regions for hydropower construction. However, the rise in the upstream water storage level has caused significant water pressure and the erosion of reservoir bank slopes, leading to the rapid development of crack deformation [9]. We selected WuLiPo as the study site, where deformation is occurring on the riverbank of the BaiHeTan Hydropower Station, the second-largest hydropower station in China (Figure 1). WuLiPo spans 300 m in length and 330 m in width, covering an area of approximately 7 × 104 m2. The slope is located on the east side of the Jinsha River in Qiaojia County, Shaotong City, Yunnan Province (Figure 1a). Since this reservoir area was impounded in mid-April 2021, the front-edge slopes closest to the riverbank have deformed and loosened (Figure 1b), gradually forming a chair-shaped topographical structure (Figure 1c). This has resulted in the formation of numerous scarps and tension cracks on the slope surface and highway pavement (now abandoned), which continue to expand over time.

2.2. Data Acquisition and Description

UAV photogrammetry is important for landslide investigation due to its high resolution, large-scale imaging ability, rapid mapping ability, and inexpensive cost [5,20,21]. Compared to airborne light detection and ranging (LiDAR), the data acquisition cost is greatly reduced when UAV photogrammetry is employed. Additionally, it offers greater flexibility in alpine and canyon areas with significant relief. However, its main limitation is the difficulty in obtaining terrain information in densely vegetated areas. With the introduction of SFM (Structure From Motion), multiple overlapping aerial images can capture the 3D features of the target through feature matching and 3D reconstruction [22], which provides an efficient means for obtaining landslide images and terrain information in low-vegetation mountainous areas. There are only a small number of shrubs or branches in the study area, which do not cover the cracks. Therefore, we adopted UAV aerial photography and SFM to complete the image data acquisition and processing. In this research, we used the PIX4D mapper (from URL (accessed on 28 October 2024): https://www.pix4d.com/) software to implement the specific work of aerial image processing.
The FeiMa D200 multirotor drone was selected for the field flight (Figure 2b). The UAV hardware parameters and flight information are listed in Table 1. The terrain-following flight route was planned through preliminary terrain detection (Figure 2c). The 3D point cloud (with a density of approximately 205/m2) was generated in the WuLiPo deformation area through dense image matching (Figure 2e). Next, a DOM with a ground resolution of 3.3 cm was generated through Triangular Irregular Network (TIN) grid construction and automatic texture mapping (Figure 2d). In the collection process, a total of six control points were selected to spatially correct the image (Figure 2a). The plane error of the image before correction was ±0.02059 m and the elevation error was ±0.01697 m. The modeling accuracy could satisfy the subsequent crack identification and evaluation requirements.

2.3. Description of Ground Truth

As shown in Figure 3a, the landslide occurred on a reverse-inclined rock slope with structural characteristics of upper hard rock and lower soft rock. The upper part consists of hard and thick quartz sandstone and dolomite limestone. According to the on-site investigation, the layered rock mass exposed on the surface was broken due to the cutting of two other groups of fractures affected by valley downcutting and unloading [9]. The landslide body formed a potential sliding surface along the interface between the highly weathered rock mass and the medium-weathered rock mass, and a landslide scarp of nearly 200 m was formed on the rear edge. There are a large number of cracks in the middle of the landslide. To evaluate the accuracy and precision of the method, we conducted an interpretation of crack images and field investigation verification in the Wulipo deformation zone from May to July 2021. This investigation identified a total of 133 large-scale tension cracks, accounting for 6.68% of the study area, and a total of 256 local shear cracks on slope surfaces and highways, accounting for 1.31% of the study area. The survey results are shown in Figure 3b. Most of the slope surface cracks are primarily formed in the Quaternary coarse-grained soil layer, exhibiting localized stepped displacements (Figure 3a). The maximum dislocation width is 8.2 m (Figure 3c-④). A large number of shear cracks on the road surface are mainly distributed in the slip areas on both sides of the landslide, resulting in a steep slope of nearly 4.37 m (Figure 3c-⑥). Finally, we numbered each crack and measured their geometric parameter information.

3. Methodology

The DOM and point cloud obtained through UAV photogrammetry are used as data sources. The method can be mainly divided into the following 5 steps. First, we completed the pre-processing of data to achieve data standardization. Second, we analyzed the multidimensional image features and spatial scale effects of surface cracks and defined the scope and type of crack identification. Third, we adopted 6 automatic landslide crack identification models, including roughness, eigenvalue ratio, slope, gray threshold, edge gradient, and supervised classification models, representing remote sensing data sources with different dimensions. Next, the preliminarily extracted cracks were repaired using morphological methods, and 3 filtering methods (the orientation, frequency, and length methods) were proposed to remove errors caused by vegetation and mountain shadows. Finally, the minimum-risk Bayesian probability decision-making method is proposed to perform decision-level fusion on the results of the 6 models listed above. The technical process of this research is shown in Figure 4.

3.1. Data Pre-Processing

Most researchers tackling image-based crack extraction tasks rely on RGB values as the primary features to distinguish cracks from the background surface. In complex mountainous areas, obtaining optical image data necessitates a large amount of information due to low contrast and the challenging working environment [6]. These conditions result in significant noise, small target images, and substantial influence from light and weather conditions. This research employs image contrast enhancement, filtering denoising, and grayscale image pre-processing methods. The purpose is to minimize the noise caused by ground objects or aerial photography angles and normalize the image data.
The Contrast-Limited Adaptive Histogram Equalization processing method (CLAHE) [23] is selected to enhance the image contrast. The principle of this method involves using a fixed-size matrix to collect samples covering the entire image range, setting a certain pixel frequency threshold, and then evenly distributing the regions in which the threshold is exceeded to each gray level. The block-suppressed contrast histogram equalization method increases the differences between cracks and background surfaces under different light intensities and prevents excessive noise amplification, improving the harmoniousness of the entire image. For the filtering and denoising tasks, we adopt bilateral filtering [24]. This algorithm uses 2 Gaussian filters, space and edge intensity difference, to smooth local noise while ensuring the pixel edge features are unchanged; in addition, further crack feature mining often requires only single-variable or binarized image information, so image grayscale processing is necessary. The purpose of this task is to change the crack pixel values from three channels to one channel. This process not only reduces the feature dimensions but also removes irrelevant variables. The weighted conversion grayscale value method is used to convert the three-channel RGB values into a single-channel grayscale value [25]. The above operation process is shown in Figure 5.
During the collection of point cloud data, noise is often generated, which can affect the experimental results. For example, smooth surfaces of soil or rock layers can cause light reflections, while flying birds or dense branches in the air can impact crack extraction accuracy. Therefore, it is necessary to remove these noises using a local distance mean filtering algorithm [26], a common point cloud filtering method that will not be elaborated upon here.

3.2. Crack Identification Model and Principle

3.2.1. Types of Surface Cracks Observed on Landslides

In the early deformation stage, cracks are mostly caused by slope sliding. The width of the cracks in the slope varies under different accumulated deformations. For example, in the WeiHe Basin of China, the initial width of deformation is about 2–10 mm. As the slip surface advances, this deformation width can generally reach 1000–3000 mm [27]. The manifestations of cracks of different sizes in both 2D and 3D spaces are also quite variable.
In the 2D image, the fresh surfaces of cracks formed in slopes composed of different materials exhibit varying colors, but they all produce the same black shadow under sunlight. The shadow is an important feature for identifying cracks [6]. Cracks with different openings produce varying black shadow ranges under different lighting angles. Specifically, cracks with larger openings exhibit almost no black shadows, even if the sunlight is squint, this feature makes the subsidence areas of these cracks indistinguishable from the surrounding flat surfaces (as shown for cracks No. 1 to 3 in Figure 6a). The above reasons make it difficult to separate these cracks from the background surface.
In the 3D scene, the point cloud density generated by the UAV photogrammetry generally ranges from 20–200/m2, and the ground resolution of the DSM ranges from 10–50 cm. A point cloud containing 3D information can effectively reflect surface crack fluctuations and is not affected by the angle of illumination. However, for small deformation cracks, especially those with widths smaller than 10 cm, the identification effectiveness is poor (as shown for cracks No. 4 to 5 in Figure 6b).
Different opening sizes will cause different size effects on the cracks in terms of shape and texture. Landslide cracks can be divided into mode I (opening), mode II (sliding), and mode III (tearing) based on the formation mechanism [3]. In addition, there are also some collapsible or closed cracks on the surface near some landslides [13]. This article mainly considers the recognition effect of different data types on different crack size effects. Therefore, the landslide crack deformation size type is agreed as follows: cracks caused by tensile stress, which include tension cracks, grooves produced by the trailing edges of landslides, and scarps caused by sliding crack deformation, are called “large deformation cracks” (as shown in Figure 6c). The cracks caused by shear stress, expansion, and contraction, which include shear cracks on both sides of the landslide, front-edge bulging and tearing, and collapsible ground cracks, are called “small deformation cracks”.
Due to the scale effects of cracks, it is difficult to comprehensively identify complex slope deformation cracks from any single data source. Considering that the resolution of aerial images captured by drone cameras is approximately 2–5 cm, small deformation cracks can produce noticeable black shadows under most lighting angles, making their identification from ortho-images advantageous. Conversely, point cloud data reflect the 3D steepness information of large deformation cracks. Therefore, we leverage the strengths of both data sources to identify cracks of different sizes.

3.2.2. Crack Extraction Based on Point Cloud

Based on the research foundation provided by Al-Rawabdeh et al. [16], three identification indicators in a point cloud, the roughness, eigenvalue rate, and slope, were obtained to identify large deformation cracks. In terms of data indexing, the K-D tree data structure is widely used for nearest-neighbor searches and approximate nearest-neighbor searches of key data in multidimensional spaces due to its space partitioning ability [28]. The advantage of the K-D tree over the octree is that it is more adaptable, can handle data of any dimension, has higher memory efficiency, especially in low-dimensional cases, usually has better search performance, and performs well when data are unevenly distributed. The K-D tree is used to collect local point cloud (X, Y, and Z) coordinates in a 3D space in this research. This process begins from the first point, collects point cloud data at the specified search radius, and calculates the identification indicators (as shown in Figure 7). The definition of each identification indicator is described below.
The roughness indicator can be defined as the irregularity of the terrain surface. Clear surface roughness differences can be found between undisturbed areas and cracks. Equation (1) can be used to calculate the standard deviation of the Z values of the point cloud under the local search radius to represent the roughness ( r i ) value of the crack region:
r i = Z Z ¯ 2 n 1
where r i is the roughness at point i , Z ¯ is the average height at all points within the search range of point i , and n is the number of all point clouds within the local search range.
The eigenvalue ratio can describe whether a point cloud set has directionality. If the spatial distribution of surface cracks shows a local downward dislocation trend, the corresponding point cloud will show subsidence and concavity on the flat slope. Principal Component Analysis (PCA) is used to calculate the eigenvalue ratios of the 3 principal vectors of all points in the local neighborhood. If a crack has strong ductility in a fixed direction, the eigenvalue λ 1 (the largest eigenvalue) will be much larger than λ 2   o r   λ 3 . At the same time, the λ 2 or λ 3 values will be close to 0. The calculation steps of the local point cloud eigenvalue ratios ( d i ) are described as follows. First, we calculate the covariance matrix ( C O V 3 × 3 ) using Equation (2). The eigenvalues λ 1 , λ 2 , ,   a n d   λ 3 and eigenvectors e 1 , e 2 ,   a n d e 3 can be obtained by matrix transformation using Equation (3) and are arranged in descending order. Equation (4) can then be used to calculate the eigenvalue ratio ( d i ).
C O V 3 × 3 = 1 n i = 1 n P iX P i Y P i Z P cX ¯ P c Y ¯ P c Z ¯ P iX P i Y P i Z P cX ¯ P c Y ¯ P c Z ¯ T
C O V 3 × 3 = W Λ W T = e 1 e 2 e 3 λ 1 0 0 0 λ 2 0 0 0 λ 3 e 1 T e 2 T e 3 T
d i = λ 3 λ 2
where P iX , P i Y , and P i Z are the X , Y , and Z coordinate values of point i , respectively, P cX ¯ , P c Y ¯ , and P c Z ¯ are the X, Y, and Z coordinates of the centroid of the point cloud set, respectively, and C O V 3 × 3 is the covariance matrix.
The slope indicator, which refers to the undulation size of the fitting plane in the point cloud neighborhood set, is an important parameter in slope stability analyses. A sudden change in the gradient between two consecutive steep slopes means that a scarp is present on the slope [29]. The slope angle at each point can be estimated using Equation (5). The fitting plane utilized herein is the coplanar components of e 1 and e 2 , as processed by PCA in the previous step. The 3 eigenvectors are perpendicular to each other, and the slope ( θ i ) can be calculated using Equation (5):
θ i = tan 1 N x 2 + N y 2 N Z 180 π
where θ i is the inclination angle of cloud characterizing point i and N x , N y , and N z are the components of the smallest eigenvector ( e 3 ) about the X-axis, Y-axis, and Z-axis, respectively, in the local point cloud set.

3.2.3. Crack Extraction Based on Digital Images

According to the expression of pixels, digital images can be divided into color images and binary images, and the crack identification methods applied to different pixel units can be divided into three categories: (1) grayscale image threshold segmentation, (2) edge detection methods, and (3) crack supervised classification based on machine learning [30].
The gray threshold segmentation is the simplest crack identification method. Cracks often exist as black pixels in images. Following the grayscale processing task, a binary image can be segmented using a certain gray value threshold to achieve crack extraction (as shown in Figure 8a,b).
Edge detection is the process of judging whether a pixel belongs to the edge of an object by calculating the size of the gray gradient. Edge detection can be used to effectively detect the outline of an object. Therefore, in the field of computer image processing, various edge detection operators have been developed and are often used to identify cracks or abnormal objects [21]. In this research, the Sobel edge detection operator is used to extract surface cracks. The Sobel operator is a first-order derivative function, and its main principle is an edge-enhancement method that causes the extreme value to appear at the edge by weighting the adjacent gray levels in the local window [31]. In this process, two convolution kernels with a size of 3 × 3 in the horizontal and vertical directions are used to perform convolution calculations on each binary pixel; Then, Equations (6)–(8) are used to calculate the pixel gradient value ( f ) to represent the size of the edged feature. When the f value is relatively large, the pixel value changes relatively fast and is biased toward the edge of the contour (as shown in Figure 8c,d):
g x = f x = ( P 7 + 2 P 8 + P 9 ) P 1 + 2 P 2 + P 3
g y = f x = P 3 + 2 P 6 + P 9 P 1 + 2 P 2 + P 7
f = g x 2 + g y 2
where P1–P9 represent the local-window pixel values, g x is the horizontal change gradient, g y is the vertical change gradient, and ∇ f is the first-order pixel gradient.
The maximum likelihood supervised classification based on RGB values is used to complete the image distinction task according to the image features (cracks and surrounding objects). This method is a statistical method based on Bayesian probability. The principle involves obtaining parameters such as the mean and variance in each category by counting the area of interest to determine the classification function. Then, each pixel in the image is substituted into the classification function of each category, and the category exhibiting the largest return value of the function is used as the attribution category of the scanned pixel [32]. Figure 9 provides the spatial distributions of different ground object pixels following the application of the maximum likelihood estimation supervised classification.

3.3. Crack Pixel Repair and Filtering

Following the preliminary extraction process, crack images may still have some noise. Thus, a small portion of the cracks will inevitably be misidentified. We propose 3 filtering algorithms based on the orientation, length, and frequency of cracks. These algorithms can filter out the noises outside the crack area and retain the real pixels unchanged. Notably, to facilitate matrix operations, the data utilized for the crack pixel repair are the binarized raster data following the initial extraction process from the various models described above (a crack is given a value of 1, while the background is given a value of 0). In particular, the extracted crack point cloud is also interpolated into raster data with the same dimensions as the image pixels.
  • Morphological repair
In the centers of the initially extracted crack pixel regions, a small portion of the real crack values may be misidentified, especially when using the pixel gradient model. This model can only identify crack edge features, leading to the overlook of a large number of positive samples. Therefore, the morphological closure operation method is used to repair the pixel [33]. The principle of this method involves applying an expansion morphological operation followed by an erosion operation on the extracted crack binary grid. This process fills gaps in the foreground object while maintaining the overall position and shape of the object. The main steps of this method are described as follows: (1) A fixed-size rectangular sampling frame is used to traverse each binarized pixel (as shown in Figure 10a,b). (2) Expanding pixels are identified by calculating the “|” (OR) operation of all the pixels in the sampling frame with each pixel as the center (that is, when there is at least one true value in the sample, the true value is assigned to the center point) (as shown in Figure 10c). (3) Corroding pixels are identified by performing the “&” (AND) operation on the pixels in the expanded sampling frame (that is, when there is at least one false value in the sample, a false value is assigned to the center point) (as shown in Figure 10d,e).
2.
Orientation filtering
Cracks have a certain directional ductility, meaning they extend in thin branches along one or more orientations. This characteristic is a key feature that distinguishes cracks from background noise (hill shadows, bushes, etc.). The ability of PCA to evaluate crack-directionality ductility has been discussed above. Using this feature, each real crack pixel is examined with a fixed-size rectangular frame. The Eigenvalue ratios ( λ 1 / λ 2 λ 1 > λ 2 ) of the truth pixels can be calculated as the orientation index of each center pixel, and a threshold can be set according to the image crack feature to conduct refiltering to remove pixel points that do not exhibit strong directionality (as shown in Figure 11a,b). Additionally, the real crack pixels are partially filtered if a reduction in the orientation index at the branch junction occurs. If this happens, the morphological repair can be used to perform the repair process once again.
3.
Length and frequency filtering
The excessive use of the local sampling frame filtering method causes distortion in small cracks, especially at the ends of small cracks. Therefore, refined crack-filtering methods should be based on analyzing each crack as a whole to distinguish the crack from background noise. The filtering steps applied in this work are described as follows:
First, we use the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm [34]. We cluster pixels according to the following parameter requirements: Eps = 2 (represents the neighborhood radius when making density accessibility judgments, which is fixed to ensure that the diagonal distance between two adjacent pixels in a crack is not exceeded),and min_samples = 2 (represents the minimum number of samples in a cluster, which is fixed to ensure that each pixel of a crack has at least two adjacent crack pixels within the cluster search radius). These parameter values are set to ensure that adjacent pixels with at least one common endpoint are clustered; then, all densely concentrated binary pixels are classified into a single crack (as shown in Figure 11f). Second, taking each clustered crack as an index unit, the pixel frequency and the minimum circumscribed rectangle diagonal length of each crack are counted and used as the filter indicators of the frequency and length, respectively. Finally, we set a certain refiltering threshold according to the derived distribution characteristics (as shown in Figure 11g,h). The overall analysis of the fitted crack pixels is performed using the DBSCAN clustering index method to prevent the risk of small-crack end regions being filtered out. This method addresses the disadvantages associated with filtering methods such as the mean filtering method, in which a fixed convolution kernel size is used as a window (as shown in Figure 11b–d).

3.4. Multidimensional Information Fusion

As mentioned above, the types and shapes of cracks identified from different dimensions can vary significantly. Therefore, it is necessary to use the mutual compensation abilities of various algorithms to achieve the optimal recognition effect. This process is known as multidimensional information fusion.
At present, image information fusion levels can be mainly divided into three levels: the pixel level, feature level, and decision level [35]. Decision-level information fusion is the most reliable information fusion method. The decision-level information fusion model can effectively overcome the influence of the errors caused by the limitations of the individual information sources on the true and false target evaluation results. Meanwhile, this method permits the absence of some information sources, thus reducing the possibility of global misjudgments arising due to local errors. In this article, we propose a decision-level Bayesian probability fusion method based on minimum risk.
Due to the differences in the data structures of point clouds and images, the recognition results from different dimensions need to be integrated into the same format. Raster data (the file format is “.tif”) are a data model of geographic spatial information that can be used to describe continuous geographic phenomena, such as terrain, land cover, etc. [36]. Therefore, the point cloud feature indicators (Eigenvalue ratios, Roughness, and Slope) of the same range can be converted into raster format, and then the same number of rows, columns, and corner point coordinates as the digital image (grayscale, edge gradient, and RGB) can be set to complete the data registration. For 3D point cloud data, in this work, we use the TerraScan module in Terrasolid software (from URL (accessed on 28 October 2024): https://terrasolid.com/) to perform point cloud Kriging interpolation and generate grids with equal resolution.
Bayesian decision theory is a basic statistical pattern recognition method in which classifiers are generated based on probabilistic data analyses [37]. This fusion method considers the probability of each feature combination relationship being assigned to each decision category. Therefore, it is not necessary to consider the number of features or the affiliation of the fusion information. In fact, each feature combination only needs to be independent of each other combination, thus allowing the full-feature information fusion of the 6 crack identification models used to be realized.
The Bayesian decision principle involves judging the posterior probability that the input feature pattern belongs to each decision category. However, this method does not take into account the risks introduced by each decision. Therefore, we added the risk coefficient generated by each decision in the final selection process to obtain the optimal crack information fusion results under different risk conditions. The applied implementation steps are described as follows:
(1) The number of input features ( n ) and permutation and combination types are determined to obtain the number of multidimensional information fusion sample features ( x i , w h e r e   i = 1 , 2 , 3 , n ).
(2) The output category type is determined by obtaining the final Bayesian decision classification result ( ω j , w h e r e   j = 1 , 2 , 3 , m ).
(3) According to the statistics obtained for different decision results in various scenarios through existing historical cases, the prior probability distribution ( P ω j ) of the decision result is obtained.
(4) The statistical probability ( P x i ω j ) values of the different feature distributions derived under each decision in the sample function are obtained.
(5) The Bayesian posterior probabilities ( P ω j x i ) of different classification results ( ω j ) are obtained under the condition of each feature combination ( x i ) using Equation (9).
P ω j x i = P x i ω j · P ω j c = 1 m P x i ω c · P ω c
(6) The risk coefficient ( λ α , β , where λ α , β ) is determined; this coefficient represents the loss that may be caused when the decision classifies the x feature as the ω α class but x actually belongs to the ω β class (where α , β 1 , 2 , 3 m ).
(7) Finally, it is necessary only to calculate the risk of x falling into each class according to Equation (10) and select the classification R ω α x with the smallest risk as the x i fusion decision.
R ω α x = β = 1 m λ α , β · P ω β x

4. Results and Discussion

4.1. Remote Sensing Data Pre-Processing

To obtain the final 2D image data, we use the OpenCV-python computer vision image processing library (Version: 4.9.0.80) to complete the DOM pre-processing task. This open-source platform provides image-processing algorithm functions, including pixel editing, geometric transformation, image segmentation, etc., which can complete the image data preprocessing in this work. For 3D point cloud data, we use the TerraScan module in Terrasolid software (Version: 10.09.01.01) to denoise the point cloud. The results are shown in Figure 12.

4.2. Initial Extraction

Based on the six crack indicator identification methods proposed in Section 3.2, the point cloud and orthophoto image collected in the demonstration area are processed as follows:
(1) To better highlight terrain changes, and considering that a smaller radius increases sensitivity to noise such as isolated stones, deadwood, and branches, we choose a neighborhood search radius of 1 m. The search radius is adjustable and generally roughly the same as the average width of the cracks to be identified. To obtain the roughness, eigenvalue ratio, and slope, Equations (1), (4) and (5) are used (the results are shown in Figure 13a–c). To facilitate the subsequent matrix operations, the point cloud is converted into a raster with the same unit pixel size as the DOM (the results are shown in Figure 13d–f).
(2) The pre-processed grayscale image is shown in Figure 13h. The OpenCV-python image processing library is used to complete the calculations described by Equations (6)–(8), and the calculated gradient value is scaled to an 8-bit image depth (the results are shown in Figure 13g). For the RGB images, the maximum likelihood method toolbox of the ArcGIS software (Version: 10.3) is used. By combining the RGB images with the characteristics of the WuLiPo image, the pixels are divided into 8 categories and 2000 pixel points are collected manually for each sample category. The sampling positions and prediction results by the maximum likelihood method are shown in Figure 14.
In the six crack identification models listed above, except for the supervised learning classification, all methods are pixel-oriented crack data binarization processing methods. Therefore, the results only need to pass a threshold value to complete the two-classification task to obtain crack pixels (i.e., crack and non-crack pixels). The Receiver Operating Curve (ROC) test can be used to analyze the predictive ability of a classifier under different thresholds [38]. The vertical coordinate of the curve is the True Positive Ratio (TPR), that is, the proportion of positive samples correctly predicted by the model as positive cases under different thresholds: TPR = TP/(TP + FN). The abscissa is obtained by calculating the False Positive Ratio (FPR), that is, the proportion of false positives in all inverse samples derived under different thresholds: FPR = FP/(FP + TN). The meanings of the abbreviations TN, FP, FN, and TP are shown in Figure 15.
In this research, 100,000 pixels representing both positive and negative samples are randomly selected from the test set of large and small deformation cracks, and the 5 binary classification models described above are applied to draw ROC curves (the result is shown in Figure 16).
Figure 16 shows that when identifying large deformation cracks, the AUCs of the slope and roughness classification models are both above 0.8, indicating a good identification effect. The classification effect of the eigenvalue ratio model is poor. The main reason for this result is that this model is highly sensitive to uneven unidirectional changes. Consequently, it has a strong recognition effect on unclassified boulder blocks and low vegetation, resulting in its relatively low recognition accuracy. In the recognition of small deformation cracks, the grayscale segmentation model exhibited the best recognition effect, with an AUC value reaching 0.83. The pixel gradient model has a poor predictive ability, mainly because the edge gradient has an optimal recognition effect only on the boundary crack pixels, while the recognition effect on the black pixels in the middle of the crack and the surrounding regions is not optimal.
On a ROC curve, the closer a point is to (0, 1), the better the segmentation threshold corresponding to that point is, meaning that while ensuring a high positive rate, the false positive rate should be minimized to the greatest possible extent. Therefore, the point with the smallest distance ( x 2 + y 2 ) from (0, 1) on the curve is the optimal segmentation threshold of the corresponding model. Figure 16 shows that when the best segmentation threshold is obtained for various models, the TPR is lower than 80%. To better meet actual engineering needs, it is necessary to reduce the certain threshold standard for the segmentation task. Finally, the corresponding threshold of each model under the true case rate of 0.8 is used as the segmentation threshold, which is set following the adjustment process to complete the semantic segmentation of cracks. The extraction results are shown in Figure 17.

4.3. Repair and Filtering

Figure 17 shows that, with an increase in the TPR, the noise of the segmented false cracks gradually increases, and the overall recognition precision of the model begins to decline. Therefore, using the pixel-repairing and filtering method proposed in Section 3.3, the morphological repair is conducted using a 5 × 5 Sampling range for the operation. Filter processing utilizes three indicators, including the eigenvalue ratio ( λ 1 / λ 2 ), crack frequency, and length, where the filtering threshold is 20% of the maximum value of each indicator. The results are shown in Figure 18.
To evaluate the accuracy of each model after the repair filtering step, the results extracted from each model are compared with the test sets derived from manual field checking. The precision, TPR, and FPR values of the crack identification results are counted based on 500,000 pixels in a typical scene area in WuLiPo (the number of positive and negative samples is set to be the same). The F-score is applied to comprehensively characterize the precision and TPR values. This metric can be used to evaluate the overall characteristics of each model before and after repair filtering. The above precision and F-score calculation methods are shown in Equations (11) and (12). The evaluation results are shown in Table 2 and Figure 19.
P r e c i s i o n = T P T P + F P
F - Score = 2 · P r e c i s i o n · T P R P r e c i s i o n + T P R
The filtering algorithm proposed in this article reduces the FPR of each model by 27.9–82.7%, thus greatly reducing the crack identification error rate. Inevitably, due to the complexity of the background, a small number of real cracks are still filtered out. A small decrease in the TPR can be observed, of which the value ranges between 1.4% and 12.4%, but the accuracy rate is increased by 7.5% to 55.3%. Especially for the edge detection algorithm model, the morphological repair can not only “fill” the positive samples inside the edge contour but the filtering algorithm based on the crack feature can also reduce 82.7% of the noise caused by the boundary shadows of trees and rock blocks. Moreover, the precision metric has increased by 55.3%. Table 2 shows that the F-score of the model exhibited an improvement following the filtering process.
Among the identification methods of small and large deformation cracks, gradient value segmentation and slope segmentation models have the best effect. The crack results extracted by the two models are superimposed and compared with the manual investigation results (as shown in Figure 20).

4.4. Multidimensional Information Fusion Results

4.4.1. The Result of Fusion

Six different models are selected for deformation cracks, and these models are all independent of each other. Therefore, the six studied models can be used together as feature samples. According to the fusion method described in Section 3.4, the detailed decision-level fusion steps are described below.
(1) The decision-level fusion process considers the output of each model, so the judgment results of the six models for a single pixel can have values of only 0 (non-crack) or 1 (crack). The permutation and combination relationships of the six models can be used to determine the total number (N) of all feature combinations: N = 2 × 2 × 2 × 2 × 2 × 2 = 64 .
The number “2” in the calculation of N above represents the possible value of each model result, that is, the sample feature number x = x i x i = A , B , C , D , E , F , A ~ F 0,1 , where A~F refers to the roughness, slope, eigenvalue ratio, grayscale segmentation, supervised classification, and edge gradient values output by each model. The results of each feature displayed in the WuLiPo image are shown in Figure 21a.
(2) The prior probability is a preexisting known quantity in Bayesian probability theory. Therefore, according to the investigation of crack development characteristics and historical data in the demonstration area, P ω 1 = 8.27% and P ω 2 = 91.72%, where ω 1 and ω 2 refer to the judgment results of cracks and noncracks, respectively.
(3) Different feature distribution probabilities ( P x i ω j ) are then obtained. When the results of ω 1 and ω 2 are known, the distribution probabilities P x i ω 1 and P x i ω 2 of x i can be calculated from the training set.
(4) According to Equation (9), the Bayesian posterior probabilities P ω 1 x i and P ω 1 x i of different classification results ( ω j ) under the condition of each feature combination x i are calculated; the results are shown in Figure 21b.
(5) The decision risk coefficient ( λ α , β ) is then substituted. There are four types of decision risk coefficient combinations considered, that is, λ 1 , 1 , λ 1,2 , λ 2 , 1 , a n d   λ 2 , 2 , among which λ 1 , 1 , λ 2 , 2 = 0. To analyze the impacts of different risk factors on the decision-making results, a total of six risk conditions are considered: λ 1 , 1 : λ 1,2 : λ 2 , 1 : λ 2 , 2 = 0 : 1.5 : 1 : 0 , 0 : 1.2 : 1 : 0 , 0 : 1 : 1 : 0 , 0 : 1 : 2 : 0 , 0 : 1 : 4 : 0 , and ( 0 : 1 : 7 : 0 ) .
(6) Finally, through Equation (10), we compare the risk values of R ω 1 x i and R ω 2 x i and perform the decision fusion task (as shown in Figure 22).

4.4.2. Evaluation of the Fusion Results

The same test set as that used in the previous section was used here to calculate the F-scores of the fusion methods. The results are shown in Table 3. The fused model was more robust than any individual recognition model. In addition, it can be seen from the table that the Bayesian decision-level fusion results are not necessarily better than the layer-stacking fusion results. The various risk coefficients exhibit great differences in the fusion results. It can be seen from Figure 23 that as the value of λ 12 : λ 21 decreases, TPR begins to increase, indicating that as the risk ratio decreases, the true-positive rate increases. When the risk coefficient decreases to 0.5, the true-positive rate exceeds the optimal effect of the optimal layer superposition fusion (as shown in Figure 23a). As the true-positive rate increases, the false-positive rate begins to increase due to the decrease in the risk coefficient. When the risk coefficient exceeds 0.25, the probability of judging non-cracks as true cracks suddenly increases, causing the accuracy rate to begin to decline (as shown in Figure 23b,c). At this time, the F-score is maximized with a value of 0.901 (as shown in Figure 23d). Therefore, the crack identification effect using Bayesian probability fusion changes with the change in the risk coefficient. When the crack identification accuracy is given priority, the risk coefficient should be appropriately increased; when the crack identification recall rate is given priority, the risk coefficient should be minimized. If both are considered comprehensively, the risk coefficient should be maintained at 0.25 according to the results of this research.

5. Conclusions

In the research, we investigated the key technologies related to landslide cracks based on multidimensional image data, including their automatic identification, extraction, repair, filtering, and fusion. The major conclusions are as follows:
(1) A methodological system for automatically identifying cracks and collecting crack information based on fusing multidimensional information was constructed. The proposed system addressed the shortcomings of the currently available individual remote sensing data identification technologies and could be applied to identify and investigate large-scale, multidimensional complex landslide deformation cracks. In the pixel unit-oriented automatic identification algorithm, starting from the crack image features of different scales, six different crack identification models were used and tested using the calculated ROC values. The derived AUC values were all between 0.6 and 0.85.
(2) Due to the influence of light angles, shrubs, boulders, and branches, there are often noise points in the identification of cracks, which can mislead people’s judgment of the instability characteristics of the landslide. The morphological repairing and filtering algorithm proposed in this research could effectively reduce the background noise of cracks and improve the overall accuracy of the applied models. Based on morphological repair and orientation, frequency, and length filtering tasks, which could sufficiently remove complex noise points (thus reducing the FPR value) and minimize the loss of real cracks (thus reducing the TPR as much as possible), the precision of each model increased by 7.5–55.3%, making the crack identification results more realistic and reliable.
(3) We proposed Bayesian probability decision-making with minimum risk methods to significantly improve the crack identification ability of any individual model. The highest F-score value obtained for the studied model following this fusion process was 0.901, and the risk coefficient could be adjusted according to the actual needs of a project to obtain improved and more practical results.
The investigation and statistical analysis of landslide cracks conducted in past work were often time-consuming and laborious, especially the crack information measurement work conducted in obviously deformed and damaged highly dangerous areas. The use of remote sensing technology to generate multidimensional data that can be used to automatically identify cracks can achieve an efficient operation method. These technologies can be used to solve the difficulties and bottlenecks encountered in traditional geological disaster investigations and are critical for further promoting the early identification of geological disasters and for ensuring the intelligence and automation of monitoring and early warning systems. As mentioned in the introduction, the environmental conditions of the site are important factors that determine whether the model can accurately identify landslide cracks. The crack identification process proposed in this research has multiple threshold and filtering methods to choose from. These choices are not absolute. Therefore, we systematically summarize the principles and ranges of threshold selection in the method section. However, there may still be problems such as noise filtering transitions and difficulty in processing large datasets. Therefore, determining how to perform adaptive parameter selection and large data cluster processing algorithms may be the focus of research in the next step. In addition, considering the data costs and sparse vegetation in the field, in this research, we adopted the UAV oblique photography method to obtain the target point cloud. However, in densely vegetated fields, the use of airborne LiDAR would effectively “penetrate” the vegetation layer to obtain real surface data, and this could improve the crack identification accuracy [39]. Therefore, if conditions permit, the use of LiDAR to obtain point cloud data is also a good choice.

Author Contributions

B.D.: conceptualization, methodology, investigation, writing—original draft. Q.X.: conceptualization, methodology, resources, funding acquisition. X.D.: formal analysis, investigation, validation, writing—original draft. W.L.: formal analysis. M.W.: writing—original draft. Y.J.: writing—original draft, visualization. Q.H.: writing—original draft. All authors have read and agreed to the published version of the manuscript.

Funding

This research was financially supported by the National Key Research and Development Program of China (2022YFC3003205), the General Program of the National Natural Science Foundation of China (Grant No. 42072306), and the State Key Laboratory of Geohazard Prevention and Geoenvironment Protection Independent Research Project (SKLGP2023Z008).

Data Availability Statement

The raw/processed data required to reproduce these findings cannot be shared at this time as the data also form part of an ongoing study.

Conflicts of Interest

Mingtang Wu is employed by Zhejiang Huadong Construction Engineering Co., Ltd. The authors declare no conflicts of interest. And they have no known competing financial interest or personal relationship that could have appeared to influence the work reported in this paper.

References

  1. Hungr, O.; Leroueil, S.; Picarelli, L. The Varnes Classification of Landslide Types, an Update. Landslides 2014, 11, 167–194. [Google Scholar] [CrossRef]
  2. Xu, Q.; Tang, M.G.; Xu, K.X.; Huang, X.B. Research on space-time evolution laws and early warning-prediction of landslides. Chin. J. Rock Mech. Eng. 2008, 27, 1104–1112. [Google Scholar]
  3. Stumpf, A.; Malet, J.P.; Kerle, N.; Niethammer, U.; Rothmund, S. Image-Based Mapping of Surface Fissures for the Investigation of Landslide Dynamics. Geomorphology 2013, 186, 12–27. [Google Scholar] [CrossRef]
  4. Xu, L.; Dai, F.C.; Tham, L.G.; Zhou, Y.F.; Wu, C.X. Investigating Landslide-related Cracks along the Edge of Two Loess Platforms in Northwest China. Earth Surf. Process. Landf. 2012, 37, 1023–1033. [Google Scholar] [CrossRef]
  5. Xu, Q.; Zhao, B.; Dai, K.; Dong, X.; Li, W.; Zhu, X.; Yang, Y.; Xiao, X.; Wang, X.; Huang, J.; et al. Remote Sensing for Landslide Investigations: A Progress Report from China. Eng. Geol. 2023, 321, 107156. [Google Scholar] [CrossRef]
  6. Cheng, Z.; Gong, W.; Tang, H.; Juang, C.H.; Deng, Q.; Chen, J.; Ye, X. UAV Photogrammetry-Based Remote Sensing and Preliminary Assessment of the Behavior of a Landslide in Guizhou, China. Eng. Geol. 2021, 289, 106172. [Google Scholar] [CrossRef]
  7. Lian, X.; Li, Z.; Yuan, H.; Liu, J.; Zhang, Y.; Liu, X.; Wu, Y. Rapid Identification of Landslide, Collapse and Crack Based on Low-Altitude Remote Sensing Image of UAV. J. Mt. Sci. 2020, 17, 2915–2928. [Google Scholar] [CrossRef]
  8. Fang, C.; Fan, X.; Zhong, H.; Lombardo, L.; Tanyas, H.; Wang, X. A Novel Historical Landslide Detection Approach Based on LiDAR and Lightweight Attention U-Net. Remote Sens. 2022, 14, 4357. [Google Scholar] [CrossRef]
  9. Yi, X.; Feng, W.; Li, B.; Yin, B.; Dong, X.; Xin, C.; Wu, M. Deformation Characteristics, Mechanisms, and Potential Impulse Wave Assessment of the Wulipo Landslide in the Baihetan Reservoir Region, China. Landslides 2023, 20, 615–628. [Google Scholar] [CrossRef]
  10. Jiang, S.; Zhang, J. Real-time Crack Assessment Using Deep Neural Networks with Wall-climbing Unmanned Aerial System. Comput. -Aided Civ. Infrastruct. Eng. 2020, 35, 549–564. [Google Scholar] [CrossRef]
  11. Kheradmandi, N.; Mehranfar, V. A Critical Review and Comparative Study on Image Segmentation-Based Techniques for Pavement Crack Detection. Constr. Build. Mater. 2022, 321, 126162. [Google Scholar] [CrossRef]
  12. Yu, T.; Zhu, A.; Chen, Y. Efficient Crack Detection Method for Tunnel Lining Surface Cracks Based on Infrared Images. J. Comput. Civ. Eng. 2017, 31, 04016067. [Google Scholar] [CrossRef]
  13. Yang, K.; Hu, Z.; Liang, Y.; Fu, Y.; Yuan, D.; Guo, J.; Li, G.; Li, Y. Automated Extraction of Ground Fissures Due to Coal Mining Subsidence Based on UAV Photogrammetry. Remote Sens. 2022, 14, 1071. [Google Scholar] [CrossRef]
  14. Wang, H.; Nie, D.; Tuo, X.; Zhong, Y. Research on Crack Monitoring at the Trailing Edge of Landslides Based on Image Processing. Landslides 2020, 17, 985–1007. [Google Scholar] [CrossRef]
  15. Han, W.; Zhang, X.; Wang, Y.; Wang, L.; Huang, X.; Li, J.; Wang, S.; Chen, W.; Li, X.; Feng, R.; et al. A Survey of Machine Learning and Deep Learning in Remote Sensing of Geological Environment: Challenges, Advances, and Opportunities. ISPRS J. Photogramm. Remote Sens. 2023, 202, 87–113. [Google Scholar] [CrossRef]
  16. Al-Rawabdeh, A.; He, F.; Moussa, A.; El-Sheimy, N.; Habib, A. Using an Unmanned Aerial Vehicle-Based Digital Imaging System to Derive a 3D Point Cloud for Landslide Scarp Recognition. Remote Sens. 2016, 8, 95. [Google Scholar] [CrossRef]
  17. Yu, Y.T.; Li, J.; Guan, H.Y.; Wang, C. 3D Crack Skeleton Extraction from Mobile LiDAR Point Clouds. In Proceedings of the 2014 IEEE Geoscience and Remote Sensing Symposium, Quebec City, QC, Canada, 13–18 July 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 914–917. [Google Scholar] [CrossRef]
  18. Sandric, I.; Chitu, Z.; Ilinca, V.; Irimia, R. Using High-Resolution UAV Imagery and Artificial Intelligence to Detect and Map Landslide Cracks Automatically. Landslides 2024, 21, 2535–2543. [Google Scholar] [CrossRef]
  19. Liu, Z.; Cao, Y.; Wang, Y.; Wang, W. Computer Vision-Based Concrete Crack Detection Using U-Net Fully Convolutional Networks. Autom. Constr. 2019, 104, 129–139. [Google Scholar] [CrossRef]
  20. Lucieer, A.; Jong, S.M.D.; Turner, D. Mapping Landslide Displacements Using Structure from Motion (SfM) and Image Correlation of Multi-Temporal UAV Photography. Prog. Phys. Geogr. Earth Environ. 2014, 38, 97–116. [Google Scholar] [CrossRef]
  21. Ruzgienė, B.; Berteška, T.; Gečyte, S.; Jakubauskienė, E.; Aksamitauskas, V.Č. The Surface Modelling Based on UAV Photogrammetry and Qualitative Estimation. Measurement 2015, 73, 619–627. [Google Scholar] [CrossRef]
  22. Mancini, F.; Dubbini, M.; Gattelli, M.; Stecchi, F.; Fabbri, S.; Gabbianelli, G. Using Unmanned Aerial Vehicles (UAV) for High-Resolution Reconstruction of Topography: The Structure from Motion Approach on Coastal Environments. Remote Sens. 2013, 5, 6880–6898. [Google Scholar] [CrossRef]
  23. Reza, A.M. Realization of the Contrast Limited Adaptive Histogram Equalization (CLAHE) for Real-Time Image Enhancement. J. VLSI Signal Process. -Syst. Signal Image Video Technol. 2004, 38, 35–44. [Google Scholar] [CrossRef]
  24. Tomasi, C.; Manduchi, R. Bilateral Filtering for Gray and Color Images. In Proceedings of the Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271), Bombay, India, 7 January 1998; Narosa Publishing House: Delhi, India, 1998; pp. 839–846. [Google Scholar] [CrossRef]
  25. Zhang, X.; Wang, X. Novel survey on the color-image graying algorithm. In Proceedings of the 2014 IEEE International Conference on Computer and Information Technology, Xi’an, China, 11–13 September 2014; IEEE: Piscataway, NJ, USA, 2014. [Google Scholar]
  26. Han, X.F.; Jin, J.S.; Wang, M.J.; Jiang, W.; Gao, L.; Xiao, L. A Review of Algorithms for Filtering the 3D Point Cloud. Signal Process. Image Commun. 2017, 57, 103–112. [Google Scholar] [CrossRef]
  27. Chen, Z.; Yuan, Z.; Peng, J.; Li, X.; Mao, S.; Hui, X. Basic characteristics about ground crack’s development of weihe basin. J. Eng. Geol. 2007, 15, 441–447. [Google Scholar]
  28. Zhou, K.; Hou, Q.; Wang, R.; Guo, B. Real-Time KD-Tree Construction on Graphics Hardware. ACM Trans. Graph. 2008, 27, 126. [Google Scholar] [CrossRef]
  29. Lu, Y.; Chen, X.; Wang, L. Research on Fracture Mechanism and Stability of Slope with Tensile Cracks. Appl. Sci. 2022, 12, 12687. [Google Scholar] [CrossRef]
  30. Zhang, F.; Hu, Z.; Yang, K.; Fu, Y.; Feng, Z.; Bai, M. The Surface Crack Extraction Method Based on Machine Learning of Image and Quantitative Feature Information Acquisition Method. Remote Sens. 2021, 13, 1534. [Google Scholar] [CrossRef]
  31. Kanopoulos, N.; Vasanthavada, N.; Baker, R.L. Design of an Image Edge Detection Filter Using the Sobel Operator. IEEE J. Solid-State Circuits 1988, 23, 358–367. [Google Scholar] [CrossRef]
  32. Liang, S.; Cheng, J.; Zhang, J. Maximum Likelihood Classification of Soil Remote Sensing Image Based on Deep Learning. Earth Sci. Res. J. 2020, 24, 357–365. [Google Scholar] [CrossRef]
  33. Dougherty, E.R.; Lotufo, R.A. Hands-on Morphological Image Processing; SPIE Press: Bellingham, WA, USA, 2003. [Google Scholar]
  34. Birant, D.; Kut, A. ST-DBSCAN: An Algorithm for Clustering Spatial–Temporal Data. Data Knowl. Eng. 2007, 60, 208–221. [Google Scholar] [CrossRef]
  35. Meher, B.; Agrawal, S.; Panda, R.; Abraham, A. A Survey on Region Based Image Fusion Methods. Inf. Fusion 2019, 48, 119–132. [Google Scholar] [CrossRef]
  36. Johnson, D.L.; Miller, A.C. A Spatially Distributed Hydrologic Model Utilizing Raster Data Structures. Comput. Geosci. 1997, 23, 267–272. [Google Scholar] [CrossRef]
  37. Körding, K.P.; Wolpert, D.M. Bayesian Decision Theory in Sensorimotor Control. Trends Cogn. Sci. 2006, 10, 319–326. [Google Scholar] [CrossRef] [PubMed]
  38. Hajian-Tilaki, K. Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Casp. J. Intern. Med. 2013, 4, 627–635. [Google Scholar]
  39. Guo, C.; Xu, Q.; Dong, X.; Li, W.; Zhao, K.; Lu, H.; Ju, Y. Geohazard Recognition and Inventory Mapping Using Airborne LiDAR Data in Complex Mountainous Areas. J. Earth Sci. 2021, 32, 1079–1091. [Google Scholar] [CrossRef]
Figure 1. Location of study area: (a) the location and traffic conditions of the study area on satellite images; (b) optical image of the landslide (photographed by UAV in May 2021); (c) main deformation area and DSM at the landslide site.
Figure 1. Location of study area: (a) the location and traffic conditions of the study area on satellite images; (b) optical image of the landslide (photographed by UAV in May 2021); (c) main deformation area and DSM at the landslide site.
Remotesensing 16 04075 g001
Figure 2. Flight route and terrain products of the UAV operation in WuLiPo: (a) planned flight plane route and checkpoint positions; (b) FeiMa D200 drone; (c) terrain-following flight route; (d) DOM; (e) 3D point cloud.
Figure 2. Flight route and terrain products of the UAV operation in WuLiPo: (a) planned flight plane route and checkpoint positions; (b) FeiMa D200 drone; (c) terrain-following flight route; (d) DOM; (e) 3D point cloud.
Remotesensing 16 04075 g002
Figure 3. Results of the field investigation at Wulipo: (a) section A–A’ and material composition characteristics; (b) manual survey results of cracks; (c) on-site photos of the main cracks (numbers correspond to the shooting range of the black rectangular frame in (b)).
Figure 3. Results of the field investigation at Wulipo: (a) section A–A’ and material composition characteristics; (b) manual survey results of cracks; (c) on-site photos of the main cracks (numbers correspond to the shooting range of the black rectangular frame in (b)).
Remotesensing 16 04075 g003
Figure 4. Flow chart showing the automatic landslide crack detection process utilizing multidimensional data fusion.
Figure 4. Flow chart showing the automatic landslide crack detection process utilizing multidimensional data fusion.
Remotesensing 16 04075 g004
Figure 5. Schematic diagram representing the image pre-processing method.
Figure 5. Schematic diagram representing the image pre-processing method.
Remotesensing 16 04075 g005
Figure 6. 2D and 3D characteristics of landslide cracks with different scales: (a,b) the texture and morphology of the same landslide crack in the image and point cloud (the same numbered frames represent crack comparisons at the same location); (c) schematic diagram of the tensile crack formation; (d) schematic diagram of the shear crack formation. The base map of c and d is digitized from [3].
Figure 6. 2D and 3D characteristics of landslide cracks with different scales: (a,b) the texture and morphology of the same landslide crack in the image and point cloud (the same numbered frames represent crack comparisons at the same location); (c) schematic diagram of the tensile crack formation; (d) schematic diagram of the shear crack formation. The base map of c and d is digitized from [3].
Remotesensing 16 04075 g006
Figure 7. Schematic diagram of the principle by which a K-D tree is used to search the local neighborhood in the point cloud and generate various crack-extraction indicators.
Figure 7. Schematic diagram of the principle by which a K-D tree is used to search the local neighborhood in the point cloud and generate various crack-extraction indicators.
Remotesensing 16 04075 g007
Figure 8. Schematic diagram showing the crack edge threshold segmentation principle in which grayscale images and the Sobel gradient map are used: (a) grayscale image of a crack; (b) local grayscale feature of the crack and the background surface; (c) gradient feature of the crack image processed by the Sobel operator; and (d) edge binarization effect of the crack image.
Figure 8. Schematic diagram showing the crack edge threshold segmentation principle in which grayscale images and the Sobel gradient map are used: (a) grayscale image of a crack; (b) local grayscale feature of the crack and the background surface; (c) gradient feature of the crack image processed by the Sobel operator; and (d) edge binarization effect of the crack image.
Remotesensing 16 04075 g008
Figure 9. Object classification results derived based on maximum likelihood supervision: (a) image map and sampling points; (b) distribution of objects after the classification process; (c,d) spatial distribution and categories of sample pixels before and after the classification process, respectively.
Figure 9. Object classification results derived based on maximum likelihood supervision: (a) image map and sampling points; (b) distribution of objects after the classification process; (c,d) spatial distribution and categories of sample pixels before and after the classification process, respectively.
Remotesensing 16 04075 g009
Figure 10. Process by which the crack binary image is repaired using the morphological closure operation: (a,b) cracks and background surfaces after binarization, respectively; (c,d) expanding and corroding effects of local crack pixels, respectively; and (e) repaired crack.
Figure 10. Process by which the crack binary image is repaired using the morphological closure operation: (a,b) cracks and background surfaces after binarization, respectively; (c,d) expanding and corroding effects of local crack pixels, respectively; and (e) repaired crack.
Remotesensing 16 04075 g010
Figure 11. Schematic diagram showing the principles of the crack-filtering method: (a) local image of crack binary image orientational filtering convolution (from the rectangular box in (d)); (b) eigenvalues after orientational filter convolution; (c) original image with cracks; (d) preliminary automatically extracted crack image; (e) crack identification image after orientation, frequency, and length filtering; (f) principle by which a single crack is clustered using DBSCAN; (g) local characteristics of cracks after clustering (from the rectangular box in (d)); and (h) local characteristics of cracks after orientation and frequency filtering (from the rectangular box in (e)).
Figure 11. Schematic diagram showing the principles of the crack-filtering method: (a) local image of crack binary image orientational filtering convolution (from the rectangular box in (d)); (b) eigenvalues after orientational filter convolution; (c) original image with cracks; (d) preliminary automatically extracted crack image; (e) crack identification image after orientation, frequency, and length filtering; (f) principle by which a single crack is clustered using DBSCAN; (g) local characteristics of cracks after clustering (from the rectangular box in (d)); and (h) local characteristics of cracks after orientation and frequency filtering (from the rectangular box in (e)).
Remotesensing 16 04075 g011
Figure 12. WuLiPo orthophoto image and 3D point cloud pre-processing results.
Figure 12. WuLiPo orthophoto image and 3D point cloud pre-processing results.
Remotesensing 16 04075 g012
Figure 13. Recognition results of the WuLiPo cracks obtained by each model: (ac) calculation results of the point cloud roughness, eigenvalue ratios, and slope, respectively; (df) grid conversion results corresponding to panels (ac); (g) binary image transformed by Sobel; (h) preprocessed grayscale image.
Figure 13. Recognition results of the WuLiPo cracks obtained by each model: (ac) calculation results of the point cloud roughness, eigenvalue ratios, and slope, respectively; (df) grid conversion results corresponding to panels (ac); (g) binary image transformed by Sobel; (h) preprocessed grayscale image.
Remotesensing 16 04075 g013
Figure 14. WuLiPo image classification results derived based on maximum likelihood supervised learning: (a) Orthophoto and manually selected sampling locations; (b) distribution of sample categories after prediction.
Figure 14. WuLiPo image classification results derived based on maximum likelihood supervised learning: (a) Orthophoto and manually selected sampling locations; (b) distribution of sample categories after prediction.
Remotesensing 16 04075 g014
Figure 15. Crack pixel binary classification confusion matrix.
Figure 15. Crack pixel binary classification confusion matrix.
Remotesensing 16 04075 g015
Figure 16. ROC curve test results of each crack identification and classification model.
Figure 16. ROC curve test results of each crack identification and classification model.
Remotesensing 16 04075 g016
Figure 17. Semantic segmentation results of cracks in WuLiPo derived using various models.
Figure 17. Semantic segmentation results of cracks in WuLiPo derived using various models.
Remotesensing 16 04075 g017
Figure 18. Effects of the repairing and filtering processes on the initial extraction crack results of each model (The red areas in the image are the identified crack pixels).
Figure 18. Effects of the repairing and filtering processes on the initial extraction crack results of each model (The red areas in the image are the identified crack pixels).
Remotesensing 16 04075 g018
Figure 19. Statistical chart of TPR, FPR, and precision metrics of the crack extraction models before and after repair filtering.
Figure 19. Statistical chart of TPR, FPR, and precision metrics of the crack extraction models before and after repair filtering.
Remotesensing 16 04075 g019
Figure 20. Crack identification results of Wulipo: (a) automatic detection results of gradient value segmentation and slope segmentation model; (b) manual investigation results.
Figure 20. Crack identification results of Wulipo: (a) automatic detection results of gradient value segmentation and slope segmentation model; (b) manual investigation results.
Remotesensing 16 04075 g020
Figure 21. Distribution of the image fusion features and the posterior probability comparison results derived for WuLiPo cracks: (a) distribution of 64 fusion feature samples; (b) posterior probability values of 64 fusion feature samples, and the reference line with red font indicates that the fusion result has equal probability of cracks and non-cracks.
Figure 21. Distribution of the image fusion features and the posterior probability comparison results derived for WuLiPo cracks: (a) distribution of 64 fusion feature samples; (b) posterior probability values of 64 fusion feature samples, and the reference line with red font indicates that the fusion result has equal probability of cracks and non-cracks.
Remotesensing 16 04075 g021
Figure 22. Bayesian probability fusion results derived under different risk factors: (af) are the results of crack fusion recognition when λ 1 , 1 : λ 1,2 : λ 2 , 1 : λ 2 , 2 = (0:1.5:1:0), (0:1.2:1:0), (0:1:1:0), (0:1:2:0), (0:1:4:0), and (0:1:7:0), respectively. The red areas in the image are the identified crack pixels.
Figure 22. Bayesian probability fusion results derived under different risk factors: (af) are the results of crack fusion recognition when λ 1 , 1 : λ 1,2 : λ 2 , 1 : λ 2 , 2 = (0:1.5:1:0), (0:1.2:1:0), (0:1:1:0), (0:1:2:0), (0:1:4:0), and (0:1:7:0), respectively. The red areas in the image are the identified crack pixels.
Remotesensing 16 04075 g022
Figure 23. Changes in model evaluation indicators under different risk ratios based on Bayesian probability fusion: (ad) represent the changes of TPR, FPR, Precision, and F-score under different λ 12 : λ 21 , respectively.
Figure 23. Changes in model evaluation indicators under different risk ratios based on Bayesian probability fusion: (ad) represent the changes of TPR, FPR, Precision, and F-score under different λ 12 : λ 21 , respectively.
Remotesensing 16 04075 g023
Table 1. UAV hardware parameters and flight information.
Table 1. UAV hardware parameters and flight information.
Hardware Parameters of the DroneField Flight Configuration
Drone modelFeiMa D200 DroneGround spatial resolution1.5 cm
Load modelSONY ILCE-6000Overlap rate between flight bands80%
Sensor size23.5 × 15.6 mmCourse overlap rate85%
Effective Pixels6000 × 4000Aerial photography altitude80 m
Lens parameters20 mm (focus)Number of routes13
Differential modePPK/RTK and its fusion modeRoute length3.9 km
Control radius5 kmAerial photography area0.07 km2
Table 2. Two-class evaluation results of crack identification model before and after repair filtering.
Table 2. Two-class evaluation results of crack identification model before and after repair filtering.
CategoryMethodTPRFPRPrecisionF-Score
Small deformation cracksGrayscale value segmentation modelInitial81.32%32.54%71.42%0.760
After filtering71.23%21.5%76.81%0.739
Supervised classification modelInitial79.67%29.64%72.88%0.761
After filtering72.31%17.34%80.66%0.763
Gradient value segmentation modelInitial79.84%59.63%57.24%0.667
After filtering82.56%10.32%88.89%0.856
Large deformation crackEigenvalue Ratios segmentation modelInitial77.34%48.32%61.54%0.685
After filtering73.54%34.86%67.84%0.706
Roughness segmentation modelInitial82.45%31.98%72.05%0.769
After filtering76.35%18.86%80.19%0.782
Slope segmentation modelInitial78.46%27.64%73.95%0.761
After filtering77.36%13.56%85.09%0.810
Table 3. Statistics of the multidimensional data fusion results.
Table 3. Statistics of the multidimensional data fusion results.
Fusion MethodTypeTPRFPRPrecisionF-Score
The Layer stacking fusionSlope + Edge Gradient80.81%12.74%86.38%0.835
The Bayesian probability fusion method based on minimum risk λ 12 : λ 21 = 1.5 : 1 1.24%0.13%90.51%0.024
λ 12 : λ 21 = 1.2 : 1 61.59%3.24%95.03%0.750
λ 12 : λ 21 = 1 : 1 76.67%8.97%89.53%0.826
λ 12 : λ 21 = 1 : 2 83.73%10.64%87.15%0.864
λ 12 : λ 21 = 1 : 4 92.36%12.36%85.37%0.901
λ 12 : λ 21 = 1 : 7 93.52%34.32%71.96%0.822
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Deng, B.; Xu, Q.; Dong, X.; Li, W.; Wu, M.; Ju, Y.; He, Q. Automatic Method for Detecting Deformation Cracks in Landslides Based on Multidimensional Information Fusion. Remote Sens. 2024, 16, 4075. https://doi.org/10.3390/rs16214075

AMA Style

Deng B, Xu Q, Dong X, Li W, Wu M, Ju Y, He Q. Automatic Method for Detecting Deformation Cracks in Landslides Based on Multidimensional Information Fusion. Remote Sensing. 2024; 16(21):4075. https://doi.org/10.3390/rs16214075

Chicago/Turabian Style

Deng, Bo, Qiang Xu, Xiujun Dong, Weile Li, Mingtang Wu, Yuanzhen Ju, and Qiulin He. 2024. "Automatic Method for Detecting Deformation Cracks in Landslides Based on Multidimensional Information Fusion" Remote Sensing 16, no. 21: 4075. https://doi.org/10.3390/rs16214075

APA Style

Deng, B., Xu, Q., Dong, X., Li, W., Wu, M., Ju, Y., & He, Q. (2024). Automatic Method for Detecting Deformation Cracks in Landslides Based on Multidimensional Information Fusion. Remote Sensing, 16(21), 4075. https://doi.org/10.3390/rs16214075

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop