Article

A Machine Learning Approach for the Autonomous Identification of Hardness in Extraterrestrial Rocks from Digital Images

by Shuyun Liu 1,2, Haifeng Zhao 1,2,*, Zihao Yuan 2, Liping Xiao 1,2, Chengcheng Shen 1,2, Xue Wan 1,2, Xuhai Tang 3,4 and Lu Zhang 2,*
1 University of Chinese Academy of Sciences, Beijing 100039, China
2 Technology and Engineering Center for Space Utilization, Chinese Academy of Sciences, Beijing 100094, China
3 School of Civil Engineering, Wuhan University, Wuhan 430072, China
4 Wuhan University Shenzhen Research Institute, Shenzhen 518057, China
* Authors to whom correspondence should be addressed.
Submission received: 12 December 2024 / Revised: 30 December 2024 / Accepted: 31 December 2024 / Published: 31 December 2024
(This article belongs to the Special Issue Aerospace Technology and Space Informatics)

Abstract

Understanding rock hardness on extraterrestrial planets offers valuable insights into planetary geological evolution. Rock hardness correlates with morphological parameters, which can be extracted from navigation images, bypassing the time and cost of rock sampling and return. This research proposes a machine-learning approach to predict extraterrestrial rock hardness using morphological features. A custom dataset of 1496 rock images, including granite, limestone, basalt, and sandstone, was created. Ten features, such as roundness, elongation, convexity, and Lab color values, were extracted for prediction. A foundational model combining Random Forest (RF) and Support Vector Regression (SVR) was trained through cross-validation. The output of this model was used as the input for a meta-model, which undergoes linear fitting to predict Mohs hardness, forming the Meta-Random Forest and Support Vector Regression (MRFSVR) model. The model achieved an R² of 0.8219, an MSE of 0.2514, and a mean absolute error of 0.2431 during validation. Meteorite samples were used to validate the MRFSVR model’s predictions. The model was then used to predict the hardness distribution of extraterrestrial rocks using images from the Tianwen-1 Mars Rover Navigation and Terrain Camera (NaTeCam) and a simulated lunar rock dataset from an open-source website. The results demonstrate the method’s potential for enhancing extraterrestrial exploration.

1. Introduction

In recent years, interest in deep space exploration has grown, with missions often focusing on the Moon, Mars, and other celestial bodies [1]. Researchers across disciplines have studied these bodies extensively, with particular attention to extraterrestrial rocks. These rocks, abundant on the surfaces of celestial bodies, are crucial components of their crusts. They provide valuable insights into planetary geology, including the temperature and pressure conditions under which they formed. These data are essential for understanding the chemical composition and geological environments of planets [2]. Furthermore, rocks undergo physical and chemical changes after their formation, as evidenced by their altered shape characteristics, which indirectly reflect planetary processes such as climate and erosion. Studying and analyzing surface rocks with specific characteristics therefore yields insights into the geological evolution of these celestial bodies and provides a material basis for addressing scientific questions related to the search for life in the cosmos [3,4,5].
NASA’s Mars rover “Curiosity” has spent over 1660 Martian days on the surface, where one Martian day lasts 24 h, 37 min, and 22 s. It has traversed approximately 16 km and ascended geological profiles to a height of about 220 m. Along its journey, it has gathered 16 rock samples and numerous fine-grained sediment samples, which have undergone detailed analysis to obtain mineralogical and chemical information, recording hundreds of target compositions [6]. The rover also used the Mars Hand Lens Imager (MAHLI) to capture numerous images of the Martian surface, facilitating a direct and rapid understanding of the planet. Moreover, “Spirit” and “Opportunity”, the two rovers of NASA’s Mars Exploration Rover (MER) program, were each equipped with a panoramic camera (Pancam), a microscopic imager (MI), and a rock abrasion tool (RAT). The Pancam is a high-resolution color stereo CCD camera capable of investigating the structure, geology, and mineralogy of the landing sites [7]. The MI employs CCD detectors and electronic components identical to those used in the Pancam to capture high-resolution images of rocks. The RAT can remove the dust covering a rock within a circular area 45 mm in diameter, working in direct contact with the target to expose the true material components for further investigation [8].
Xiao et al. [9] detected minerals in their custom Martian rock dataset for navigation avoidance using principal component analysis and low-level representation detection methods. Yang et al. [10] developed an open framework for classifying Martian rock types. This framework uses the spectral features of rocks to characterize rock composition and physical properties, ultimately achieving rock-type identification. In 2023, Tang et al. [11] conducted micro-scale mechanical experiments (micro-RME) to determine the macroscopic modulus of meteorites. They used three methods, Voigt-Reuss-Hill (V-R-H), Mori-Tanaka (M-T), and an accurate grain model (AGBM), to estimate the modulus of granite samples and the HaH346 meteorite. The study demonstrated that the AGBM method offers a viable approach to investigating the mechanical properties of meteorites. Kelsey et al. [12] used open data from the Mars Odyssey mission to determine the elastic modulus, unconfined compressive strength, and other properties of Martian soil. The study revealed that both the elastic modulus and the unconfined compressive strength of Martian soil increase with depth. Marteau et al. [13] used the grinding drill tool of the Mars 2020 Perseverance rover to evaluate the strength of three simulated Martian soil types with differing mechanical properties. This study showcased the effectiveness of standard scientific instruments in controlled soil analysis, introducing methods and technologies relevant to future Mars missions. Foucher et al. [14] applied the CaliPhoto method to process images of rock powders captured by standard digital cameras, investigating the classification of igneous rock powders. Under uncontrolled lighting conditions, the method achieved over 90% accuracy. Gutiérrez-Cano et al. [15] used an improved near-field scanning microwave microscope to study the distribution and properties of minerals in rocks through non-contact complex permittivity measurements. This method can produce high-resolution permittivity maps of rocks, offering a new tool for mineral identification and microwave energy applications in the geological field.
Nowadays, artificial intelligence technology is extensively used in the study of rock properties. In 2021, Allan et al. [16] applied different machine-learning methods to evaluate the classification performance of rock types in geotechnical engineering. Di et al. [17] employed the mean shift algorithm to partition the images obtained by the “Curiosity” rover into homogeneous objects. Subsequently, they extracted large-scale rocks from the 3D point cloud data, fitted the ground surface, and determined the physical characteristics of the rocks, including angularity, roundness, width, height, and width-to-height ratio. Houshmand et al. [18] employed deep learning techniques to classify rock types using physical, chemical, and core imaging data. They used core sample images from five different rock types, as well as rock Mohs hardness and chemical properties, for the classification. In 2017, Hong et al. [19] introduced a method that combines image processing, fractal theory, and artificial neural networks to quantitatively determine geotechnical strength indicators from rock surface images. In 2023, Lee et al. [20] used machine learning to predict rock Leeb hardness based on the physical and chemical characteristics of rocks. Tang et al. [21] used convolutional neural networks, combined with physical data, to efficiently assess rock permeability from three-dimensional (3D) images.
For manned space missions, astronauts require field research skills to conduct planetary exploration, necessitating specialized training and reliable support equipment. The European Space Agency (ESA) developed the Electronic FieldBook (EFB) to assist in this endeavor. This platform enables astronauts to gather geolocated data during geological surveys, interact with sensors, record samples, and take notes. The ground science team can analyze these data in real time and provide guidance to astronauts. Additionally, the EFB can execute neural network models for tasks such as geological or mineral classification, enabling astronauts to autonomously classify samples. As artificial intelligence advances, it will increasingly assist astronauts in decision-making, signaling a deepening connection between planetary exploration and field research [22].
The five types of rocks selected in this study, granite, limestone, rhyolite, basalt, and sandstone, possess significant geological value due to their long histories. Granite, an intrusive igneous rock composed primarily of feldspar and quartz, forms deep in the Earth’s crust. Its formation process is slow, leading to an old geological age. Limestone, a typical sedimentary rock, forms in shallow marine environments and is mainly composed of calcium carbonate. Rhyolite typically forms during volcanic activity, and its age is tied to the timing of the eruptions, generally in relatively recent geological periods. Basalt, originating from mantle-derived magma, also forms during volcanic eruptions and reflects the geological age of the volcanic event. Sandstone often forms in environments such as rivers, beaches, or deserts. Its geological age varies widely, ranging from hundreds of thousands to hundreds of millions of years, depending on the sedimentary conditions and diagenetic processes [23]. While these rocks differ in their origins and geological ages, their mechanical properties, particularly hardness, correlate strongly with their fracture behavior. The hardness of rocks is closely related to their tendency to fracture in a brittle manner. Granite is typically a hard rock that is less ductile and more prone to clean, sharp fractures under stress. In contrast, limestone, which has medium hardness, often develops irregular fractures. Rhyolite, with both high hardness and high silica content, commonly forms columnar joints and radial fractures, which are typical of rocks formed during volcanic eruptions. Similarly, basalt is characterized by columnar joint fractures. These distinct surface textures provide a foundation for predicting rock hardness through morphological analysis of rock images, as variations in hardness directly influence fracture patterns [24,25].
This paper employs a regression model to predict rock hardness based on geometric shape features and color texture. A custom rock dataset comprising five classes (granite, limestone, rhyolite, basalt, and sandstone) is prepared, encompassing igneous and sedimentary rocks akin to those found on Mars and the Moon. A total of 1494 rock images are captured and processed with binarization and noise reduction techniques. Finally, 10 features are extracted from each image, including roundness, ellipticity, convexity, elongation, entropy, correlation, homogeneity, contrast, fractal dimensionality (FD), and Lab-mean value. Here, the Lab-mean value is the mean value in the device-independent Lab color space, which specifies colors along three separate scales: lightness (L), red to green (a), and yellow to blue (b). Four machine learning algorithms are compared: linear regression, RF, SVR, and the MRFSVR model proposed in this paper. The MRFSVR model outperforms the other algorithms in predicting rock hardness. Mohs hardness, recorded in numerous authoritative mineralogy handbooks, is chosen as the prediction target due to its strong correlation with Leeb hardness, which can be used to estimate uniaxial compressive strength (UCS), thus offering a better correlation with other rock properties. The future of deep space exploration will increasingly rely on a combination of machine learning and human expertise to support both manned and unmanned missions. This study aids rovers in swiftly identifying the mechanical properties of planetary surface rocks, and in the future it could also assist astronauts in planetary geology exploration. The conceptual diagram of the working scenario is shown in Figure 1.
Despite significant advancements in identifying extraterrestrial rock properties, a gap remains in leveraging geometric features and machine learning to predict hardness. This study aims to fill this gap by developing a predictive model based on rock morphological parameters, validated through extraterrestrial rock datasets.
The paper is structured as follows: Section 2 presents the conceptual framework of the model, including principles and metrics. Section 3 describes the rock datasets and the extraction of rock features. Section 4 details the generation of training data for the linear regression, SVR, RF, and MRFSVR algorithms proposed in this paper. This section also presents the main results, including the validation of the algorithms, testing of four architectures, and the impact of morphology features on rock properties. Section 5 provides a conclusion.

2. Method

2.1. Meta Learning of Random Forests and Support Vector Regression (MRFSVR)

In this study, a meta-learning approach combining Random Forest and Support Vector Regression (MRFSVR) is employed to train the rock hardness prediction model. Meta learning is a machine-learning paradigm built on two key concepts: base models and a meta-model. A base model is trained for a specific task, while the meta-model is trained across multiple tasks to learn how to adjust or initialize the base model.
The meta-learning methodology used in this study involves k-fold cross-validation. The dataset is divided into k subsets of approximately equal size, followed by k iterations [26]. In each iteration, one subset is selected as the test set, while the remaining k-1 subsets serve as the training set. The performance is evaluated over the k iterations, and the average is taken as the final performance metric. In this study, k is set to 20. The foundational model is built by combining an RF model and an SVR model, both trained using k-fold cross-validation. The predictions from the foundational models are then used as inputs for the meta-model, which undergoes linear fitting to consolidate the prediction results into the final predicted hardness. The entire training process and resulting model are referred to as the MRFSVR model. The flowchart of the MRFSVR process is depicted in Figure 2, and Figure 3 illustrates the process of k-fold cross-validation.
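The stacking procedure described above can be sketched roughly as follows. This is a minimal illustration that assumes scikit-learn and NumPy, not the authors' actual code: the function names, the synthetic data, and the choice to refit the base models on the full dataset after cross-validation are all assumptions made for the example. The defaults follow the stated settings (k = 20 folds, a random state of 45), while the demo call uses smaller values only to keep it quick.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold, cross_val_predict
from sklearn.svm import SVR


def fit_stacked_model(X, y, n_splits=20, n_trees=200, seed=45):
    """Fit RF and SVR base models plus a linear meta-model (hypothetical helper)."""
    cv = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    base_models = [
        RandomForestRegressor(n_estimators=n_trees, random_state=seed),
        SVR(kernel="sigmoid"),
    ]
    # Out-of-fold predictions: every sample is predicted by a model
    # that did not see it during training.
    oof = np.column_stack(
        [cross_val_predict(model, X, y, cv=cv) for model in base_models]
    )
    # The meta-model is a linear fit on the base-model predictions.
    meta_model = LinearRegression().fit(oof, y)
    # Assumption: refit each base model on all data for later prediction.
    for model in base_models:
        model.fit(X, y)
    return base_models, meta_model


def predict_stacked(base_models, meta_model, X):
    """Feed base-model predictions through the linear meta-model."""
    base_preds = np.column_stack([model.predict(X) for model in base_models])
    return meta_model.predict(base_preds)


# Synthetic stand-in for 10 image features and Mohs hardness targets.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(120, 10))
y_demo = 5.0 + X_demo[:, 0] - 0.5 * X_demo[:, 1] ** 2 + rng.normal(scale=0.1, size=120)

base_models, meta_model = fit_stacked_model(X_demo, y_demo, n_splits=5, n_trees=20)
y_pred = predict_stacked(base_models, meta_model, X_demo[:5])
```

Training the meta-model on out-of-fold predictions, rather than on predictions for samples the base models have already seen, keeps the linear fit from simply rewarding whichever base model overfits most.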

2.2. Method of Support Vector Regression (SVR)

SVR is a branch of the support vector machine family that is applied to fit or predict data [27]. In this study, SVR is employed as a base model to predict rock hardness. SVR maps the sample points to a higher-dimensional feature space and constructs an optimal hyperplane there to predict target values. It takes feature vectors $X$ as input and target values $y$ as output, treating each feature vector and its corresponding target value as one data point. The goal of SVR is to learn a function $f(X)$ that best fits the training data while keeping the prediction error within a certain tolerance $\varepsilon$. In other words, SVR constrains the target variable $y$ to a narrow strip-shaped region whose width is governed by the hyperparameter $\varepsilon$. Traditional regression methods count a prediction as correct only when $f(X)$ exactly equals $y$, whereas SVR accepts any sufficiently small deviation between $f(X)$ and $y$. As a result, SVR has better generalization ability and robustness. The SVR regression function is as follows:
$$f(X) = \omega^{T}\varphi(X) + b$$
where $X$ is the input feature vector, $\omega$ is the weight vector in feature space, $\varphi(X)$ is a nonlinear mapping function, and $b$ is the bias term. The principle of SVR is shown in Figure 4.
SVR minimizes the $L_2$ norm of the weight vector through a composite objective function comprising a loss term and a regularization term. This optimization aims to determine the regression function that falls within the predefined tolerance boundaries, and the role of the $L_2$ norm is to prevent SVR training from overfitting. The optimization problem is as follows:
$$\min \frac{1}{2}\lVert \omega \rVert^{2}$$
$$\text{s.t.}\quad \left| y_i - \left( \omega^{T}\varphi(X_i) + b \right) \right| \le \varepsilon, \quad \forall i$$
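To make the role of the tolerance ε concrete, the short sketch below evaluates the ε-insensitive loss that is commonly paired with this formulation in soft-margin SVR: deviations inside the tube cost nothing, and deviations outside it are penalized linearly. The function name and the hardness values are hypothetical, and the soft-margin form is an assumption, since the text only states the hard constraint.

```python
def epsilon_insensitive_loss(y_true, y_pred, epsilon):
    """Per-sample loss: zero inside the tube of half-width epsilon, linear outside."""
    return [max(0.0, abs(t - p) - epsilon) for t, p in zip(y_true, y_pred)]


# Hypothetical hardness values, chosen only for illustration.
y_true = [6.5, 3.0, 7.0]
y_pred = [6.25, 3.5, 5.0]
losses = epsilon_insensitive_loss(y_true, y_pred, epsilon=0.5)
# The first two predictions lie within 0.5 of the target and incur no loss;
# the third misses by 2.0, so it is penalized by 2.0 - 0.5 = 1.5.
```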

2.3. Method of Random Forest (RF)

Bagging is an ensemble learning method [28] that constructs multiple mutually independent weak learners and combines them using a decision method to form a strong learner. RF is an improvement on the bagging algorithm [29] and consists of multiple decision trees. Each decision tree is a hierarchical structure in which each internal node corresponds to a feature and each leaf node represents a target value. Decision trees recursively partition the data into different subsets for prediction. For a dataset with $N$ samples, the training set $N_k$ for each decision tree is obtained by resampling the $N$ samples. RF is the ensemble of all these decision trees. The principle of RF is illustrated in Figure 5.
RF addresses regression problems in several key steps. First, given a dataset with $N$ samples and $M$ features, and $K$ decision trees to be built, the following process is repeated $K$ times. In each iteration, a subset of the $N$ samples is drawn to create a child dataset $N_k$. Next, for each $N_k$, $m$ features are selected at random, where $m$ is less than $M$. A decision tree is then trained using this subset of data and its selected features. Finally, the predicted value for the regression problem is obtained by averaging the results from all decision trees, with the exact integration method depending on the specific RF algorithm used.
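The steps above can be sketched in plain Python as follows. To keep the example short, each "tree" is only a depth-1 regression stump, which is a deliberate simplification rather than a full decision tree; the function names, the toy data, and the use of sampling with replacement for the child datasets are assumptions made for illustration.

```python
import random


def fit_stump(X, y, features):
    """Depth-1 regression tree: best (feature, threshold) split among `features`."""
    best = None
    for f in features:
        for threshold in sorted({row[f] for row in X}):
            left = [t for row, t in zip(X, y) if row[f] <= threshold]
            right = [t for row, t in zip(X, y) if row[f] > threshold]
            if not left or not right:
                continue
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = sum((t - lm) ** 2 for t in left) + sum((t - rm) ** 2 for t in right)
            if best is None or sse < best[0]:
                best = (sse, f, threshold, lm, rm)
    if best is None:  # no valid split: predict the sample mean everywhere
        mean = sum(y) / len(y)
        return (0, float("inf"), mean, mean)
    return best[1:]


def predict_stump(stump, row):
    f, threshold, left_mean, right_mean = stump
    return left_mean if row[f] <= threshold else right_mean


def fit_forest(X, y, n_trees, m_features, seed=45):
    """Build K = n_trees stumps, each on its own child dataset and feature subset."""
    rng = random.Random(seed)
    n, M = len(X), len(X[0])
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]   # child dataset N_k (with replacement)
        features = rng.sample(range(M), m_features)  # m < M randomly chosen features
        forest.append(fit_stump([X[i] for i in idx], [y[i] for i in idx], features))
    return forest


def predict_forest(forest, row):
    """Regression output: the average over all trees."""
    return sum(predict_stump(s, row) for s in forest) / len(forest)


# Toy data: the target depends only on the first of two features.
X_toy = [[i / 10.0, (i * 7 % 10) / 10.0] for i in range(40)]
y_toy = [2.0 if row[0] < 2.0 else 6.0 for row in X_toy]
forest = fit_forest(X_toy, y_toy, n_trees=25, m_features=1)
low = predict_forest(forest, [0.5, 0.5])
high = predict_forest(forest, [3.5, 0.5])
```

Because some stumps only see the uninformative second feature, individual predictions are weak; averaging across the forest is what pulls `low` and `high` toward the two target levels.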
In addition, a linear regression model is used as a baseline for comparison in this paper. Linear models in machine learning predict a target variable by adding a weighted sum of the input features to a bias term. The weights and bias are adjusted to minimize a loss function, which reflects the error between predicted and actual values. This approach creates a simple linear relationship for making predictions [30]. The generic formula for a linear model is as follows:
$$\hat{y} = \omega_0 + \omega_1 x_1 + \omega_2 x_2 + \dots + \omega_n x_n$$
Here, $\hat{y}$ represents the predicted output, $\omega_0$ is the bias term, and $\omega_1, \omega_2, \dots, \omega_n$ are the weights assigned to the features $x_1, x_2, \dots, x_n$, respectively.

2.4. Performance Parameters

In this work, three regression metrics are adopted to evaluate the prediction accuracy of rock hardness: mean squared error (MSE), mean absolute error (MAE), and R-squared. The MSE measures the average squared difference between the actual and predicted values in a set of data [31]. MAE measures the average absolute difference between the actual and predicted values [32]. R-squared is a statistical metric quantifying the proportion of variance in the dependent variable that is predictable from the independent variable(s) [33,34]. It serves as an indicator of how well the independent variables explain the variability of the dependent variable. R-squared typically ranges from 0 to 1, where 0 denotes a model incapable of explaining any variability and 1 signifies a model that accounts for all observed variability. The formulas for the three metrics are
$$MSE = \frac{1}{n}\sum_{i=1}^{n}\left( y_i - \hat{y}_i \right)^{2}$$
$$MAE = \frac{1}{n}\sum_{i=1}^{n}\left| y_i - \hat{y}_i \right|$$
$$R^{2} = 1 - \frac{\sum_{i=1}^{n}\left( y_i - \hat{y}_i \right)^{2}}{\sum_{i=1}^{n}\left( y_i - \bar{y} \right)^{2}}$$
where $n$ is the number of data points (1494 in this work), $y_i$ is the true rock hardness value, $\hat{y}_i$ is the predicted value, and $\bar{y}$ is the mean of the actual values.
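As a small worked example, the three metrics can be computed directly from their definitions. The sketch below uses plain Python and four made-up hardness values; the function names and numbers are illustrative assumptions, not results from the paper.

```python
def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """One minus the ratio of residual to total sum of squares."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up hardness values: every prediction is off by exactly 0.5.
y_true = [2.0, 4.0, 6.0, 8.0]
y_pred = [2.5, 3.5, 6.5, 7.5]

mse_value = mse(y_true, y_pred)       # 4 * 0.25 / 4 = 0.25
mae_value = mae(y_true, y_pred)       # 4 * 0.5 / 4 = 0.5
r2_value = r_squared(y_true, y_pred)  # 1 - 1.0 / 20.0 = 0.95
```

Note that on held-out data the residual sum of squares can exceed the total sum of squares, in which case this definition yields a negative value.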

3. Datasets

3.1. Dataset Preparation

The objective of this study is to predict the hardness of rocks from morphological parameters. A custom-made rock dataset is used because publicly accessible rock datasets are scarce and tend to lack diversity in rock types, suffer from poor image quality, and have inconsistent image dimensions. Guided by the rock types found on Mars and the Moon, the dataset focuses on igneous and sedimentary rocks. To establish a scientifically effective dataset, five categories of rocks with attributes similar to extraterrestrial rocks were collected on Earth. The categories are granite, limestone, rhyolite, sandstone, and basalt, each with 100 samples (except for rhyolite). Table 1 shows the details of this dataset. Each rock was photographed from three fixed angles to ensure complete contours and clear surfaces. In total, 1494 rock images were obtained, each with dimensions of 4608 × 2592 pixels. A plot of the dataset is presented in Figure 6.
A total of 10 features are selected in this study: roundness, ellipticity, elongation, convexity, contrast, correlation, energy, homogeneity, fractal dimension, and the mean values in the Lab color space. The morphological features are extracted from binary images, and image preprocessing is performed using MATLAB software. The calculation schemes are described in the next section. Figure 7 illustrates the process of rock image preprocessing. To achieve a clear binary image of the rock, the following steps are followed: (i) the RGB image is converted to grayscale and the Roberts operator detects edges. (ii) Noise is then removed from the binary image through an opening operation, and any holes within the rock are filled. (iii) Redundant connected areas are subsequently deleted, leaving only the white areas representing the rock, resulting in a pure binary image.

3.2. Feature Calculation Method

3.2.1. Roundness

Roundness describes the shape characteristics of rock particles or rocks, indicating their proximity to a circular shape [35]. In geology and petrology, roundness is vital for assessing changes in the shape of particles or rocks caused by erosion, abrasion, and other geological processes. This parameter helps scientists and geologists understand the history and origins of particles or rocks, as well as the geological processes they have undergone. It is crucial for studying and comprehending the morphology and evolution of rocks on Earth and other celestial bodies.
Roundness is typically quantified on a scale from zero to one, with zero representing highly non-round shapes (like elongated forms) and one indicating perfectly circular shapes. The roundness calculation method employed in this paper involves several steps. (i) The total sum of white pixels in the binary image of rocks is computed, denoted as “A”. (ii) The minimum circumscribed circle radius of the rock is determined. (iii) Using this radius, a binary image of the minimum circumscribed circle is generated, and the total sum of white pixels in this circle is calculated, denoted as “B”. The roundness is then calculated as the ratio of “A” to “B”. Figure 8 shows the minimum circumscribed circle legend.
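The ratio of “A” to “B” can be sketched in plain Python as below. The sketch assumes the center and radius of the minimum circumscribed circle are already known (for example from a routine such as OpenCV's `cv2.minEnclosingCircle`); the function names and the synthetic masks are assumptions made for illustration.

```python
import math


def circle_pixel_count(center, radius):
    """B: number of pixel centers inside the circle, rasterized on the pixel grid."""
    cx, cy = center
    count = 0
    for y in range(math.floor(cy - radius), math.ceil(cy + radius) + 1):
        for x in range(math.floor(cx - radius), math.ceil(cx + radius) + 1):
            if (x - cx) ** 2 + (y - cy) ** 2 <= radius ** 2:
                count += 1
    return count


def roundness(mask, center, radius):
    """Ratio of rock pixels (A) to circumscribed-circle pixels (B)."""
    a = sum(sum(row) for row in mask)  # A: white pixels in the binary mask
    b = circle_pixel_count(center, radius)
    return a / b


# A synthetic disk of radius 10 fills its own circumscribed circle.
disk = [[1 if (x - 12) ** 2 + (y - 12) ** 2 <= 100 else 0 for x in range(25)]
        for y in range(25)]
disk_roundness = roundness(disk, (12, 12), 10)

# A 10 x 10 square only partly fills the circle through its corner pixels.
square = [[1] * 10 for _ in range(10)]
square_roundness = roundness(square, (4.5, 4.5), math.sqrt(40.5))
```

For the square, the value lands near the continuous-geometry ratio 2/π ≈ 0.64 scaled by pixel effects, so a less circular outline yields a visibly smaller roundness than the disk.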

3.2.2. Ovality, Extension, and Convexity

These three parameters are crucial for characterizing rock morphology. Ovality measures how far the rock outline departs from a circle, indicating the flatness of its elliptical form. Extension describes the ratio between the length and width of a rock, indicating whether it is more elongated or more rounded. Convexity indicates the degree of curvature of the rock surface [36]. The computation of these three parameters depends on the $maxlength$ and $maxwidth$ of the minimum bounding rectangle of the rock image, as depicted in Figure 9.
Ovality is typically a value ranging from zero to one, where zero indicates that the rock tends toward a circular shape and one suggests a highly elliptical form. Its formula is defined as follows:
$$Ovality = \frac{maxlength - maxwidth}{maxlength}$$
The extension formula is denoted as follows:
$$Extension = \frac{maxlength}{maxwidth}$$
Convexity is used to measure the irregularity of the rock’s surface. It is particularly valuable for investigating the wear and erosion history of rocks and their interactions with one another [37,38]. Its formula is expressed as follows:
$$Convexity = \frac{4 \cdot area}{\pi \cdot maxlength^{2}}$$
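As a quick check on how the three descriptors behave, the sketch below evaluates them for an ideal circle and for an elongated shape. It reads the formulas as ovality = (maxlength − maxwidth) / maxlength, extension = maxlength / maxwidth, and convexity = 4 · area / (π · maxlength²); the function name and the example dimensions are assumptions for illustration.

```python
import math


def shape_descriptors(area, max_length, max_width):
    """Ovality, extension, and convexity from the area and bounding-rectangle sides."""
    return {
        "ovality": (max_length - max_width) / max_length,
        "extension": max_length / max_width,
        "convexity": 4.0 * area / (math.pi * max_length ** 2),
    }


# A circle of diameter 10: the bounding rectangle is a 10 x 10 square.
circle = shape_descriptors(area=math.pi * 5.0 ** 2, max_length=10.0, max_width=10.0)

# An ellipse with axes 20 and 5 (area = pi * 10 * 2.5), as an elongated example.
ellipse = shape_descriptors(area=math.pi * 10.0 * 2.5, max_length=20.0, max_width=5.0)
```

The circle gives an ovality of 0, an extension of 1, and a convexity of 1, while the ellipse gives an ovality of 0.75, an extension of 4, and a convexity of 0.25, so all three move in the expected direction as the shape stretches.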

3.2.3. Fractal Dimension

Fractal dimension is used to measure the complexity of objects, capturing the characteristics of irregular fractal structures that cannot be easily described using traditional Euclidean geometry [39,40,41]. Fractal dimension values are typically non-integer, reflecting fractal properties like bifurcations, branching, and self-similarity on the object’s surface or contour. This aids in understanding and describing the complex structures and phenomena within the irregular geometric shapes of rocks.
In the field of computer vision and image processing, the fractal dimension can be utilized for the analysis of the texture and shape of rock images. To calculate the fractal dimension of a rock boundary, a square grid with a side length of D covers the boundary, and the number of squares required is counted. The grid’s side length is then halved to D/2, and the process repeats, counting squares to cover the boundary. This iteration continues, halving the grid size until the entire boundary is covered. The fractal dimension is determined by plotting the logarithmic relationship between the number of squares and their side lengths and calculating the slope of the resulting line. The fractal dimension calculation method used in this paper is as follows:
$$Fractal\ dimension = -\frac{\log N(D)}{\log D}$$
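The box-counting procedure can be sketched as follows. The example estimates the dimension as the least-squares slope of log N(D) against log(1/D) over a series of halved box sizes, which is one common way to implement the description above; the function names, the stopping rule (halving until the box is one pixel wide), and the test shapes are assumptions.

```python
import math


def box_count(points, size):
    """N(D): number of grid boxes of side `size` containing at least one point."""
    return len({(x // size, y // size) for x, y in points})


def fractal_dimension(points, start_size):
    """Least-squares slope of log N(D) versus log(1/D), halving D each step."""
    sizes, counts = [], []
    size = start_size
    while size >= 1:
        sizes.append(size)
        counts.append(box_count(points, size))
        size //= 2
    xs = [math.log(1.0 / s) for s in sizes]
    ys = [math.log(c) for c in counts]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    return numerator / denominator


# Sanity checks on shapes with known dimension.
line = [(x, 0) for x in range(256)]                        # a straight boundary
filled = [(x, y) for x in range(32) for y in range(32)]    # a filled square
line_dim = fractal_dimension(line, start_size=64)
filled_dim = fractal_dimension(filled, start_size=16)
```

A straight line yields a dimension of 1 and a filled square a dimension of 2, so an irregular rock boundary would be expected to fall between these values.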

3.2.4. The Mean Value in the Lab Color Space

The Lab color space, also known as CIELAB, is a device-independent color space defined by the International Commission on Illumination (CIE) [42,43]. It is designed to mimic human color perception and provides excellent visual consistency. Evaluating rocks based on their mean values in the Lab color space therefore yields characteristics similar to human observation. ‘L’ represents the luminance or brightness of the image on a scale from 0 to 100, with higher values indicating increased brightness. The ‘a’ parameter signifies the presence of red or green tones, with positive values indicating red/magenta and negative values indicating green. Similarly, the ‘b’ parameter represents the presence of yellow or blue tones, with positive values indicating yellow and negative values indicating blue [44,45,46]. Although there is no standardized range, typical values for ‘a’ and ‘b’ fall within the intervals of [−100, 100] or [−128, 127] [47,48]. The Lab color space separates color from brightness, making it effective for distinguishing the distinct colors of the five rock types. An RGB image is converted to the Lab color space using the following equations, after which the mean value of each of the three components is computed:
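$$\begin{bmatrix} X \\ Y \\ Z \end{bmatrix} = \begin{bmatrix} 0.49 & 0.31 & 0.2 \\ 0.177 & 0.812 & 0.071 \\ 0 & 0.1 & 0.99 \end{bmatrix} \times \begin{bmatrix} R \\ G \\ B \end{bmatrix}$$
$$L^{*} = 116\, f\!\left( \frac{Y}{Y_n} \right) - 16$$
$$a^{*} = 500 \left[ f\!\left( \frac{X}{X_n} \right) - f\!\left( \frac{Y}{Y_n} \right) \right]$$
$$b^{*} = 200 \left[ f\!\left( \frac{Y}{Y_n} \right) - f\!\left( \frac{Z}{Z_n} \right) \right]$$
$$f(t) = \begin{cases} t^{1/3}, & t > \left( \frac{6}{29} \right)^{3} \\ \frac{1}{3}\left( \frac{29}{6} \right)^{2} t + \frac{4}{29}, & t \le \left( \frac{6}{29} \right)^{3} \end{cases}$$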
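The XYZ-to-Lab step and the per-channel averaging can be sketched as follows. The example starts from XYZ values rather than RGB, and it treats $X_n$, $Y_n$, $Z_n$ as the tristimulus values of a reference white that the caller must supply, since the text does not state which white point was used; the function names and sample values are assumptions.

```python
def f(t):
    """Piecewise function used in the XYZ -> Lab conversion."""
    if t > (6.0 / 29.0) ** 3:
        return t ** (1.0 / 3.0)
    return (1.0 / 3.0) * (29.0 / 6.0) ** 2 * t + 4.0 / 29.0


def xyz_to_lab(x, y, z, white):
    """Convert one XYZ triple to (L*, a*, b*) relative to the reference white."""
    xn, yn, zn = white
    fx, fy, fz = f(x / xn), f(y / yn), f(z / zn)
    return (116.0 * fy - 16.0, 500.0 * (fx - fy), 200.0 * (fy - fz))


def mean_lab(pixels_xyz, white):
    """Mean L*, a*, b* over a list of XYZ pixels (for example, the rock region)."""
    labs = [xyz_to_lab(x, y, z, white) for x, y, z in pixels_xyz]
    n = len(labs)
    return tuple(sum(lab[k] for lab in labs) / n for k in range(3))


# Hypothetical reference white; the white itself should map to L* = 100, a* = b* = 0.
white = (1.0, 1.0, 1.0)
lab_white = xyz_to_lab(1.0, 1.0, 1.0, white)
lab_black = xyz_to_lab(0.0, 0.0, 0.0, white)
lab_mean = mean_lab([(1.0, 1.0, 1.0), (0.0, 0.0, 0.0)], white)
```

The linear branch of `f` exists so that very dark pixels, where the cube root becomes steep, still map smoothly to an L* near zero.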

3.2.5. Contrast, Correlation, Energy, and Homogeneity

The Gray-Level Co-occurrence Matrix (GLCM) is a valuable tool for texture analysis in images, facilitating the identification of specific textures in diverse rock samples. Four fundamental features are derived from the GLCM: Energy, Contrast, Correlation, and Homogeneity. Energy is the sum of the squared GLCM elements and reflects the uniformity of the texture, offering insights into texture roughness. Contrast quantifies dissimilarity in gray levels within the image, indicating textural heterogeneity. Correlation evaluates the linear relationship between pixel pairs, with higher values indicating a stronger linear association within the texture. Homogeneity reflects the closeness of the distribution of elements in the GLCM with respect to the GLCM diagonal [49,50,51,52]. The formulas for the four parameters are as follows:
$$contrast = \sum_{i,j} \left| i - j \right|^{2} p(i,j)$$
$$correlation = \sum_{i,j} \frac{\left( i - \mu_i \right)\left( j - \mu_j \right) p(i,j)}{\sigma_i \sigma_j}$$
$$energy = \sum_{i,j} p(i,j)^{2}$$
$$homogeneity = \sum_{i,j} \frac{p(i,j)}{1 + \left| i - j \right|}$$
In these formulas, $p(i,j)$ denotes the element in row $i$ and column $j$ of the gray-level co-occurrence matrix, and $i$ and $j$ are gray-level values. $\mu_i$ and $\mu_j$ are the means of the GLCM along its rows and columns, and $\sigma_i$ and $\sigma_j$ are the corresponding standard deviations. The GLCM calculation process is shown in Figure 10.
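The following sketch builds a normalized GLCM for a tiny image and evaluates the four features from their definitions. It uses a single horizontal pixel offset and a non-symmetric matrix, which are assumptions, as the offset and symmetry settings used for the dataset are not stated; the function names and the test image are also illustrative.

```python
import math


def glcm(image, levels, dx=1, dy=0):
    """Normalized co-occurrence matrix for pixel pairs separated by (dx, dy)."""
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    total = 0
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
                total += 1
    return [[v / total for v in row] for row in counts]


def glcm_features(p):
    """Contrast, correlation, energy, and homogeneity of a normalized GLCM."""
    n = len(p)
    cells = [(i, j, p[i][j]) for i in range(n) for j in range(n)]
    mu_i = sum(i * v for i, _, v in cells)
    mu_j = sum(j * v for _, j, v in cells)
    sd_i = math.sqrt(sum((i - mu_i) ** 2 * v for i, _, v in cells))
    sd_j = math.sqrt(sum((j - mu_j) ** 2 * v for _, j, v in cells))
    cov = sum((i - mu_i) * (j - mu_j) * v for i, j, v in cells)
    return {
        "contrast": sum((i - j) ** 2 * v for i, j, v in cells),
        # Correlation is undefined when either marginal has zero spread.
        "correlation": cov / (sd_i * sd_j) if sd_i > 0 and sd_j > 0 else float("nan"),
        "energy": sum(v * v for _, _, v in cells),
        "homogeneity": sum(v / (1 + abs(i - j)) for i, j, v in cells),
    }


# Two-level test image: a dark left half next to a bright right half.
image = [[0, 0, 1, 1],
         [0, 0, 1, 1]]
features = glcm_features(glcm(image, levels=2))
```

For this image the six horizontal pairs split evenly between (0,0), (0,1), and (1,1), giving a contrast of 1/3, a correlation of 0.5, an energy of 1/3, and a homogeneity of 5/6.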

3.3. Feature Analysis

The dataset prepared for this study includes 300 images for each rock type, with 10 extracted features per image. Mean values are calculated over the 300 feature sets for each rock type, giving the distribution chart shown in Figure 11. The data in Figure 12 reveal that among the five rock types, rhyolite exhibits the highest values in homogeneity, contrast, and entropy, which are the three texture parameters. This indicates that rhyolite possesses a more intricate surface texture and color compared to the other rocks, which aligns with conclusions drawn from visual observations. Basalt exhibits maximum roundness and minimum ellipticity, distinguishing it from limestone. It also displays the highest elongation and fractal dimension among the five rock types, signifying the most intricate boundary shape resulting from significant wear and tear during geological evolution. Furthermore, basalt is more easily transported than the other four rock types.
Due to differing numerical ranges, Lab features are discussed separately. The mean values in the Lab color space for the five rock types were used to generate a bar chart, as shown in Figure 13. The chart indicates that the Lab mean values for granite and rhyolite are similar, while those for sandstone and basalt are alike, with limestone positioned in the middle. This suggests that under identical lighting conditions, the surfaces of granite and rhyolite exhibit similar brightness, limestone has a mid-range brightness, and basalt and sandstone have the lowest reflectance under illumination. It is possible that smooth stones have undergone a polishing process.
Additionally, a correlation distribution chart was plotted to analyze the relationships among the various features of the dataset, as shown in Figure 14. The chart reveals that the features exhibit a normal distribution. The fractal dimension feature appears to be relatively independent and is not significantly influenced by other features.

4. Experiments

4.1. Computational Settings

This study uses Windows 11, Python 3.7, and PyTorch 1.12.1 to train the RF, SVR, linear, and MRFSVR models. All methods are executed on a computer equipped with an NVIDIA GeForce RTX 2060 and 16 GB of memory. The SVR model uses the sigmoid kernel function for training. The RF model consists of 200 decision trees.
The random state (the seed of the random number generator) is set to 45 so that the generated random number sequence is identical in every run of the model, which makes the experimental results reproducible. In the MRFSVR model, n-splits (the number of subsets into which the dataset is divided) is set to 20, and the random state is again set to 45.
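The effect of a fixed random state on the fold assignment can be sketched in plain Python. This is only an illustration under stated assumptions: `kfold_indices` is a hypothetical helper, not the library routine used in the study, and the sample count of 1494 is the image total from Table 1.

```python
import random


def kfold_indices(n_samples, n_splits=20, random_state=45):
    """Return (train_idx, val_idx) pairs for a shuffled k-fold split.

    Hypothetical stand-in for a library k-fold splitter.
    """
    indices = list(range(n_samples))
    # Seeded shuffle: the same seed always yields the same ordering.
    random.Random(random_state).shuffle(indices)

    # Spread the remainder over the first folds so sizes differ by at most one.
    base, extra = divmod(n_samples, n_splits)
    folds, start = [], 0
    for k in range(n_splits):
        size = base + (1 if k < extra else 0)
        folds.append(indices[start:start + size])
        start += size

    splits = []
    for k in range(n_splits):
        val_idx = folds[k]
        train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
        splits.append((train_idx, val_idx))
    return splits


# Two independent calls with the same seed give identical folds.
splits_a = kfold_indices(1494)
splits_b = kfold_indices(1494)
print(len(splits_a), splits_a == splits_b)
```

With 20 splits, each sample is used for validation exactly once and for training in the other 19 rounds.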

4.2. Comparison of Four Models

This section discusses the predictive performance of the four models and compares the rock hardness values predicted by each of them. Figure 17 compares the actual and predicted hardness values for 300 data points in the test set, showing the predictive performance of MRFSVR against the three baseline methods. In this figure, the X-axis represents the rock serial number, and the Y-axis represents the true and predicted hardness values of the corresponding rock. For clarity, the plot is drawn at intervals of 10 points.
Linear regression is a statistical method that models the relationship between a dependent variable (here, rock hardness) and the independent variables by fitting a linear equation. Although it is computationally efficient and requires few computational resources, it is poorly suited to rock hardness prediction because the relationship between rock hardness and the 10 input features is not linear. The linear model tends to overestimate at the highest hardness values and performs poorly at lower hardness values, resulting in low overall prediction accuracy. The model yields an MSE of 0.7393, an R-squared of 0.4764, and an MAE of 0.6722.
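For reference, the three evaluation metrics can be computed as in the following minimal sketch, which uses the standard textbook definitions. The true values are the Mohs hardness values from Table 1; the predicted values are made up for illustration and are not results from the study.

```python
def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Mohs hardness values from Table 1 (limestone, basalt, granite, rhyolite, sandstone).
y_true = [3.5, 5.5, 6.1, 6.5, 6.9]
# Hypothetical predictions, chosen only to exercise the functions.
y_pred = [4.0, 5.0, 6.1, 6.0, 7.4]

print(mse(y_true, y_pred), mae(y_true, y_pred), r_squared(y_true, y_pred))
```

A lower MSE and MAE and an R-squared closer to 1 indicate a better fit, which is the sense in which the four models are ranked in this section.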
The SVR method handles the non-linear problem of hardness prediction by employing a sigmoid kernel function, which offers flexibility in capturing the complex relationship between morphological values and rock hardness. The kernel trick allows SVR to implicitly map the input data into a higher-dimensional space, so the model can capture non-linear relationships without explicitly computing the transformed features. However, SVR is less sensitive to the maximum hardness value, as it prioritizes data points near the decision boundary (support vectors) instead of considering all points. The SVR model achieves an MSE of 0.3551, an R2 of 0.7485, and an MAE of 0.4134 (Table 2). Compared with the linear model, the MSE is reduced by 0.3842, the R2 is increased by 0.2721, and the MAE is reduced by 0.2588. These findings support the view that hardness prediction is a non-linear problem.
The RF model is robust, less susceptible to overfitting, and suitable for a wide range of datasets. The ensemble nature of the algorithm, which combines predictions from multiple trees, helps mitigate the impact of hardness outliers on the overall model. It outperforms SVR and linear regression in predicting the maximum hardness values. The MSE is 0.2595, a reduction of 26.92% compared with the SVR algorithm; the R2 is 0.8162, and the MAE is 0.2462. Compared with the SVR method, the R2 increases by 9.05% and the MAE decreases by 35.89%.
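The relative changes in MSE and R2 can be checked with a short calculation. The sketch below uses the values listed in Table 2; `relative_change_percent` is a hypothetical helper, and small differences from the quoted percentages can arise from rounding of the tabulated values.

```python
def relative_change_percent(baseline, new):
    """Signed change of `new` relative to `baseline`, in percent."""
    return (new - baseline) / baseline * 100.0


# Values for SVR and RF as listed in Table 2.
svr = {"MSE": 0.3551, "R2": 0.7485}
rf = {"MSE": 0.2595, "R2": 0.8162}

mse_change = relative_change_percent(svr["MSE"], rf["MSE"])  # negative: error went down
r2_change = relative_change_percent(svr["R2"], rf["R2"])     # positive: fit improved

print(f"MSE change: {mse_change:.2f}%")
print(f"R2 change:  {r2_change:.2f}%")
```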
The MRFSVR model can capture both linear and nonlinear relationships between rock morphology parameters and hardness. It can reduce overfitting by employing cross-validation to select optimal hyperparameters for both base models, and it improves accuracy by combining two different types of regression models. The model is trained using K-fold cross-validation, which is flexible and adaptable to different types of rocks with varying shapes. K-fold cross-validation divides the available data into multiple folds (subsets), uses one fold as the validation set, and trains the model on the remaining folds. The process is repeated so that each fold serves once as the validation set, and the results are averaged. This helps prevent overfitting, provides a realistic estimate of the model’s generalization performance, and enables training with a smaller dataset. The MRFSVR model has the highest R2 and the smallest MSE of the four models, achieving an MSE of 0.2514, an R2 of 0.8219, and an MAE of 0.2431. In terms of MSE, this is a reduction of 65.98% relative to the linear model, 29.18% relative to the SVR model, and 3.12% relative to the RF model. The evaluation results of the four models are depicted in Figure 15 and listed in detail in Table 2. In addition, Figure 16 shows the residual analysis results of the four models.
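One common way to realize such a two-level model is stacking with out-of-fold predictions, sketched below with scikit-learn. This is an assumption about the general structure, not the authors' code: the use of scikit-learn, the synthetic data, the sample count, and the helper `predict_hardness` are all illustrative. Only the 200 trees, the sigmoid kernel, the 20 splits, and the random state of 45 come from Section 4.1.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold, cross_val_predict
from sklearn.svm import SVR

rng = np.random.default_rng(45)
# Synthetic stand-in data: 120 samples x 10 features (NOT the rock dataset).
X = rng.normal(size=(120, 10))
y = 5.0 + X[:, 0] - 0.5 * X[:, 1] ** 2 + rng.normal(scale=0.1, size=120)

# Base models with the settings reported in Section 4.1.
base_models = [
    RandomForestRegressor(n_estimators=200, random_state=45),
    SVR(kernel="sigmoid"),
]
cv = KFold(n_splits=20, shuffle=True, random_state=45)

# Out-of-fold predictions: each sample is predicted by a model that never saw it.
meta_features = np.column_stack(
    [cross_val_predict(model, X, y, cv=cv) for model in base_models]
)

# Meta-model: a linear fit on the two base-model outputs.
meta_model = LinearRegression().fit(meta_features, y)

# Refit the base models on all data so they can be applied to new samples.
for model in base_models:
    model.fit(X, y)


def predict_hardness(X_new):
    """Stack the base-model predictions and pass them to the linear meta-model."""
    stacked = np.column_stack([m.predict(X_new) for m in base_models])
    return meta_model.predict(stacked)


y_hat = predict_hardness(X[:5])
print(y_hat)
```

Fitting the meta-model on out-of-fold predictions rather than in-sample predictions keeps it from simply trusting whichever base model memorizes the training data best.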
Figure 17 presents a comparative analysis of the results obtained from the four models. The graph shows that predictive performance improves progressively from the linear model to the SVR model, the RF model, and finally the model proposed in this study. The evaluation metrics confirm that the MRFSVR model has the highest R-squared value and the lowest mean squared error, highlighting the effectiveness of the proposed training methodology in enhancing the predictive capability of the model.

4.3. Experimental Verification

To further validate the accuracy of the model, the hardness of actual meteorites was measured experimentally and compared with the model’s predictions. Two pieces of the KERIYA001 meteorite and three unknown meteorites, shown in Figure 18, were measured for Mohs hardness by a specialized testing laboratory; all five had a hardness of 5.
Each meteorite was photographed from three angles, and the 10 features were extracted from each image, as outlined in Section 3. The MRFSVR model was then used to predict the hardness for each image, resulting in 15 data points. Table 3 presents the experimental and predicted hardness values together with the MSE and MAE for each image, and Figure 19 visualizes the errors. Different shooting angles introduce biases in the hardness estimate. The mean absolute error (MAE) of the predictions is 0.6815, and the mean squared error (MSE) averages 1.0271.

4.4. Rock Hardness Prediction from the Images of Mars Rover

Predicting rock mechanical parameters is crucial for understanding extraterrestrial rocks in the context of deep space exploration, and the model developed in this study is ultimately aimed at predicting the characteristics of such rocks. To predict the hardness of extraterrestrial rocks, it is necessary to gather morphological parameters from both lunar and Martian sources. NASA has published a Mars dataset that labels features on the Martian surface, including soil, bedrock, sand, and large rocks. In addition, MATLAB’s Image Labeler toolbox was used to mark rocks in NaTeCam images, supplementing the Tianwen dataset. Given the monotony of Martian landscapes, the significant background noise, and the objectives of this study, a U-NET segmentation network was chosen to segment rocks on the Martian surface. The U-NET network was trained on the MATLAB platform and achieved an accuracy of 76.3%. This made it possible to extract morphological parameters from the Martian and NaTeCam rock images, producing a raw segmentation map of the rock distribution. However, the raw map included noise from the sky, instruments, and sandy gaps, which required post-processing. Morphological operations such as opening and closing were used to remove small noise holes. Regions of interest (ROIs) were manually selected to blur out irregular areas containing instrument and sky noise. Furthermore, removing the largest connected areas helped to eliminate major noise from the sky and instruments. The final binary image retained the essential information about the rocks. The segmentation and post-processing steps are illustrated in Figure 20.
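The step of removing the largest connected area can be sketched in plain Python as follows. This is a simplified illustration, not the MATLAB processing used in the study: the 4-connectivity, the toy mask, and the helper names `label_components` and `remove_largest_component` are assumptions.

```python
from collections import deque


def label_components(mask):
    """Return the 4-connected foreground regions of a binary mask as pixel lists."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] and not seen[r][c]:
                # Breadth-first flood fill from this unvisited foreground pixel.
                queue = deque([(r, c)])
                seen[r][c] = True
                pixels = []
                while queue:
                    y, x = queue.popleft()
                    pixels.append((y, x))
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if (0 <= ny < rows and 0 <= nx < cols
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                components.append(pixels)
    return components


def remove_largest_component(mask):
    """Return a copy of the mask with its largest connected region set to 0."""
    components = label_components(mask)
    cleaned = [row[:] for row in mask]
    if components:
        for y, x in max(components, key=len):
            cleaned[y][x] = 0
    return cleaned


# Toy mask: a large "sky" band at the top and two small rock-like regions below.
mask = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0, 1],
    [0, 1, 1, 0, 0, 0],
]
cleaned = remove_largest_component(mask)
for row in cleaned:
    print(row)
```

In this toy case the 12-pixel band is removed while both smaller regions survive, mirroring how a large sky or instrument region would be discarded while rock regions are kept.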
The application process is as follows. First, the segmented image is obtained using the U-NET network and the image processing steps. Then, MATLAB’s ROI tool is used to extract the RGB and binary images of each rock from the segmented image. The RGB images are used to extract features such as correlation, contrast, homogeneity, energy, and Lab mean, while the binary images are used to extract features such as roundness, ellipticity, concavity, elongation, and fractal dimension. Finally, the MRFSVR model presented in this paper is used to predict the hardness of the rocks. The application process is shown in Figure 21.
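As an illustration of the texture features named above, the following sketch computes a gray-level co-occurrence matrix (GLCM) and four common descriptors for a tiny quantized image. The definitions are the widely used textbook forms and may differ in detail from those used in the study: the horizontal one-pixel offset, the homogeneity form 1/(1 + (i - j)^2), energy as the sum of squared entries, and the helper names are all assumptions.

```python
def glcm(image, levels, dx=1, dy=0):
    """Normalized (non-symmetric) co-occurrence matrix for the offset (dy, dx)."""
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
    total = sum(map(sum, counts))
    return [[v / total for v in row] for row in counts]


def texture_features(p):
    """Contrast, homogeneity, energy, and correlation of a normalized GLCM."""
    n = len(p)
    cells = [(i, j, p[i][j]) for i in range(n) for j in range(n)]
    contrast = sum(v * (i - j) ** 2 for i, j, v in cells)
    homogeneity = sum(v / (1 + (i - j) ** 2) for i, j, v in cells)
    energy = sum(v ** 2 for _, _, v in cells)
    mu_i = sum(i * v for i, _, v in cells)
    mu_j = sum(j * v for _, j, v in cells)
    var_i = sum(v * (i - mu_i) ** 2 for i, _, v in cells)
    var_j = sum(v * (j - mu_j) ** 2 for _, j, v in cells)
    cov = sum(v * (i - mu_i) * (j - mu_j) for i, j, v in cells)
    # Correlation is undefined for a constant image; NaN is returned in that case.
    if var_i > 0 and var_j > 0:
        correlation = cov / (var_i * var_j) ** 0.5
    else:
        correlation = float("nan")
    return {
        "contrast": contrast,
        "homogeneity": homogeneity,
        "energy": energy,
        "correlation": correlation,
    }


# Tiny 4-level test image (gray values already quantized to 0..3).
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]
features = texture_features(glcm(image, levels=4))
print(features)
```

In practice the rock RGB image would first be converted to grayscale and quantized to a small number of levels before the matrix is built.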
Because the Moon has been studied more extensively, a significant amount of pre-segmented lunar rock data is available online. To examine the geometric characteristics of lunar rocks, this paper used the Artificial Lunar Landscape Dataset from the Kaggle platform [53]. This dataset contains virtual lunar surface images rendered from lunar DEM elevation maps using Planetside Software’s Terragen, with annotations for small lunar rocks and bedrock. Building on this dataset, the segmentation results for small lunar rocks were extracted and converted into binary images. The application to lunar rocks is shown in Figure 22.
In the future, the work presented in this paper will be applied to a broader range of extraterrestrial rock predictions, with the study of rock properties diversifying due to technological advancements. This expansion will encompass a wider scope and deeper content, integrating related fields like geology and environmental science with planetary science.

5. Conclusions

Despite significant advancements in identifying extraterrestrial rock properties, a gap remains in leveraging geometric features and machine learning to predict hardness. This study addresses this gap by developing the Meta-Random Forest and Support Vector Regression (MRFSVR) model. A dataset of 1494 self-produced rock images, covering granite, limestone, rhyolite, sandstone, and basalt (Table 1), was used for model training. The trained model achieved an R2 of 0.8219, an MSE of 0.2514, and a mean absolute error of 0.2431. Experiments were conducted using meteorite samples, further demonstrating the model’s robustness and accuracy in predicting Mohs hardness based on morphological parameters.
The model was applied to Tianwen-1 images and a simulated lunar rock dataset, successfully predicting the hardness distribution of extraterrestrial rocks. This method bypasses the time and cost of physical sampling and offers a scalable approach to analyzing planetary surface properties. Its performance confirms the potential of integrating machine learning with navigation images for planetary geological research.
Consequently, a link between the morphology of extraterrestrial rocks and their mechanical properties can be established by the findings of this study, offering deeper insights into the geological evolution of extraterrestrial planets. This research enables rovers to autonomously identify the mechanical properties of rocks on planetary surfaces, offering valuable support for astronauts in future planetary geology exploration.

Author Contributions

Conceptualization, S.L. and H.Z.; methodology, S.L., H.Z., Z.Y., and L.X.; software, S.L. and C.S.; validation, S.L., Z.Y., L.X., C.S., and X.W.; investigation, S.L., H.Z., X.W., and X.T.; writing—original draft preparation, S.L.; writing—review and editing, H.Z. and L.Z.; supervision, H.Z. and L.Z.; project administration, H.Z. and L.Z.; funding acquisition, H.Z. All authors have read and agreed to the published version of the manuscript.

Funding

H.F. Zhao acknowledges support from the foundation of the China Manned Space Engineering Program.

Data Availability Statement

The datasets generated and supporting the findings of this article are available from the corresponding author H.Z. upon request. The Tianwen-1 Mars Rover Navigation and Terrain Camera (NaTeCam) images used in this study are processed and produced by the GRAS of China’s Lunar and Planetary Exploration Program and provided by CNSA at https://clpds.bao.ac.cn/web/enmanager/mars1 (accessed on 30 November 2024).

Acknowledgments

H.Z. and X.W. acknowledge the data provided by China’s Lunar and Planetary Exploration Program.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Fan, J.; Zhang, X.; Zou, Y. Hierarchical path planner for unknown space exploration using reinforcement learning-based intelligent frontier selection. Expert Syst. Appl. 2023, 230, 120630.
  2. Stentz, A. Optimal and Efficient Path Planning for Unknown and Dynamic Environments. IEEE Int. Conf. Robot. Autom. 1994, 4, 3310–3317.
  3. Ewing, R.C.; Lapotre, M.G.A.; Lewis, K.W.; Day, M.; Stein, N.; Rubin, D.M.; Sullivan, R.; Banham, S.; Lamb, M.P.; Bridges, N.T.; et al. Sedimentary processes of the Bagnold Dunes: Implications for the eolian rock record of Mars. J. Geophys. Res. Planets 2017, 122, 2544–2573.
  4. Zhang, W.; Cheng, Q.; Li, J. Technical progress in the utilization and exploitation of small celestial body resources. Acta Astronaut. 2023, 208, 219–255.
  5. Zhang, W.; Cheng, Q.; Zhou, W.; Li, J.; Yu, T.; Li, F.; Xu, Y.; Zhang, X. An automatic assisted drill system for sampling deep layer regolith of extraterrestrial celestial bodies. Acta Astronaut. 2023, 207, 375–391.
  6. Blake, D.F.; Morris, R.V.; Kocurek, G.; Morrison, S.M.; Downs, R.T.; Bish, D.; Ming, D.W.; Edgett, K.S.; Rubin, D.; Goetz, W.; et al. Curiosity at Gale Crater, Mars: Characterization and Analysis of the Rocknest Sand Shadow. Science 2013, 341, 1239505.
  7. Cabrol, N.A.; Grin, E.A.; Carr, M.H.; Sutter, B.; Moore, J.M.; Farmer, J.D.; Greeley, R.; Kuzmin, R.O.; DesMarais, D.J.; Kramer, M.G.; et al. Exploring Gusev Crater with Spirit: Review of science objectives and testable hypotheses. J. Geophys. Res. 2003, 108, E12.
  8. Golombek, M.; Kipp, D.; Warner, N.; Daubar, I.J.; Fergason, R.L.; Kirk, R.L.; Beyer, R.; Huertas, A.; Piqueux, S.; Putzig, N.; et al. Selection of the InSight Landing Site. Space Sci. Rev. 2016, 211, 5–95.
  9. Xiao, X.; Cui, H.; Yao, M.; Tian, Y. Autonomous rock detection on Mars through region contrast. Adv. Space Res. 2017, 60, 626–635.
  10. Yang, J.; Kang, Z.; Yang, Z.; Xie, J.; Xue, B.; Yang, J.; Tao, J. A Laboratory Open-Set Martian Rock Classification Method Based on Spectral Signatures. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–15.
  11. Tang, X.; Xu, J.; Zhang, Y.; Zhao, H.; Paluszny, A.; Wan, X.; Wang, Z. The rock-forming minerals and macroscale mechanical properties of asteroid rocks. Eng. Geol. 2023, 321, 107154.
  12. Crane, K.; Rich, J. Lithospheric strength and elastic properties for Mars from InSight geophysical data. Icarus 2023, 400, 115581.
  13. Marteau, E.; Wehage, K.; Higa, S.; Moreland, S.; Meirion-Griffith, G. Geotechnical assessment of terrain strength properties on Mars using the Perseverance rover’s abrading bit. J. Terramech. 2023, 107, 13–22.
  14. Foucher, F.; Bost, N.; Guimbretière, G.; Courtois, A.; Hickman-Lewis, K.; Marceau, E.; Martin, P.; Westall, F. Igneous rock powder identification using colour cameras: A powerful method for space exploration. Icarus 2021, 375, 114848.
  15. Gutiérrez-Cano, J.D.; Catalá-Civera, J.M.; López-Buendía, A.M.; Plaza-González, P.J.; Penaranda-Foix, F.L. High-resolution detection of rock-forming minerals by permittivity measurements with a near-field scanning microwave microscope. Sensors 2022, 22, 1138.
  16. Santos, A.E.M.; Lana, M.S.; Pereira, T.M. Evaluation of machine learning methods for rock mass classification. Neural Comput. Appl. 2021, 34, 4633–4642.
  17. Di, K.; Yue, Z.; Liu, Z.; Wang, S. Automated rock detection and shape analysis from Mars rover imagery and 3D point cloud data. J. Earth Sci. 2013, 24, 125–135.
  18. Houshmand, N.; Goodfellow, S.; Esmaeili, K.; Calderón, J.C.O. Rock type classification based on petrophysical, geochemical, and core imaging data using machine and deep learning techniques. Appl. Comput. Geosci. 2022, 16, 100104.
  19. Hong, K.; Han, E.; Kang, K. Determination of geological strength index of jointed rock mass based on image processing. J. Rock Mech. Geotech. Eng. 2017, 9, 702–708.
  20. Lee, J.; Cook, O.J.; Argüelles, A.P.; Mehmani, Y. Imaging geomechanical properties of shales with infrared light. Fuel 2023, 334, 126467.
  21. Tang, P.; Zhang, D.; Li, H. Predicting permeability from 3D rock images based on CNN with physical information. J. Hydrol. 2022, 606, 127473.
  22. Turchi, L.; Payler, S.J.; Sauro, F.; Pozzobon, R.; Massironi, M.; Bessone, L. The Electronic FieldBook: A system for supporting distributed field science operations during astronaut training and human planetary exploration. Planet. Space Sci. 2021, 197, 105164.
  23. Karaman, K.; Kesimal, A. A comparative study of Schmidt hammer test methods for estimating the uniaxial compressive strength of rocks. Bull. Eng. Geol. Environ. 2014, 74, 507–520.
  24. Panchuk, K. Physical Geology; First University of Saskatchewan Edition; University of Saskatchewan: Saskatoon, SK, Canada, 2017.
  25. Sun, W.; Wang, L.; Wang, Y. Mechanical properties of rock materials with related to mineralogical characteristics and grain size through experimental investigation: A comprehensive review. Front. Struct. Civ. Eng. 2017, 11, 322–328.
  26. Li, J.; Gao, F.; Lin, S.; Guo, M.; Li, Y.; Liu, H.; Qin, S.; Wen, Q. Quantum k-fold cross-validation for nearest neighbor classification algorithm. Physica A 2023, 611, 128435.
  27. Luo, C.; Keshtegar, B.; Zhu, S.P.; Niu, X. EMCS-SVR: Hybrid efficient and accurate enhanced simulation approach coupled with adaptive SVR for structural reliability analysis. Comput. Methods Appl. Mech. Eng. 2022, 400, 115499.
  28. Ngo, G.; Beard, R.; Chandra, R. Evolutionary bagging for ensemble learning. Neurocomputing 2022, 510, 1–14.
  29. Jiang, M.; Wang, J.; Hu, L.; He, Z. Random forest clustering for discrete sequences. Pattern Recognit. Lett. 2023, 174, 145–151.
  30. Chander, G.P.; Das, S. Hesitant t-spherical fuzzy linear regression model based decision making approach using gradient descent method. Eng. Appl. Artif. Intell. 2023, 122, 106074.
  31. Wong, T.T. Parametric methods for comparing the performance of two classification algorithms evaluated by k-fold cross validation on multiple data sets. Pattern Recognit. 2017, 65, 97–107.
  32. Kim, B.; Ryu, K.H.; Heo, S. Mean squared error criterion for model-based design of experiments with subset selection. Comput. Chem. Eng. 2022, 159, 107667.
  33. Tang, Y.; Shang, L.; Zhang, R.; Li, J.; Fu, H. Hybrid divergence based on mean absolute scaled error for incipient fault detection. Eng. Appl. Artif. Intell. 2024, 129, 107662.
  34. Hwang, J.; Kim, S.; Mer, V.N. Non-homogeneous Riemannian gradient equations for sum of squares of Bures–Wasserstein metric. J. Comput. Appl. Math. 2024, 438, 115555.
  35. Šimunović, V.; Baršić, G. Evaluating the spindle error of the roundness measurement device. Meas. Sens. 2024, 32, 101038.
  36. Nguyen, H.N.; Lisser, A.; Liu, J. Convexity of linear joint chance constrained optimization with elliptically distributed dependent rows. Results Control Optim. 2023, 12, 100285.
  37. da S. Bessa, J.; Da Silva, J.V.; Frederico, M.N.; Ricarte, G.C. Sharp Hessian estimates for fully nonlinear elliptic equations under relaxed convexity assumptions, oblique boundary conditions and applications. J. Differ. Equ. 2023, 367, 451–493.
  38. Zhang, H.; Hu, X.; Wang, L.; Zhao, E.; Liu, C. Effect mechanism of block convexity on the shear behaviors of soil-rock mixtures by the developed 3D spherical harmonics-based modeling approach. Comput. Geotech. 2023, 155, 105183.
  39. Rabal, H.; Grumel, E.; Cap, N.; Buffarini, L.; Trivi, M. A descriptor of speckle textures using box fractal dimension curve. Opt. Lasers Eng. 2018, 106, 47–55.
  40. Bian, J.; Ma, Z.; Wang, C.; Huang, T.; Zeng, C. Early warning for spatial ecological system: Fractal dimension and deep learning. Physica A 2024, 633, 129401.
  41. Dong, S.; Yu, X.; Zeng, L.; Ye, J.; Wang, L.; Ji, C.; Fu, K.; Wang, R. Relationship between box-counting fractal dimension and properties of fracture networks. Unconv. Resour. 2024, 4, 100068.
  42. Muniraj, M.; Dhandapani, V. Underwater image enhancement by modified color correction and adaptive Look-Up-Table with edge-preserving filter. Signal Process. Image Commun. 2023, 113, 116939.
  43. Sahrir, C.D.; Ruslin, M.; Lee, S.Y.; Lin, W.C. Effect of various post-curing light intensities, times, and energy levels on the color of 3D-printed resin crowns. J. Dent. Sci. 2024, 19, 357–363.
  44. Yung, D.; Tse, A.K.; Hsung, R.T.; Botelho, M.G.; Pow, E.H.; Lam, W.Y. Comparison of the colour accuracy of a single-lens reflex camera and a smartphone camera in a clinical context. J. Dent. 2023, 137, 104681.
  45. Zhang, N.; Jiang, Z.; Li, J.; Zhang, D. Multiple color representation and fusion for diabetes mellitus diagnosis based on back tongue images. Comput. Biol. Med. 2023, 155, 106652.
  46. Fu, R.; Li, J.; Yang, C.; Li, J.; Yu, X. Image colour application rules of Shanghai style Chinese paintings based on machine learning algorithm. Eng. Appl. Artif. Intell. 2024, 132, 107903.
  47. Yang, B.; Zhu, C.; Li, F.W.; Wei, T.; Liang, X.; Wang, Q. IAACS: Image aesthetic assessment through color composition and space formation. Virtual Real. Intell. Hardw. 2023, 5, 42–56.
  48. Prakash, K.; Saradha, S. Efficient prediction and classification for cirrhosis disease using LBP, GLCM and SVM from MRI images. Mater. Today Proc. 2023, 81, 383–388.
  49. Fajardo, J.I.; Paltán, C.A.; López, L.M.; Carrasquero, E.J. Textural analysis by means of a gray level co-occurrence matrix method. Case: Corrosion in steam piping systems. Mater. Today Proc. 2022, 49, 149–154.
  50. Wang, Y.; Sun, S. A rock fabric classification method based on the grey level co-occurrence matrix and the Gaussian mixture model. J. Nat. Gas Sci. Eng. 2022, 104, 104627.
  51. Utaminingrum, F.; Sarosa, S.J.A.; Karim, C.; Gapsari, F.; Wihandika, R.C. The combination of gray level co-occurrence matrix and back propagation neural network for classifying stairs descent and floor. ICT Express 2022, 8, 151–160.
  52. Pare, S.; Bhandari, A.; Kumar, A.; Singh, G. An optimal color image multilevel thresholding technique using grey-level co-occurrence matrix. Expert Syst. Appl. 2017, 87, 335–362.
  53. Artificial Lunar Landscape Dataset. Available online: https://www.kaggle.com (accessed on 12 June 2019).
Figure 1. Conceptual diagram of the working scenario of this paper.
Figure 2. Flow chart of MRFSVR.
Figure 3. Flowchart of k-fold cross-validation.
Figure 4. Schematic diagram of the principle of SVR.
Figure 5. Schematic diagram of the principle of RF.
Figure 6. Example of the original rock dataset created for this study (grid scale: 1 cm): (a1–a3) Granite; (b1–b3) Rhyolite; (c1–c3) Basalt; (d1–d3) Sandstone; (e1–e3) Limestone.
Figure 7. (a) Rock RGB image; (b) Edge detection graph; (c) Denoising results.
Figure 8. (a) Illustration of the minimal circumscribed circle; (b) The smallest circumscribed circle generated.
Figure 9. Illustration of a rock and its corresponding minimum enclosing rectangle.
Figure 10. Schematic diagram of the GLCM.
Figure 11. Feature distribution plots for five types of rocks: (a) Lab mean value; (b) Radius; (c) Contrast; (d) Correlation; (e) Energy; (f) Homogeneity; (g) Fractal dimension; (h) Ovality; (i) Extension; (j) Convex.
Figure 12. The mean distribution of nine characteristic values.
Figure 13. The mean distribution of Lab mean values.
Figure 14. Dataset heatmap.
Figure 15. The evaluation parameter results.
Figure 16. Residual analysis results for the four models.
Figure 17. Comparison of prediction results.
Figure 18. The meteorites used in the experiment: (a) KERIYA001 meteorite #1; (b) KERIYA001 meteorite #2; (c) unknown meteorite specimen #3; (d) unknown meteorite specimen #4; (e) unknown meteorite specimen #5.
Figure 19. Distribution of predicted values for meteorites.
Figure 20. (a) Navigation image of Tianwen-1; (b) Image segmented by U-NET; (c) Binary image after image processing; (d) Visualization of segmentation results.
Figure 21. (a) Original image; (b) Binary image; (c) Single rock RGB image; (d) Single rock binary image; (e) Hardness prediction on the Martian surface.
Figure 22. (a) Lunar original image; (b) Lunar binary image; (c) Single rock RGB image; (d) Single rock binary image; (e) Hardness prediction on the lunar surface.
Table 1. Dataset information.
Type | Number of Rocks | Number of Images | Petrogenesis | Mohs Hardness
Granite | 100 | 300 | Igneous rock | 6.1
Limestone | 100 | 300 | Sedimentary rock | 3.5
Rhyolite | 100 | 294 | Igneous rock | 6.5
Sandstone | 100 | 300 | Sedimentary rock | 6.9
Basalt | 100 | 300 | Igneous rock | 5.5
Table 2. Comparison of model prediction results.
Model | R2 | MSE | MAE
Linear | 0.4764 | 0.7393 | 0.6722
RF | 0.8162 | 0.2595 | 0.2462
SVR | 0.7485 | 0.3551 | 0.4134
MRFSVR | 0.8219 | 0.2514 | 0.2431
Table 3. Hardness validation of meteorite specimens.
Numbers | Predicted Values | Experimental Values | MSE | MAE
Rock1-1 | 6.10734 | 5 | 1.1186 | 1.10734
Rock1-2 | 3.6649 | 5 | 2.9888 | 1.3351
Rock1-3 | 3.78789 | 5 | 2.9963 | 1.2122
Rock2-1 | 3.92277 | 5 | 2.9855 | 1.0772
Rock2-2 | 5.27432 | 5 | 0.0732 | 0.27432
Rock2-3 | 5.31166 | 5 | 0.1004 | 0.1004
Rock3-1 | 5.13787 | 5 | 0.0188 | 0.1378
Rock3-2 | 4.61253 | 5 | 0.3742 | 0.38747
Rock3-3 | 4.76652 | 5 | 0.2852 | 0.23348
Rock4-1 | 4.33966 | 5 | 0.4443 | 0.66034
Rock4-2 | 5.19374 | 5 | 0.0384 | 0.19374
Rock4-3 | 3.77161 | 5 | 2.9771 | 1.22839
Rock5-1 | 4.48283 | 5 | 0.2631 | 0.51717
Rock5-2 | 3.92592 | 5 | 2.7257 | 1.07408
Rock5-3 | 4.42251 | 5 | 0.3108 | 0.57749
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, S.; Zhao, H.; Yuan, Z.; Xiao, L.; Shen, C.; Wan, X.; Tang, X.; Zhang, L. A Machine Learning Approach for the Autonomous Identification of Hardness in Extraterrestrial Rocks from Digital Images. Aerospace 2025, 12, 26. https://doi.org/10.3390/aerospace12010026
