Combining UAV Remote Sensing with Ensemble Learning to Monitor Leaf Nitrogen Content in Custard Apple (Annona squamosa L.)

Jiang, Xiangtai; Gao, Lutao; Xu, Xingang; Wu, Wenbiao; Yang, Guijun; Meng, Yang; Feng, Haikuan; Li, Yafeng; Xue, Hanyu; Chen, Tianen

doi:10.3390/agronomy15010038

Open AccessArticle

Combining UAV Remote Sensing with Ensemble Learning to Monitor Leaf Nitrogen Content in Custard Apple (Annona squamosa L.)

by

Xiangtai Jiang

^1,2,†,

Lutao Gao

^3,†,

Xingang Xu

^1,2,*

,

Wenbiao Wu

¹,

Guijun Yang

¹,

Yang Meng

¹,

Haikuan Feng

¹

,

Yafeng Li

¹,

Hanyu Xue

¹

and

Tianen Chen

^1,*

¹

Key Laboratory of Quantitative Remote Sensing in Ministry of Agriculture and Rural Affairs, Information Technology Research Center, Beijing Academy of Agricultural and Forestry Sciences, Beijing 100097, China

²

College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China

³

College of Big Data, Yunnan Agricultural University, Kunming 650201, China

^*

Authors to whom correspondence should be addressed.

^†

L.G. and X.J. are co-first authors.

Agronomy 2025, 15(1), 38; https://doi.org/10.3390/agronomy15010038

Submission received: 25 November 2024 / Revised: 20 December 2024 / Accepted: 24 December 2024 / Published: 27 December 2024

(This article belongs to the Section Precision and Digital Agriculture)

Download

Browse Figures

Versions Notes

Abstract

:

One of the most important nutrients needed for fruit tree growth is nitrogen. For orchards to get targeted, well-informed nitrogen fertilizer, accurate, large-scale, real-time monitoring, and assessment of nitrogen nutrition is essential. This study examines the Leaf Nitrogen Content (LNC) of the custard apple tree, a noteworthy fruit tree that is extensively grown in China’s Yunnan Province. This study uses an ensemble learning technique based on multiple machine learning algorithms to effectively and precisely monitor the leaf nitrogen content in the tree canopy using multispectral canopy footage of custard apple trees taken via Unmanned Aerial Vehicle (UAV) across different growth phases. First, canopy shadows and background noise from the soil are removed from the UAV imagery by using spectral shadow indices across growth phases. The noise-filtered imagery is then used to extract a number of vegetation indices (VIs) and textural features (TFs). Correlation analysis is then used to determine which features are most pertinent for LNC estimation. A two-layer ensemble model is built to quantitatively estimate leaf nitrogen using the stacking ensemble learning (Stacking) principles. Random Forest (RF), Adaptive Boosting (ADA), Gradient Boosting Decision Trees (GBDT), Linear Regression (LR), and Extremely Randomized Trees (ERT) are among the basis estimators that are integrated in the first layer. By detecting and eliminating redundancy among base estimators, the Least Absolute Shrinkage and Selection Operator regression (Lasso)model used in the second layer improves nitrogen estimation. According to the analysis results, Lasso successfully finds redundant base estimators in the suggested ensemble learning approach, which yields the maximum estimation accuracy for the nitrogen content of custard apple trees’ leaves. With a root mean square error (RMSE) of 0.059 and a mean absolute error (MAE) of 0.193, the coefficient of determination (R²) came to 0. 661. The significant potential of UAV-based ensemble learning techniques for tracking nitrogen nutrition in custard apple leaves is highlighted by this work. Additionally, the approaches investigated might offer insightful information and a point of reference for UAV remote sensing applications in nitrogen nutrition monitoring for other crops.

Keywords:

canopy nitrogen content; UAV remote sensing; ensemble learning; Lasso model

1. Introduction

Because of its unique flavor and great nutritional content, the custard apple tree is a popular fruit crop among customers [1]. The most crucial mineral nutrient component for fruit tree growth is nitrogen. Precision fertilization and management decision-making in orchards depend on quick, precise, dynamic, and extensive nitrogen monitoring and diagnosis [2]. As a reliable, effective, and broadly used detection method, remote sensing offers crucial information support for sustainable development and precise crop nutrient management [3]. In terms of managing orchards, including estimating leaf area [4], water content [5], and chlorophyll levels [6], it has had considerable success.

Satellite imagery [7], UAV remote sensing [8], and ground-based hyperspectral data [9] are all examples of remote sensing data. Hyperspectral data collection on the ground is labor-intensive, time-consuming, and prone to human error [10]. However, satellite remote sensing data has slow acquisition cycles, poor resolution, and is susceptible to cloud cover [11]. On the other hand, because of its versatility, quick and real-time data gathering capabilities, affordability, and superior image spatial resolution, Unmanned Aerial Vehicle (UAV) remote sensing technology has advanced quickly in crop nitrogen nutrition monitoring [12]. Based on this, UAV remote sensing technology demonstrates clearer advantages in tasks that require small-scale and high-frequency monitoring. However, elements like canopy shadows [13] and soil background [14,15] might degrade the quality of UAV photography by lowering the reflectance measured. To eliminate the impacts of soil background and canopy shadow, multispectral shadow indices [16] are therefore crucial.

Since drone technology has become so popular, numerous researchers have evaluated crop development and the general health trends of field crops using various spectral bands and vegetation indices obtained from UAVs. For instance, Lu et al. had encouraging findings when they combined UAV RGB photos with vegetation indices, plant height, and canopy coverage to quantify the nitrogen content of summer maize leaves [17]. Rich texture features found in high-resolution remote sensing photos are crucial for research since they are not affected by color or brightness [18]. In order to assess wheat nitrogen nutritional indices, Zhang et al. [19] combined spectral, thermal, and textural information with machine learning techniques. Additionally, Zhuang et al. [20] discovered that the best regression accuracy was obtained by using UAV-derived VIs and TFs as predictors for estimating tea plant LAI. This was more precise than using only VIs or multispectral bands. In a similar vein, Wang et al. [21] created a model that combined vegetation and texture indices and demonstrated great generalization ability while accurately assessing the nitrogen content of rice stems and leaves.

Based on spectral data, there are now four primary approaches for crop nitrogen status monitoring: physical models, parametric regression, linear non-parametric regression, and non-linear non-parametric regression (machine learning algorithms) [22]. As UAV technology advances, machine learning methods are being used more and more in UAV-based hyperspectral measurement research [23]. According to the literature, Liu et al. [24] developed a quantitative model using multiple linear regression (MLR) and backpropagation neural network (BPNN) algorithms to estimate the LNC of winter wheat at four growth stages. Jia et al. [25] used stepwise MLR and BPNN methods to predict the LNC of flue-cured tobacco leaves. After reducing dimensionality using the SPA, LASSO, and Elastic Net methods, Cao [26] constructed nine models to invert nitrogen content using multiple linear regression, stepwise regression, and partial least squares. A robust relationship between nitrogen content and spectral characteristics was established using inverse modeling techniques, providing a more complex and accurate estimation method. The performance of this method [27] was also demonstrated in the hyperspectral estimation of nitrogen content in tobacco leaves using UAVs. The development of UAV-based hyperspectral imaging technology provides a non-destructive and high-throughput approach for effectively monitoring the physiological and biochemical traits of crops [28]. Therefore, the use of UAV remote sensing to estimate LNC has become a key technical tool nowadays.

However, individual models frequently fall short of their full potential due to their limits in terms of complexity and data diversity [27,29]. Multiple separate machine learning models are combined in ensemble learning, which has benefits over single models, including increased generalization capabilities, decreased overfitting, and better prediction accuracy [30]. Ensemble learning predictive models include Bagging, Stacking, and Boosting. Stacking can give full play to the characteristics and advantages of the base model, and the choice of the base model is relatively flexible and can be adapted to different types of data and problems. An excessive number of base models increases the model complexity and computational cost, and previous studies usually chose linear models in the selection of meta-models [31]. This means that only the base model results are used as inputs to the meta-model, ignoring the performance of the base model. When the base model does not work well, simple stacking creates redundancy, and the meta-modeling results are compromised. Lasso regression [32] is added to the second layer of the model to apply weights to each base model, which aids in choosing the top-performing models because machine learning algorithms perform very differently.

The custard apple trees have high economic value, but their cultivation process lacks standardization and informatization. Therefore, the aim is to use remote sensing data to construct a crop nutrient monitoring model, targeting precise and real-time nitrogen content monitoring of the custard apple trees. In light of this, this study builds an ensemble model for nitrogen content estimate using a Stacking architecture that incorporates RF [33], GBDT [34], ADA [35], LR [36], and ERT [37] with UAV remote sensing data, spectral information, and texture characteristics. A well-balanced ensemble learning framework is created by applying Lasso regression to calculate the weights of the basic models. The effectiveness of the suggested ensemble learning approach for inverting the nitrogen content of custard apple plants is assessed, offering insightful information to direct agricultural production methods. The specific contributions of this study are as follows:

(1): The NDCSI index was used to perform threshold-based removal of shadow noise and soil background in UAV imagery.
(2): UAV remote sensing imagery data were collected, and LNC data were measured. Eleven typical VIs were constructed. The correlation coefficients between spectral features, texture features, and LNC were initially screened using Pearson correlation, and sensitive features were further selected through RF modeling.
(3): To further improve the inversion quality of LNC, a Lasso-Stacking model with a feature selection function was constructed for nitrogen content monitoring in custard apple.

2. Materials and Methods

2.1. Study Area

The study region is situated at 101°47′ E longitude and 25°20′ N latitude in Yuanmou County, Kunming City, Yunnan Province. The topography of Yuanmou County is low in the center and high on the sides. It is situated in a low-elevation, low-latitude highland area. The subtropical monsoon climate of South Asia is what defines the warm climate. With an average yearly temperature of 21.9 °C, the region faces drought and little precipitation. It is a natural greenhouse because the temperature stays high all year round, and there is never any frost. Yuanmou is the perfect place to grow custard apples because of its special topographical circumstances. An overview of the study area is shown in Figure 1.

2.2. UAV Imagery Acquisition and Preprocessing

With a maximum flight altitude of 6000 m, a maximum horizontal flight speed of 21 m per second, a maximum wind resistance speed of 12 m per second, a maximum tilt angle of 35 degrees, and a maximum flight duration of 46 min, the DJI Mavic 3M quadcopter (Shenzhen DJI Innovation Technology Co., Shenzhen, China) was the UAV utilized for this investigation. A 20-megapixel digital camera and four 5-megapixel multispectral cameras (for the green (G), red (R), red edge (RE), and near-infrared (NIR) bands) make up the image sensor system in use. The camera brand is DJI, and the model is the Mavic 3 Multispectral Camera. The visible light camera model is 4/3 CMOS, and the multispectral camera model is 1/2.8-inch CMOS. Table 1 provides a summary of the UAV camera parameters. Figure 2 shows a physical drawing of the drone. The drone was manufactured by China’s Shenzhen DJI Innovation Technology Co., Shenzhen, China.

UAV imaging data collection was carried out in the custard apple orchard on 24 May 2023, 26 August 2023, and 1 November 2023. Imaging was done between 10:00 and 14:00 on clear, mostly cloudless weather days to guarantee data accuracy and dependability. The UAV flight area is within the range of latitude 25°72′ and longitude 101°79′. To get accurate reflectance data, a calibrated white reference panel was set up before every flight. The UAV’s flight parameters were predetermined to be 6 m/s, 50 m height, 80% forward overlap, and 75% side overlap.

DJI Terra 3.1.0 and ENVI 5.6 software were used for image mosaicking and preprocessing following UAV image acquisition. Control points were used to apply geometric correction to the picture, and then a pseudo-invariant feature-based radiometric correction technique was used. This method used the reflectance data from the calibrated white reference panel to convert the multispectral images’ digital number (DN) values to reflectance values. To enable later radiometric correction of the photos, the reflectance of the calibration panel was measured using an ASD spectrometer prior to each UAV flight. Following the calculation of the DN values for each spectral band in the multispectral images that corresponded to the target crop, the multispectral images were radiometrically corrected using the following formula [38]:

R_{1} = \frac{D_{n 1}}{D_{n 2}} * R_{2}

(1)

Theorem 1.

R1 is the target point’s reflectance, Dn1is the target point’s average DN value, Dn2 is the calibration panel’s default DN value, and R2 is the calibration panel’s measured reflectance value (with values of 0.9459 for green, 0.9455 for red, 0.9487 for red edge, and 0.9395 for near-infrared).

2.3. UAV Imagery Shadow Removal

The imagery is susceptible to influence from soil background and canopy shadows because of the substantial reflectance differences between soil and custard apple trees and the trees’ intricate three-dimensional structure. The accuracy of inversion is impacted by this interference since it lowers the quality of the custard apple trees’ reflectance data. These impacts can be successfully lessened by using a threshold segmentation technique. For efficient shadow removal, according to the experimental treatment of canopy shading by Luo, the Normalized Difference Canopy Shadow Index (NDCSI) [39] threshold of 0.3 [40] was obtained to have a better effect (Figure 3). The following is the NDCSI formula:

N D C S I = \frac{R_{N I R} - R_{R}}{R_{N I R} + R_{R}} \frac{R_{R E} - R_{R E M I N}}{R_{R E M A X} - R_{R E M I N}}

(2)

Theorem 2.

_NIR represents the reflectance in the near-infrared band, R_R represents the reflectance in the red band, R_RE represents the reflectance in the red-edge band, R_REMIN is the minimum value of the red-edge reflectance in the image, and R_REMAX is the maximum value of the red-edge reflectance in the image.

2.4. Information from the Ground Gathering and Measuring the Nitrogen Level in Custard Apple Leaves

Leaf samples from custard apple trees were gathered for this investigation in three crucial months: 24 May 2023, 26 August 2023, and 1 November 2023. In the 29 sampling areas distributed across six plots, three representative leaf samples from the custard apple trees were collected in each sampling area. The sampling points are shown in Table 2 below. Samples were first inactivated in the lab by heating them to 105 °C for 30 min, then drying them at 80 °C until their weight remained constant. An elemental analyzer was used to measure the amount of nitrogen. To quantify nitrogen, the samples were first burned at a high temperature, which allowed the nitrogen to react with the oxygen and create nitrogen oxides. These nitrogen oxides were then converted to nitrogen gas under the circumstances. Lastly, by measuring the amount of nitrogen gas generated, the analyzer determined the nitrogen content of the samples.

2.5. Extraction of Vegetation Indices from Multispectral Imagery

Compared to raw band reflectance, vegetation indices, which are derived from the reflectance values of multiple spectral bands, provide a more accurate assessment of vegetation health. These indices offer a rapid and efficient approach to evaluating plant status. Several vegetation indices, including the Modified Normalized Difference Red Edge (MNDRE), Normalized Red Index (NRI), Red Edge Simple Ratio (RESR), Transformed Normalized Difference Vegetation Index (TNDVI), Normalized Difference Vegetation Index (NDVI), Wide Dynamic Range Vegetation Index (WDRVI), Normalized Green Index (NGI), Modified Simple Ratio (MSR), Normalized Red Edge Index (NREI_R), Ratio Vegetation Index (RVI), and Normalized Red Edge Index (NREI), were calculated using the reflectance data from UAV imagery. Additionally, four spectral bands—green, red, red edge, and near-infrared—were included in the analysis alongside the original reflectance data. In each growth stage, among the 29 sampling areas, representative regions with uniform growth distribution were selected as data sources based on radiometrically corrected UAV remote sensing images combined with BeiDou Navigation Satellite System (BDS, Chance Location Network Ltd., Shanghai, China) coordinates of ground measurement points (as shown in Table 2). The area of each sampling region in the images was set to 20 × 20 pixels, totaling 400 pixels, which is considered to represent the leaf area of custard apple in UAV imagery. From these 400 pixels, the reflectance of G, R, RE, and NIR bands was extracted, and their average values were calculated to construct the spectral feature parameters. The same method was applied to texture features. Table 3 presents the vegetation indices constructed using spectral bands from UAV imagery.

2.6. Multispectral Image Texture Features

The Gray Level Co-occurrence Matrix (GLCM) is the most widely used technique for texture extraction in statistical methods, structured texture analysis, and spectral analysis. Eight GLCM-based texture features—Mean, Variance (Var), Homogeneity (Hom), Contrast (Con), Dissimilarity (Dis), Entropy (Ent), Second Moment (Sec), and Correlation (Cor)—were computed for every spectral band in this investigation. To increase the precision of estimating the nitrogen content of custard apple trees, these texture features offer important information on the structure and texture patterns of the canopy. The calculations [50] are as follows:

M e a n = \sum_{i = 0}^{G - 1} i \cdot p (i)

(3)

V a r i a n c e = \sum_{i = 0}^{G - 1} {(i - μ)}^{2} \cdot p (i)

(4)

H o m o g e n e i t y = \sum_{i, j} \frac{p (i, j)}{1 + {(i - j)}^{2}}

(5)

C o n t r a s t = \sum_{i, j} {(i - j)}^{2} {(i - j)}^{2}

(6)

D i s s i m i l a r i t y = \sum_{i = 1}^{G} \sum_{j = 1}^{G} p (i, j) \cdot |i - j|

(7)

E n t r o p y = \sum_{i, j} p {(i, j)}^{2}

(8)

S e c o n d M o m e n t = \sum_{i = 0}^{G - 1} \sum_{j = 0}^{G - 1} p (i, j) \cdot (i - μ) (j - μ)

(9)

C o r r e l a t i o n = \frac{\sum_{i, j} (i - μ) (j - μ) p (i, j)}{σ_{i} σ_{j}}

(10)

Theorem 3.

Where (i, j) represent the grayscale level indices, p (i,j) is the joint probability of grayscale levels i and j in the GLCM (Gray Level Co-occurrence Matrix). μ is the mean grayscale value of the image, and σ is the standard deviation of the grayscale levels.

2.7. Development of the Stacking Ensemble Learning Method for Calculating LNC of Custard Apple Leaf

The LNC of custard apple trees is estimated in this study using a stacking ensemble learning framework. An ensemble learning technique called stacking improves overall predictive accuracy by combining the predictions of several base learners. Prediction accuracy is frequently greatly increased by stacking, which takes advantage of the advantages of various models. The base learners in this work were trained and estimated using 5-fold cross-validation. A meta-model for nitrogen content estimation was then trained using the estimation findings from these base learners. By removing estimation biases from the base learners, the meta-learner in the stacking model enhances the ensemble model’s overall prediction accuracy and generalization capacity. RF, GBDT, ADA, LR, and ERT are among the base learners selected for this investigation. The ensemble model’s meta-learner is chosen to be Lasso regression. Figure 4 illustrates the workflow design of the ensemble learning approach for estimating custard apple LNC.

2.8. Techniques for Ensemble Learning

The following models are selected as basis learners in the ensemble learning design: Lasso regression is employed as the meta-model, along with RF, GBDT, ADA, LR, and ERT.RF is a decision tree-based ensemble learning technique. For classification or regression, it constructs several decision trees and aggregates their results. A random selection of features is used to determine the split at each node, and each tree is trained on a subset of the data. RF improves generalization skills and lessens model overfitting. Weighted voting is the foundation of the ensemble learning technique known as ADA.

To increase the predictive capacity of the model, it builds several weak classifiers, such as decision trees. Misclassified samples are given greater weights in each iteration, which forces later classifiers to concentrate more on these challenging cases. Assuming a linear relationship between the input and output variables, linear regression is one of the most basic regression models. LR and linear classifiers, such as logistic regression, are examples of common linear models. ERT is an RF variation. The main distinction is that the dividing method is more “extreme.” Instead of choosing the best split point, a random split point is selected for every feature in each tree. The model’s variety is increased by this method. A linear regression model called Lasso seeks to minimize both the L1 norm of the coefficients and the sum of squared residuals. A sparse model is produced via the L1 regularization term, which essentially performs variable selection by shrinking some coefficients to zero. By lessening susceptibility to noise in the training data, Lasso enhances the model’s capacity for generalization and helps avoid overfitting in high-dimensional data. It is rather easy to use and enables regularization settings to be changed to balance the model. The formula [51] is as follows:

a s s o = \min_{β} (\frac{1}{2 n} \sum_{i = 1}^{n} {(y_{i} - β_{o} - \sum_{j = 1}^{p} x_{i j} β_{j})}^{2} + λ \sum_{j = 1}^{p} |β_{j}|)

(11)

Theorem 4.

Represents the number of samples, p is the number of features, y_i is the true value of the i-th sample,

x_{i j}

is the value of the j-th feature for the i-th sample, β₀ is the intercept term, β_j is the regression coefficient for the j-th feature, and λ is the regularization parameter.

Lasso regression is employed as the meta-model to build the ensemble model, while RF, GBDT, ADA, LR, and ERT are chosen as foundation models. The performance of each base model in terms of its capacity to estimate the data is assessed using Lasso. To create 68 training samples and 18 validation samples, 86 chlorophyll content samples are randomly shuffled and divided into training and validation sets in an 80:20 ratio during the modeling phase. The estimation accuracy is assessed using the following metrics: RMSE, MAE, and R². Among these, the estimation accuracy increases with R2 being closer to 1 and MAE and RMSE values being smaller. The calculations [52,53] are as follows:

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - x_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(12)

MAE = \frac{1}{n} \sum_{i = 1}^{n} |y_{i} - x_{i}|

(13)

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - x_{i})}^{2}}

(14)

Theorem 5.

y_{i}

represents the actual nitrogen content,

x_{i}

represents the predicted nitrogen content,

\bar{y}

is the mean of the actual nitrogen values, and

n

is the number of samples.

Through the above data collection and feature extraction, this study prepares to compare five regression models and their ensemble effects. The next section will discuss in detail the construction and validation process of each model.

3. Results and Analysis

3.1. Correlation Analysis of Multispectral Spectral Features and Nitrogen Content

A correlation analysis was conducted between the spectral features and the obtained nitrogen content in the custard apple canopy. The results showed that the correlation coefficients between the four original bands (G, R, RE, NIR) and nitrogen content ranged from 0.027 to 0.554, with values of 0.554, 0.432, 0.027, and 0.136, respectively. According to Pearson correlation analysis, the G and R bands exhibited a moderate correlation with the nitrogen content in the custard apple canopy. The correlation coefficients between nitrogen content and the computed vegetation indices (MNDRE, NRI, RESR, TNDVI, NDVI, WDRVI, NGI, MSR, NREI_R, RVI, and NERI) ranged from 0.648 to 0.422. By integrating the correlations between the various spectral bands, a noticeable improvement was observed compared to the original bands, with a smaller variance. The correlation coefficients were as follows: 0.647, 0.509, 0.495, 0.477, 0.476, 0.469, 0.454, 0.453, 0.453, 0.444, 0.433 and 0.422. According to Pearson correlation analysis, the vegetation indices all exhibited moderate to strong correlations with the canopy nitrogen content (LNC). Figure 5 presents the heatmap of spectral features with moderate or stronger correlations with LNC.

3.2. Nitrogen Content and Band Texture Features Correlation Analysis

The correlation analysis between all extracted multispectral texture features and the obtained LNC of custard apple is shown in Table 4. The results indicate that the texture indices of the red band exhibited a relatively strong correlation with the canopy nitrogen content, with correlation coefficients ranging from 0.348 to 0.548. In contrast, the texture indices of the red edge and near-infrared bands showed a weaker correlation, like the results observed for the red edge and near-infrared bands in the previous analysis. Table 4 presents the correlation values between the LNC of custard apple and UAV-derived spectral TFs.

3.3. Remote Sensing Modeling and Estimation of the Custard Apple Tree LNC

3.3.1. Selection of Input Feature Variables

To explore the optimal accuracy of three different input methods—spectral features, texture features, and the combination of spectral and texture features—random forest regression modeling was performed. The results are presented in Table 5. As shown in the table, the random forest model that combined both spectral and texture features as input achieved the highest accuracy. The modeling approach that incorporated texture features along with vegetation indices outperformed the modeling approach using single spectral features in estimating LNC. Specifically, the R² value increased by approximately 0.1 compared to using only spectral features, while both the MAE and RMSE decreased. Figure 6 presents the heatmap of the correlation between LNC and spectral as well as texture features. All features show a correlation with LNC greater than 0.4. Some spectral features have correlations with nitrogen content close to or exceeding 0.6, reflecting the direct influence of nitrogen content on vegetation reflectance. Although the correlation of texture features is slightly lower than that of some spectral features, they still exhibit significant correlations (all greater than 0.5). The correlation between spectral features and texture features is relatively low (represented by smaller dots in the heatmap), indicating that the two types of features are complementary in terms of information.

3.3.2. Inversion Modeling Using Ensemble Learning

The best input features were found and utilized in a stacking model to forecast the nitrogen content of the custard apple canopy following feature selection using Pearson correlation and variable selection using Random Forest. Data were standardized to minimize feature inconsistencies and improve model accuracy. Evaluation metrics for canopy nitrogen prediction using the Stacking-Lasso regression model and several base models are displayed in the following Table 5. While lower RMSE and MAE values suggest smaller disparities between anticipated and real values, showing stronger predictive potential for canopy nitrogen, higher values indicate better accuracy and regression performance. Every base model first made a primary regression prediction, and the second layer’s Lasso used the predictions as input. The final prediction was produced by Lasso by combining the outputs from the first-layer models. According to Table 5 and Figure 7, ERT and LR were comparatively weaker base models than ADA and RF. By analyzing the evaluation metrics in Table 5, it can be concluded that the performance of the ensemble learning model is superior to that of all the base models, although the improvement is relatively small. From the analysis of Figure 7, it is evident that the regression fit curve of the ensemble learning model is more concentrated around the trend line compared to the scatter points of the base models. This indicates that the stacking model is more accurate in capturing the main trend in the data. The stacking model demonstrates better performance in capturing the overall trend in the data, reducing overfitting, and improving prediction stability. Lasso functioned as the secondary learner in the ensemble framework of this study by automatically choosing and weighting the base models. By suppressing redundant or underperforming models, Lasso’s regularization process successfully emphasized the models that made the biggest contribution to the total prediction. ADA and RF had strong positive weights, according to weight analysis, indicating that these models were able to collect crucial predictive data. In contrast, ERT had a negative weight, which Lasso used to challenge the idea that its predictions would contain some noise or inaccuracy. With a weight of zero, the GBDT model was eliminated without changing the results of the Stacking model, indicating that although its predictions overlapped with those of other models, they were less successful, which is why it was given zero weight. This weighting strategy successfully decreased duplication, increased prediction accuracy, and simplified the model. Table 6 presents the accuracy of the base models and the meta-model. To assess the significant differences in model performance, we conducted a paired t-test on MAE, RMSE, and R² values. The test results showed that the performance of the ensemble model (Stacking) significantly differed from all base models (RF, GBDT, ADA, ERT, and LR) in terms of MAE and RMSE (p < 0.05). For the R² metric, the difference between the Stacking model and RF and ADA models did not reach statistical significance, but the significance in MAE and RMSE still indicates superior overall performance. The ensemble model effectively reduced errors on the test set by combining the predictive capabilities of multiple base models. Figure 7 illustrates the fitting curve of the prediction model and the weights of the base models, From (a) RF, (b) GBRT, (c) ADA, (d) ERT, and (e) LR, it can be observed that the base models generally show good fitting ability in terms of overall trends but still exhibit some bias, with the data points in the high-value regions being more widely scattered. This suggests that single models may have limitations within certain specific ranges. In the (f) Stacking plot, it can be observed that the fitting curve of the ensemble model is more concentrated around the trend line, with data points being more tightly distributed compared to the base models. The Lasso meta-model effectively reduces model bias by integrating the outputs of the base models, enhancing the robustness of the predictions. The weight distribution in (g) shows that Lasso avoids the risk of overfitting a single model by assigning sparse weights to the base models in the ensemble.

3.3.3. Using Ensemble Learning for UAV-Based Nitrogen Content Estimation in Custard Apple Leaves

Spatial distribution maps of custard apple tree leaf nitrogen content during the three growth stages of May, August, and November were produced using the ensemble learning technique previously mentioned. Figure 8 shows clear trends in nitrogen content during various time periods. Concurrent fertilization techniques support the high nitrogen requirement in May when new leaves and shoots are growing vigorously. Usually, this leads to increased concentrations of nitrogen in the canopy. The trees still need nitrogen when they reach the fruit enlargement phase in August, but as the fruit develops, their need for other nutrients (like calcium and potassium) gradually rises, and their nitrogen concentration may stay mostly constant or even slightly decrease from earlier stages. Lower canopy nitrogen concentrations occur in November because of a decline in nitrogen demand when fruits mature. To prepare for growth the next year, trees may start storing nitrogen in their root systems during this time. All things considered, the average nitrogen content in May was 4.6, in August, it was 4.2, and in November, it was 3.9, the geographical distribution of nitrogen in the leaves seems to fit in nicely with the normal growth patterns of custard apple trees. Notably, certain regions show indications of nitrogen deficit in August, which is probably caused by fertilization techniques or environmental changes. This implies that during this crucial stage of growth, focused fertilization and water management, as well as prompt adaptation to shifting environmental conditions, may be advantageous. The LNC in May, August, and November gradually decreases, which generally follows the crop growth pattern. In August, there are a few locations where nitrogen content is missing, which highlights the significance of remote sensing. In future work, this can be used to timely supplement nitrogen fertilizer to ensure the yield and quality of the custard apple.

The above results show that the stacking model outperforms the base models in terms of overall performance and provides the LNC estimation maps. The next section will further explore the biological or physical reasons behind this performance improvement, as well as the issues related to model applicability.

4. Discussion

Numerous studies have investigated the estimation of canopy nitrogen content using remote sensing since it is a critical indicator of plant growth and development. It is useful to use UAVs to rapidly estimate the amount of nitrogen in custard apple canopies to improve yield and quality. However, there is frequently a lot of interference from shadow and soil noise in UAV multispectral data. NDCSI can effectively eliminate interference caused by soil background and lighting variations. After preprocessing with NDCSI, the correlation of VIs significantly improves compared to the original imagery, with an increase of 0.2 or more in correlation strength. This improvement lays a solid foundation for more accurately reflecting crop nitrogen content. According to Xu et al. [54], nitrogen estimation can be improved by eliminating shadow and soil noise, which emphasizes the need for threshold-based preprocessing of multispectral shadow indices.

The relationship between crop physicochemical characteristics and spectral reflectance is successfully improved by this preprocessing step. Research on custard apple trees is still scarce despite the prevalence of studies on vegetative indices and textural characteristics in key staple crops. Our research shows that custard apple trees can benefit from the combination of vegetation indices and multispectral texture features for inversion modeling, which increases model accuracy. Through analysis, we found that spectral features can effectively reflect the growth, health status, and environmental conditions of plants, while texture features can reveal microscopic changes during the crop growth process. For instance, areas with higher nitrogen levels may exhibit more uniform textures, while low-nitrogen areas may show greater variation and irregularity. Spectral features primarily focus on the physiological state and growth condition of plants, whereas texture features emphasize spatial distribution and local variations. By combining both, the model can gain a more comprehensive understanding of the crop’s nitrogen requirements and growth dynamics. Our findings are in line with those of Zhuang et al. [20], who discovered that adding texture features to inversion modeling in addition to vegetation indices produced models with higher accuracy than those that relied only on vegetation indices. For feature selection, this study used Random Forest and Pearson correlation, although these approaches were a little limited, especially for texture characteristics, where red band texture predominated because of its greater correlation. To improve the selection process, future research will examine a wider range of feature selection techniques. Furthermore, data richness and quality are crucial [55]; for more thorough data gathering, future research may incorporate hyperspectral and UAV digital imagery data. Ensemble learning [27,56] is a new machine learning method that combines several predictive models and capitalizes on their advantages to increase model robustness and accuracy. Consistent with earlier research [29,57], a Stacking-Lasso ensemble model that included RF, GBDT, ADA, LR, and ERT models produced better prediction results in this investigation. Even though ensemble learning increased accuracy for all base models, it only slightly outperformed the top-performing base model. Future studies will investigate various base models and alternative meta-model combinations. Due to its own intrinsic fitting capabilities, Lasso may run the danger of overfitting even if it can assist in feature selection and redundancy removal [58]. The overfitting issue can be effectively mitigated by adjusting the regularization parameters and using meta-modeling methods such as Ridge and Elastic Net. Adding a third model layer or looking for easier ways to determine weights could increase the accuracy and dependability of the model. The purpose of this study was to use UAV multispectral data to quickly and accurately predict canopy nitrogen for custard apple trees, which could help direct their growth. However, the study’s application for widespread use was limited because it only covered one year. In the future, by acquiring data from more years and multiple locations, we will incorporate variables such as year and different geological characteristics into the model. This will help improve the model’s ability to capture differences across years and locations, thereby enhancing its generalization ability and applicability. Future research will use multi-year experiments to collect additional data points and evaluate the findings [59], ultimately enhancing generalizability and usefulness in agricultural monitoring.

Although this study provides a UAV-based remote sensing method for nitrogen monitoring, several directions remain to be explored and optimized:

Integration of Multi-source Data: We plan to integrate UAV remote sensing data with hyperspectral satellite data and ground sensor data to construct a more comprehensive multi-source data fusion framework. This will enhance the model’s universality and its effectiveness in large-scale applications.
Expansion of Spatiotemporal Data: The next step is to further integrate multi-year remote sensing data and multi-location monitoring data to address the challenges of nitrogen content monitoring under varying climatic conditions and crop cultivation patterns, thereby improving the model’s generalization capability.
Model Optimization and Generalization: We will explore more machine learning models (e.g., deep learning methods, ensemble learning) and investigate additional regularization techniques to tackle overfitting, aiming to further enhance the model’s stability and prediction accuracy.

Through these efforts, we hope to extend this method to a wider range of agricultural monitoring applications.

5. Conclusions

When evaluating the nutritional growth state of fruit trees, nitrogen is an essential metric. In this study, multispectral canopy data collected by UAVs at various growth stages was used to quantify the nitrogen content of custard apple leaves. To track and evaluate the amount of nitrogen in the custard apple trees’ canopy, a two-layer ensemble learning model was created. According to the investigation, the association between the acquired vegetation indices, textural features, and nitrogen content was much enhanced when canopy shadow and soil background noise were filtered using a spectral shadow index as opposed to raw data. The accuracy of nitrogen estimation was significantly higher in models that integrated textural data and vegetation indices than in models that only used vegetation indices. With R2 values of 0.62 and 0.61, respectively, ADA and RF stood out among single machine-learning models for their strong stability and estimation accuracy. However, with an R2 of 0.661, RMSE of 0.059, and MAE of 0.193, the Stacking ensemble model fared better than the individual models on average. As a filter to separate the performance of the underlying models, the second-layer Lasso model worked quite well. According to this study, monitoring nitrogen nutrition in custard apple leaves using UAV-based remote sensing and ensemble learning has a lot of potential. It may also be a useful guide for future studies on the management and development of custard apple trees.

Author Contributions

Conceptualization, X.X., T.C. and W.W.; writing—original draft preparation, L.G. and G.Y.; writing—review and editing, X.J., H.F., Y.L., Y.M. and H.X. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Key Science and Technology Special Project of Yunnan Province (No. 202202AE090013-2), National Natural Science Foundation (No. 42371323), National Key Research and Development Program (No. 2023YFD2300503) and National Modern Agricultural Industry Technology System (No. CARS-03).

Data Availability Statement

The data that support the findings of this study are available from the corresponding author, upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Moussa, A.Y.; Siddiqui, S.A.; Elhawary, E.A.; Guo, K.; Anwar, S.; Xu, B. Phytochemical constituents, bioactivities, and applications of custard apple (Annona squamosa L.): A narrative review. Food Chem. 2024, 459, 140363. [Google Scholar] [CrossRef]
Sun, G.Z.; Chen, S.H.; Zhang, S.W.; Chen, S.M.; Liu, J.; He, Q.; Hu, T.T.; Zhang, F.C. Responses of leaf nitrogen status and leaf area index to water and nitrogen application and their relationship with apple orchard productivity. Agric. Water Manag. 2024, 296, 14. [Google Scholar] [CrossRef]
Jiang, H.; Jiang, L.; He, L.; Murengami, B.G.; Jing, X.; Misiewicz, P.A.; Cheein, F.A.; Fu, L. Yield prediction of root crops in field using remote sensing: A comprehensive review. Comput. Electron. Agric. 2024, 227, 109600. [Google Scholar] [CrossRef]
Gao, X.; Yao, Y.; Chen, S.; Li, Q.; Zhang, X.; Liu, Z.; Zeng, Y.; Ma, Y.; Zhao, Y.; Li, S. Improved maize leaf area index inversion combining plant height corrected resampling size and random forest model using UAV images at fine scale. Eur. J. Agron. 2024, 161, 127360. [Google Scholar] [CrossRef]
Cheng, Q.; Zhang, S.; Chen, X.; Cui, H.; Xu, Y.; Xia, S.; Xia, K.; Zhou, T.; Zhou, X. Inversion of reclaimed soil water content based on a combination of multi-attributes of ground penetrating radar signals. J. Appl. Geophys. 2023, 213, 105019. [Google Scholar] [CrossRef]
Li, Y.; Xu, X.; Wu, W.; Zhu, Y.; Yang, G.; Yang, X.; Meng, Y.; Jiang, X.; Xue, H. Hyperspectral Estimation of Chlorophyll Content in Grape Leaves Based on Fractional-Order Differentiation and Random Forest Algorithm. Remote Sens. 2024, 16, 2174. [Google Scholar] [CrossRef]
Wang, K.; Shan, S.; Dou, W.; Wei, H.; Zhang, K. A cross-modal deep learning method for enhancing photovoltaic power forecasting with satellite imagery and time series data. Energy Convers. Manag. 2025, 323, 119218. [Google Scholar] [CrossRef]
Dong, H.; Dong, J.; Sun, S.; Bai, T.; Zhao, D.; Yin, Y.; Shen, X.; Wang, Y.; Zhang, Z.; Wang, Y. Crop water stress detection based on UAV remote sensing systems. Agric. Water Manag. 2024, 303, 109059. [Google Scholar] [CrossRef]
Wu, G.; Al-qaness, M.A.A.; Al-Alimi, D.; Dahou, A.; Abd Elaziz, M.; Ewees, A.A. Hyperspectral image classification using graph convolutional network: A comprehensive review. Expert Syst. Appl. 2024, 257, 125106. [Google Scholar] [CrossRef]
Flynn, K.C.; Witt, T.W.; Baath, G.S.; Chinmayi, H.K.; Smith, D.R.; Gowda, P.H.; Ashworth, A.J. Hyperspectral reflectance and machine learning for multi-site monitoring of cotton growth. Smart Agric. Technol. 2024, 9, 100536. [Google Scholar] [CrossRef]
Segarra, J.; Rezzouk, F.Z.; Aparicio, N.; González-Torralba, J.; Aranjuelo, I.; Gracia-Romero, A.; Araus, J.L.; Kefauver, S.C. Multiscale assessment of ground, aerial and satellite spectral data for monitoring wheat grain nitrogen content. Inf. Process. Agric. 2023, 10, 504–522. [Google Scholar] [CrossRef]
Colomina, I.; Molina, P. Unmanned aerial systems for photogrammetry and remote sensing: A review. ISPRS J. Photogramm. Remote Sens. 2014, 92, 79–97. [Google Scholar] [CrossRef]
Zhang, C.; Chen, Z.; Yang, G.; Xu, B.; Feng, H.; Chen, R.; Qi, N.; Zhang, W.; Zhao, D.; Cheng, J.; et al. Removal of canopy shadows improved retrieval accuracy of individual apple tree crowns LAI and chlorophyll content using UAV multispectral imagery and PROSAIL model. Comput. Electron. Agric. 2024, 221, 108959. [Google Scholar] [CrossRef]
Almeida-Ñauñay, A.F.; Tarquis, A.M.; López-Herrera, J.; Pérez-Martín, E.; Pancorbo, J.L.; Raya-Sereno, M.D.; Quemada, M. Optimization of soil background removal to improve the prediction of wheat traits with UAV imagery. Comput. Electron. Agric. 2023, 205, 107559. [Google Scholar] [CrossRef]
Liu, S.; Jin, X.; Bai, Y.; Wu, W.; Cui, N.; Cheng, M.; Liu, Y.; Meng, L.; Jia, X.; Nie, C.; et al. UAV multispectral images for accurate estimation of the maize LAI considering the effect of soil background. Int. J. Appl. Earth Obs. Geoinf. 2023, 121, 103383. [Google Scholar] [CrossRef]
Xu, Z.-H.; Lin, L.; Wang, Q.-F.; Huang, X.-Y.; Liu, J.; Yu, K.-Y.; Chen, C.-C. Construction and application effects of normalized shaded vegetation index (NSVI). J. Infrared Millim. Waves 2018, 37, 154–162. [Google Scholar]
Lu, J.; Cheng, D.; Geng, C.; Zhang, Z.; Xiang, Y.; Hu, T. Combining plant height, canopy coverage and vegetation index from UAV-based RGB images to estimate leaf nitrogen concentration of summer maize. Biosyst. Eng. 2021, 202, 42–54. [Google Scholar] [CrossRef]
Wang, Y.; Gu, M. Classification methods for hyperspectral remote sensing images with weak texture features. J. Radiat. Res. Appl. Sci. 2024, 17, 101019. [Google Scholar] [CrossRef]
Zhang, S.; Duan, J.; Qi, X.; Gao, Y.; He, L.; Liu, L.; Guo, T.; Feng, W. Combining spectrum, thermal, and texture features using machine learning algorithms for wheat nitrogen nutrient index estimation and model transferability analysis. Comput. Electron. Agric. 2024, 222, 109022. [Google Scholar] [CrossRef]
Zhuang, Z.-H.; Tsai, H.P.; Chen, C.-I.; Yang, M.-D. Subtropical region tea tree LAI estimation integrating vegetation indices and texture features derived from UAV multispectral images. Smart Agric. Technol. 2024, 9, 100650. [Google Scholar] [CrossRef]
Wang, D.; Li, R.; Liu, T.; Liu, S.; Sun, C.; Guo, W. Combining vegetation, color, and texture indices with hyperspectral parameters using machine-learning methods to estimate nitrogen concentration in rice stems and leaves. Field Crops Res. 2023, 304, 109175. [Google Scholar] [CrossRef]
Fu, Y.; Yang, G.; Pu, R.; Li, Z.; Li, H.; Xu, X.; Song, X.; Yang, X.; Zhao, C. An overview of crop nitrogen status assessment using hyperspectral remote sensing: Current status and perspectives. Eur. J. Agron. 2021, 124, 126241. [Google Scholar] [CrossRef]
Bossung, C.; Schlerf, M.; Machwitz, M. Estimation of canopy nitrogen content in winter wheat from Sentinel-2 images for operational agricultural monitoring. Precis. Agric. 2022, 23, 2229–2252. [Google Scholar] [CrossRef]
Liu, H.; Zhu, H.; Wang, P. Quantitative modelling for leaf nitrogen content of winter wheat using UAV-based hyperspectral data. Int. J. Remote Sens. 2017, 38, 2117–2134. [Google Scholar] [CrossRef]
Jia, F.; Liu, G.; Liu, D.; Zhang, Y.; Fan, W.; Xing, X. Comparison of different methods for estimating nitrogen concentration in flue-cured tobacco leaves based on hyperspectral reflectance. Field Crops Res. 2013, 150, 108–114. [Google Scholar] [CrossRef]
Cao, C.; Wang, T.; Gao, M.; Li, Y.; Li, D.; Zhang, H. Hyperspectral inversion of nitrogen content in maize leaves based on different dimensionality reduction algorithms. Comput. Electron. Agric. 2021, 190, 106461. [Google Scholar] [CrossRef]
Zhang, M.; Chen, T.E.; Gu, X.; Kuai, Y.; Wang, C.; Chen, D.; Zhao, C. UAV-borne hyperspectral estimation of nitrogen content in tobacco leaves based on ensemble learning methods. Comput. Electron. Agric. 2023, 211, 108008. [Google Scholar] [CrossRef]
Li, B.; Xu, X.; Zhang, L.; Han, J.; Bian, C.; Li, G.; Liu, J.; Jin, L. Above-ground biomass estimation and yield prediction in potato by using UAV-based RGB and hyperspectral imaging. ISPRS J. Photogramm. Remote Sens. 2020, 162, 161–172. [Google Scholar] [CrossRef]
Du, R.; Lu, J.; Xiang, Y.; Zhang, F.; Chen, J.; Tang, Z.; Shi, H.; Wang, X.; Li, W. Estimation of winter canola growth parameter from UAV multi-angular spectral-texture information using stacking-based ensemble learning model. Comput. Electron. Agric. 2024, 222, 109074. [Google Scholar] [CrossRef]
Liang, M.; Chang, T.; An, B.; Duan, X.; Du, L.; Wang, X.; Miao, J.; Xu, L.; Gao, X.; Zhang, L.; et al. A Stacking Ensemble Learning Framework for Genomic Prediction. Front. Genet. 2021, 12, 600040. [Google Scholar] [CrossRef] [PubMed]
Xiong, W.; Xiao, L.; Han, D.; Yue, W. A prediction model for water absorption in sublayers based on stacking ensemble learning method. Geoenergy Sci. Eng. 2024, 240, 212988. [Google Scholar] [CrossRef]
Tibshirani, R. Regression Shrinkage and Selection Via the Lasso. J. R. Stat. Soc. Ser. B (Methodol.) 1996, 58, 267–288. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Freund, Y.; Schapire, R.E. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef]
Petruseva, S.; Zileska Pancovska, V.; Žujo, V.; Brkan-Vejzović, A. Construction costs forecasting: Comparison of the accuracy of linear regression and support vector machine models. Tech. Gaz. 2017, 24, 1431–1438. [Google Scholar] [CrossRef]
Geurts, P.; Ernst, D.; Wehenkel, L. Extremely randomized trees. Mach. Learn. 2006, 63, 3–42. [Google Scholar] [CrossRef]
Zhu, H.; Huang, Y.; An, Z.; Zhang, H.; Han, Y.; Zhao, Z.; Li, F.; Zhang, C.; Hou, C. Assessing radiometric calibration methods for multispectral UAV imagery and the influence of illumination, flight altitude and flight time on reflectance, vegetation index and inversion of winter wheat AGB and LAI. Comput. Electron. Agric. 2024, 219, 108821. [Google Scholar] [CrossRef]
Xu, N.; Tian, J.; Tian, Q.; Xu, K.; Tang, S. Analysis of Vegetation Red Edge with Different Illuminated/Shaded Canopy Proportions and to Construct Normalized Difference Canopy Shadow Index. Remote Sens. 2019, 11, 1192. [Google Scholar] [CrossRef]
Shengxian, L.X.X.T.D. Estimation of Citrus Canopy Chlorophyll Based on UAV Multispectral Images. Trans. Chin. Soc. Agric. Mach. 2023, 54, 198–205. [Google Scholar]
Marin, D.B.; Ferraz, G.A.e.S.; Santana, L.S.; Barbosa, B.D.S.; Barata, R.A.P.; Osco, L.P.; Ramos, A.P.M.; Guimarães, P.H.S. Detecting coffee leaf rust with UAV-based vegetation indices and decision tree machine learning models. Comput. Electron. Agric. 2021, 190, 106476. [Google Scholar] [CrossRef]
Erdle, K.; Mistele, B.; Schmidhalter, U. Comparison of active and passive spectral sensors in discriminating biomass parameters and nitrogen status in wheat cultivars. Field Crops Res. 2011, 124, 74–84. [Google Scholar] [CrossRef]
Ahmad, F. Detection of Change in Vegetation Cover Using Multi-Spectral and Multi-Temporal Information for District Sargodha. Glob. J. Hum. -Soc. Sci. Res. 2012, 24, 557–571. [Google Scholar] [CrossRef]
Rouse, J.W.; Haas, R.H.; Deering, D.W.; Schell, J.A.; Harlan, J.C. Monitoring the Vernal Advancement and Retrogradation (Green Wave Effect) of Natural Vegetation. [Great Plains Corridor]. 1973. Available online: https://ntrs.nasa.gov/citations/19750020419 (accessed on 27 January 1973).
Gitelson, A.A. Wide Dynamic Range Vegetation Index for Remote Quantification of Biophysical Characteristics of Vegetation. J. Plant Physiol. 2004, 161, 165–173. [Google Scholar] [CrossRef] [PubMed]
Sripada, R.P.; Heiniger, R.W.; White, J.G.; Meijer, A.D. Aerial Color Infrared Photography for Determining Early In-Season Nitrogen Requirements in Corn. Agron. J. 2006, 98, 968–977. [Google Scholar] [CrossRef]
Chen, J.M. Evaluation of Vegetation Indices and a Modified Simple Ratio for Boreal Applications. Can. J. Remote Sens. 1996, 22, 229–242. [Google Scholar] [CrossRef]
Zhu, Y.; Yao, X.; Tian, Y.; Liu, X.; Cao, W. Analysis of common canopy vegetation indices for indicating leaf nitrogen accumulations in wheat and rice. Int. J. Appl. Earth Obs. Geoinf. 2008, 10, 1–10. [Google Scholar] [CrossRef]
Cao, Q.; Miao, Y.; Wang, H.; Huang, S.; Cheng, S.; Khosla, R.; Jiang, R. Non-destructive estimation of rice plant nitrogen status with Crop Circle multispectral active canopy sensor. Field Crops Res. 2013, 154, 133–144. [Google Scholar] [CrossRef]
Yue, J.; Yang, G.; Tian, Q.; Feng, H.; Xu, K.; Zhou, C. Estimate of winter-wheat above-ground biomass based on UAV ultrahigh-ground-resolution image textures and vegetation indices. ISPRS J. Photogramm. Remote Sens. 2019, 150, 226–244. [Google Scholar] [CrossRef]
Yan, Z.; Yao, Y. Variable selection method for fault isolation using least absolute shrinkage and selection operator (LASSO). Chemom. Intell. Lab. Syst. 2015, 146, 136–146. [Google Scholar] [CrossRef]
Zhang, B.; Wu, W.; Zhou, J.; Dai, M.; Sun, Q.; Sun, X.; Chen, Z.; Gu, X. A spectral index for estimating grain filling rate of winter wheat using UAV-based hyperspectral images. Comput. Electron. Agric. 2024, 223, 109059. [Google Scholar] [CrossRef]
Li, Y.; Xu, X.; Wu, W.; Zhu, Y.; Yang, G.; Gao, L.; Meng, Y.; Jiang, X.; Xue, H. Hyperspectral Estimation of Leaf Nitrogen Content in White Radish Based on Feature Selection and Integrated Learning. Remote Sens. 2024, 16, 4479. [Google Scholar] [CrossRef]
Xu, X.; Fan, L.; Li, Z.; Meng, Y.; Feng, H.; Yang, H.; Xu, B. Estimating Leaf Nitrogen Content in Corn Based on Information Fusion of Multiple-Sensor Imagery from UAV. Remote Sens. 2021, 13, 340. [Google Scholar] [CrossRef]
Chen, H.; Chen, H.; Zhang, S.; Chen, S.; Cen, F.; Zhao, Q.; Huang, X.; He, T.; Gao, Z. Comparison of CWSI and Ts-Ta-VIs in moisture monitoring of dryland crops (sorghum and maize) based on UAV remote sensing. J. Integr. Agric. 2024, 23, 2458–2475. [Google Scholar] [CrossRef]
Hosseini, F.; Prieto, C.; Álvarez, C. Ensemble learning of catchment-wise optimized LSTMs enhances regional rainfall-runoff modelling—Case Study: Basque Country, Spain. J. Hydrol. 2025, 646, 132269. [Google Scholar] [CrossRef]
Hao, X.; Chen, Z.; Yi, S.; Liu, J. Application of improved Stacking ensemble learning in NIR spectral modeling of corn seed germination rate. Chemom. Intell. Lab. Syst. 2023, 243, 105020. [Google Scholar] [CrossRef]
Lakshmi, M.V.; Winkler, J.R. Numerical properties of solutions of LASSO regression. Appl. Numer. Math. 2024, 208, 297–309. [Google Scholar] [CrossRef]
Wang, R.; Dijkstra, F.A.; Liu, H.; Yin, J.; Wang, X.; Feng, X.; Xu, Z.; Jiang, Y. Response of soil carbon to nitrogen and water addition differs between labile and recalcitrant fractions: Evidence from multi–year data and different soil depths in a semi-arid steppe. Catena 2019, 172, 857–865. [Google Scholar] [CrossRef]

Figure 1. Geographical location of the study area.

Figure 2. DJI Mavic 3M.

Figure 3. (a–f) show a comparison between the original images and the images with shadows and soil removed across three growth stages.

Figure 4. Design of an Ensemble Learning Workflow for Estimating LNC in Custard Apple.

Figure 5. Presents the heatmap of spectral features with moderate or stronger correlations with the LNC of custard apple.

Figure 6. Heatmap of the correlation between custard apple LNC and the optimal input variables.

Figure 7. This illustrates the remote sensing estimation of custard apple leaf nitrogen content using different learning methods. (a–e) represent the fitting curves of the base models RF, GBDT, ADA, ERT, and LR, respectively. (f) shows the fitting curve of the meta-model, and (g) presents the Lasso weight graph.

Figure 8. Remote Sensing Monitoring of Custard Apple Leaf Nitrogen Content Based on UAV Multispectral Imagery. (a–c) represent the remote sensing monitoring images of leaf nitrogen content in May, August, and November, respectively.

Table 1. Multispectral camera parameters for UAV.

Waveband Band	Central Wavelength (nm)	Spectral Bandwidth (nm)
G	560 nm + 16 nm	40 nm
R	650 nm + 16 nm	40 nm
RE	730 nm + 16 nm	20 nm
NIR	860 nm + 26 nm	30 nm

Table 2. Latitude and longitude of the leaf sampling points.

Sample	Latitude	Longitude
Sample 1	25.72087188	101.7975859
Sample 2	25.7208494	101.7974183
Sample 3	25.72138112	101.7974339
Sample 4	25.72137894	101.7977156
Sample 5	25.7212916	101.7974944
Sample 6	25.72169407	101.7970533
Sample 7	25.72078584	101.7969467
Sample 8	25.72124224	101.79706
Sample 9	25.72170824	101.7968015
Sample 10	25.72144288	101.7967326
Sample 11	25.72205578	101.7978633
Sample 12	25.72232165	101.7979453
Sample 13	25.7225071	101.7979962
Sample 14	25.722647	101.7978964
Sample 15	25.7223012	101.7978186
Sample 16	25.72183514	101.7986564
Sample 17	25.7219216	101.7983524
Sample 18	25.72219541	101.7983899
Sample 19	25.72257355	101.7984414
Sample 20	25.72248967	101.7986328
Sample 21	25.72218713	101.7985957
Sample 22	25.7215716	101.7990846
Sample 23	25.72172485	101.7989925
Sample 24	25.72230296	101.7989335
Sample 25	25.72242464	101.7991441
Sample 26	25.72145072	101.7982732
Sample 27	25.72123758	101.7981791
Sample 28	25.72109625	101.7980155
Sample 29	25.72087188	101.7975859

Table 3. Vegetation indices constructed using UAV spectral bands.

Vegetation Index (VI)	Formula	References
Modified Normalized Difference Red Edge (MNDRE)	MNDRE = (NIR − RE + 2 × R)/(NIR + RE−2 × R)	[41]
Normalized Red Index (NRI)	NRI = R/(NIR + R + RE)	[41]
Red Edge Simple Ratio (RESR)	RESR = RE/R	[42]
Transformed Normalized Vegetation Index (TNDVI)	TNDVI = sqrt((NIR − R)/(NIR + R) + 0.5)	[43]
Normalized Difference Vegetation Index (NDVI)	NDVI = (NIR − R)/(NIR + R)	[44]
Wide Dynamic Range Vegetation Index (WDRVI)	WDRVI = (0.12 × NIR − R)/(0.12 × NIR + R)	[45]
Normalized Green Index (NGI)	NGI = G/(NIR + G + RE)	[46]
Modified Simple Ratio (MSR)	MSR = (NIR/R − 1)/sqrt (NIR/R + 1)	[47]
Normalized Red Edge Index (NREI_R)	NREI_R = RE/(NIR + RE + R)	[41]
Ratio Vegetation Index (RVI)	RVI = NIR/R	[48]
Normalized Red Edge Index (NREI)	NREI = RE/(NIR + G + RE)	[49]

Table 4. The correlation between the LNC of custard apple and the TFs of each spectral band.

TFs	Waveband
TFs	G	R	RE	NIR
Mean	0.064	0.503	0.205	0.249
Var	0.278	0.541	0.135	0.021
Hom	0.277	0.420	0.205	0.144
Con	0.278	0.506	0.151	0.018
Dis	0.163	0.548	0.007	0.147
Ent	0.094	0.517	0.276	0.298
Sec	0.214	0.459	0.180	0.123
Cor	0.116	0.348	0.182	0.183

Table 5. Comparison of LNC Estimation Using Different Types of Feature Variables.

Input Variables	R²	MAE	RMSE
VIs + R + G	0.539	0.229	0.081
TFs	0.614	0.215	0.067
VIs + TFs + R + G	0.621	0.216	0.066

Table 6. Evaluation metrics for base models and meta-models.

Model	Train			Test
Model	R2	MAE	RMSE	R2	MAE	RMSE
RF	0.612	0.236	0.099	0.621	0.216	0.066
GBDT	0.477	0.270	0.135	0.508	0.237	0.086
ADA	0.763	0.184	0.061	0.616	0.215	0.067
ERT	0.410	0.296	0.152	0.526	0.247	0.083
LR	0.525	0.239	0.096	0.399	0.238	0.105
Stacking	0.851	0.153	0.038	0.661	0.193	0.059

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jiang, X.; Gao, L.; Xu, X.; Wu, W.; Yang, G.; Meng, Y.; Feng, H.; Li, Y.; Xue, H.; Chen, T. Combining UAV Remote Sensing with Ensemble Learning to Monitor Leaf Nitrogen Content in Custard Apple (Annona squamosa L.). Agronomy 2025, 15, 38. https://doi.org/10.3390/agronomy15010038

AMA Style

Jiang X, Gao L, Xu X, Wu W, Yang G, Meng Y, Feng H, Li Y, Xue H, Chen T. Combining UAV Remote Sensing with Ensemble Learning to Monitor Leaf Nitrogen Content in Custard Apple (Annona squamosa L.). Agronomy. 2025; 15(1):38. https://doi.org/10.3390/agronomy15010038

Chicago/Turabian Style

Jiang, Xiangtai, Lutao Gao, Xingang Xu, Wenbiao Wu, Guijun Yang, Yang Meng, Haikuan Feng, Yafeng Li, Hanyu Xue, and Tianen Chen. 2025. "Combining UAV Remote Sensing with Ensemble Learning to Monitor Leaf Nitrogen Content in Custard Apple (Annona squamosa L.)" Agronomy 15, no. 1: 38. https://doi.org/10.3390/agronomy15010038

APA Style

Jiang, X., Gao, L., Xu, X., Wu, W., Yang, G., Meng, Y., Feng, H., Li, Y., Xue, H., & Chen, T. (2025). Combining UAV Remote Sensing with Ensemble Learning to Monitor Leaf Nitrogen Content in Custard Apple (Annona squamosa L.). Agronomy, 15(1), 38. https://doi.org/10.3390/agronomy15010038

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Combining UAV Remote Sensing with Ensemble Learning to Monitor Leaf Nitrogen Content in Custard Apple (Annona squamosa L.)

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Area

2.2. UAV Imagery Acquisition and Preprocessing

2.3. UAV Imagery Shadow Removal

2.4. Information from the Ground Gathering and Measuring the Nitrogen Level in Custard Apple Leaves

2.5. Extraction of Vegetation Indices from Multispectral Imagery

2.6. Multispectral Image Texture Features

2.7. Development of the Stacking Ensemble Learning Method for Calculating LNC of Custard Apple Leaf

2.8. Techniques for Ensemble Learning

3. Results and Analysis

3.1. Correlation Analysis of Multispectral Spectral Features and Nitrogen Content

3.2. Nitrogen Content and Band Texture Features Correlation Analysis

3.3. Remote Sensing Modeling and Estimation of the Custard Apple Tree LNC

3.3.1. Selection of Input Feature Variables

3.3.2. Inversion Modeling Using Ensemble Learning

3.3.3. Using Ensemble Learning for UAV-Based Nitrogen Content Estimation in Custard Apple Leaves

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI