Next Article in Journal
Harnessing Multi-Source Data and Deep Learning for High-Resolution Land Surface Temperature Gap-Filling Supporting Climate Change Adaptation Activities
Previous Article in Journal
Remote Mapping of Bedrock for Future Cosmogenic Nuclide Exposure Dating Studies in Unvisited Areas of Antarctica
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Statewide Forest Canopy Cover Mapping of Florida Using Synergistic Integration of Spaceborne LiDAR, SAR, and Optical Imagery

by
Monique Bohora Schlickmann
1,*,
Inacio Thomaz Bueno
1,
Denis Valle
2,
William M. Hammond
3,
Susan J. Prichard
4,
Andrew T. Hudak
5,
Carine Klauberg
1,
Mauro Alessandro Karasinski
6,
Kody Melissa Brock
1,
Kleydson Diego Rocha
7,
Jinyi Xia
1,
Rodrigo Vieira Leite
8,
Pedro Higuchi
9,
Ana Carolina da Silva
9,
Gabriel Maximo da Silva
1,
Gina R. Cova
4 and
Carlos Alberto Silva
1
1
Forest Biometrics, Remote Sensing, and Artificial Intelligence Laboratory (Silva Lab)—School of Forest, Fisheries and Geomatics Sciences, University of Florida, P.O. Box 110410, Gainesville, FL 32611, USA
2
Quantitative Ecology, Conservation, and Remote Sensing Laboratory (Valle Lab)—School of Forest, Fisheries and Geomatics Sciences, University of Florida, P.O. Box 110410, Gainesville, FL 32611, USA
3
Plant Ecophysiology Laboratory (Ecophys Lab)—Agronomy Department, University of Florida, Gainesville, FL 32611, USA
4
School of Environmental and Forest Sciences, University of Washington, Seattle, WA 98195, USA
5
USDA Forest Service, Rocky Mountain Research Station, Moscow, ID 83843, USA
6
BIOFIX Research Center—Department of Forest Engineering, Federal University of Parana, Curitiba 80210-170, Brazil
7
Global Forest Dynamics Lab—School of Forest, Fisheries, and Geomatics Sciences, University of Florida, Gainesville, FL 32611, USA
8
NASA Goddard Space Flight Center Greenbelt, Greenbelt, MD 20771, USA
9
Forest Engineering Department, Santa Catarina State University, Av Luiz de Camões, 2090 Conta Dinheiro, Lages 88520-000, Brazil
*
Author to whom correspondence should be addressed.
Submission received: 1 November 2024 / Revised: 7 January 2025 / Accepted: 14 January 2025 / Published: 17 January 2025
(This article belongs to the Section Environmental Remote Sensing)

Abstract

:
Southern U.S. forests are essential for carbon storage and timber production but are increasingly impacted by natural disturbances, highlighting the need to understand their dynamics and recovery. Canopy cover is a key indicator of forest health and resilience. Advances in remote sensing, such as NASA’s GEDI spaceborne LiDAR, enable more precise mapping of canopy cover. Although GEDI provides accurate data, its limited spatial coverage restricts large-scale assessments. To address this, we combined GEDI with Synthetic Aperture Radar (SAR), and optical imagery (Sentinel-1 GRD and Landsat–Sentinel Harmonized (HLS)) data to create a comprehensive canopy cover map for Florida. Using a random forest algorithm, our model achieved an R2 of 0.69, RMSD of 0.17, and MD of 0.001, based on out-of-bag samples for internal validation. Geographic coordinates and the red spectral channel emerged as the most influential predictors. External validation with airborne laser scanning (ALS) data across three sites yielded an R2 of 0.70, RMSD of 0.29, and MD of −0.22, confirming the model’s accuracy and robustness in unseen areas. Statewide analysis showed lower canopy cover in southern versus northern Florida, with wetland forests exhibiting higher cover than upland sites. This study demonstrates the potential of integrating multiple remote sensing datasets to produce accurate vegetation maps, supporting forest management and sustainability efforts in Florida.

1. Introduction

Southern U.S. forests are among the most productive ecosystems globally in terms of biomass production and carbon sequestration, representing a significant portion of the U.S. terrestrial carbon sink [1]. In addition to supporting over 60% of the nation’s timber production [2], these forests harbor a highly diverse range of endemic species [3]. However, these forests are frequently exposed to natural disturbances, particularly hurricanes, posing significant threats to their structure, recovery processes, and sustainability [4,5,6]. Moreover, climate change is exacerbating these challenges by increasing the frequency and severity of hurricanes [7] as well as other disturbances such as wildfires and droughts [8,9,10], further amplifying their impact on forest dynamics. These disturbances can cause immediate structural damage and initiate long-term ecological shifts, emphasizing the urgent need for comprehensive research on forest recovery, resilience, and adaptation strategies [11].
Forest canopy cover serves as a valuable metric for evaluating the distribution and resistance of forest ecosystems [12,13,14]. It not only provides a means of evaluating immediate damage after disturbances but also offers critical insights into the recovery trajectories of forests over time [15]. For instance, Senf et al. [16] demonstrated that canopy cover can serve as an indicator for predicting the duration required for ecosystems to recover to their pre-disturbance conditions. Furthermore, integrating historical canopy cover data with records of extreme weather events allows researchers to develop predictive models that forecast the potential effects of future climatic disturbances [17]. Such predictive insights are essential for informing land use planning and management strategies, thereby enhancing the resilience of these vital natural habitats against ongoing climatic challenges [18].
Nevertheless, the effectiveness of such strategic efforts is currently hindered by the limited accuracy of existing canopy cover maps [19]. Canopy cover maps derived from optical sensors lack detailed information on the vertical structure of forest canopies [20]. Specifically, sensors such as Sentinel-2 provide comprehensive wall-to-wall visual coverage but fail to capture essential metrics such as canopy height and density, which are critical for understanding forest recovery and resilience [21]. While optical sensors deliver valuable spatial coverage, their limited precision restricts their usefulness for monitoring both current and future forest dynamics, particularly in the face of disturbances like hurricanes and wildfires [22,23,24].
Airborne and terrestrial light detection and ranging (LiDAR) sensors offer a promising approach to mapping forest structure, including canopy cover, forest height, and tree density [25,26]. However, despite its superior precision, the application of LiDAR is typically restricted to smaller-scale projects due to its high acquisition costs and the scarcity of ground-based (in situ) data needed for calibration and validation [20]. These constraints hinder its widespread use for regional forest monitoring, making it less accessible for continuous, wall-to-wall assessments of forest dynamics at broader scales.
Spaceborne LiDAR sensors, particularly NASA’s Global Ecosystem Dynamics Investigation (GEDI) mission, launched to the International Space Station (ISS) in 2018 [27,28], present new opportunities to monitor forest canopy cover changes over time [27]. GEDI addresses some of the limitations of traditional sensors by providing detailed vertical structure measurements at a global scale. This system, designed to capture forest vertical structure within a ~25 m diameter footprint, offers an innovative means of estimating canopy cover across diverse forest ecosystems [27,29,30].
Although GEDI provides global forest canopy cover estimates [27], it does not generate multi-temporal wall-to-wall maps of either forest structure or aboveground biomass density (AGBD) at the fine spatial resolution required to effectively support forest management and conservation in southern U.S. forests. To overcome this limitation, GEDI can be integrated with other sensors, such as optical Landsat and Sentinel Harmonized (HLS) [31,32] and synthetic aperture radar (SAR) data [30,33], like Sentinel 1 [34,35,36] to produce high-resolution, multi-temporal forest canopy cover maps [30]. This multi-sensor fusion approach has been successfully applied to map forest height [37,38] and AGBD [37,39] in other regions, but has not yet been used for mapping canopy cover, either in Florida or elsewhere.
Despite the significant impact of hurricanes on Southern U.S. forests, detailed studies examining how such disturbances affect forest canopy cover remain scarce. This knowledge gap highlights the urgent need for targeted research to understand the implications of these disturbances on canopy cover and to inform adaptive management strategies aimed at enhancing forests’ resilience and sustainability. By focusing on Florida—a state frequently impacted by hurricanes—this research will lay the groundwork to provide a comprehensive analysis of canopy cover dynamics, establishing a framework that can be applied to other forest ecosystems globally.
The primary objective of this study was to develop a statewide forest canopy cover map for Florida for the year 2021, leveraging a multi-sensor fusion approach using GEDI, Sentinel-1 GRD, and HLS datasets. This integrated framework enabled precise mapping, offering detailed insights into the distribution of canopy cover across different regions. The outcomes of this research provide a valuable tool for shaping local adaptive management strategies. For example, counties can implement policies to preserve urban tree cover, enhance reforestation efforts, or improve stormwater management practices to mitigate the effects of canopy loss [40]. By delivering accurate, up-to-date information on canopy cover, this study enables more informed decision-making to support the management and conservation of forest resources. As the southern USA faces escalating risks from larger and more intense hurricanes and other climate-induced disturbances, this research enhances our understanding of forest resilience and recovery patterns. Ultimately, this study will contribute to broader efforts aimed at ensuring the sustainability of southern U.S. forests in the face of ongoing climatic challenges, providing a framework that can be applied to both local and global forest conservation and management initiatives.

2. Materials and Methods

2.1. Study Area

Our study area included all forested areas in the state of Florida (Figure 1). Florida hosts a diverse array of forest types which are categorized by the Florida natural areas inventory (FNAI) into wetland and upland coniferous, hardwood, and mixed forests. This broad classification helps in assessing and managing the state’s diverse forest ecosystems [41]. Among conifers, prominent species include loblolly pine (Pinus taeda L.), longleaf pine (Pinus palustris Mill.), shortleaf pine (Pinus echinata Mill.), and slash pine (Pinus elliottii Engelm.), which collectively account for approximately 45% of the forested landscape [42]. About 42% of Florida’s forests are cultivated and managed for commercial timber production [2]. On the hardwood side, Florida supports a highly diverse range of species, including red maple (Acer rubrum L.), sweetgum (Liquidambar styraciflua), live oak (Quercus virginiana Mill.), turkey oak (Quercus laevis Walter), American beech (Fagus grandifolia), and pignut hickory (Carya glabra), among others [43].
Florida is characterized by a hot and humid climate, with temperatures often exceeding 32 °C for up to six months each year. The state also experiences high levels of relative humidity, typically exceeding 50%. Annual precipitation is among the highest in the nation, with an average annual rainfall of about 1524 mm. However, temporal precipitation patterns are highly variable, leading to frequent floods in some years and droughts in others [44]. Topography is shaped primarily by the processes of deposition and erosion, influenced by fluctuations in sea level and various erosion mechanisms. Known for its notably flat landscape, the state’s highest point is Britton Hill in Walton County, which reaches an elevation of only 105 m above sea level [45]. The composition of Florida’s soils varies widely across the state, with differences in sand, clay, and organic content, as well as the depth of soil overlying the limestone bedrock. These variations are crucial in determining the types of plant communities that thrive in different regions [45].

2.2. Remote Sensing Data Collection and Processing

2.2.1. NASA GEDI Data

The GEDI sensor provided the foundational estimates of canopy cover for this study. For this analysis, GEDI Level 2B (L2B) data [27] collected from 1 April to 30 April 2021, were used. This dataset was derived from waveforms at each 25 m diameter footprint and included the key biophysical metric of canopy cover, calculated from the directional gap probability profile of the L1B waveform [27]. To focus exclusively on forested areas, the Dynamic World dataset [46] was employed as a mask layer, isolating only tree-covered pixels, including wildland–urban interface areas. Additionally, to mitigate the influence of roads on the dataset, a buffer zone of 100 m on each side of the road was established using spatial data for all identified roads in Florida, obtained from the Florida Department of Transportation [47]. This precaution ensured the exclusion of areas affected by roads, preventing potential data distortion. Each selected footprint was filtered using a quality flag (=1) to ensure data integrity, retaining only measurements that met the highest quality standards. To further enhance the reliability of the measurements, we focused on footprints from nocturnal full-power beam transmissions, which are known to reduce atmospheric noise and interference [29].
To evaluate the spatial dependency between GEDI footprints, a semivariogram analysis was conducted to quantify the similarity of data points over increasing distances. A spatial lag distance of 1.24 km between footprints was determined as the range where spatial correlation diminished, ensuring an optimal distribution of the points [48]. This choice was based on the semivariogram curve, which showed a clear plateau at this distance, indicating minimal spatial correlation beyond this range. As a result, we selected footprints that were at least 1.24 km apart, because points closer than this distance would exhibit high similarity and thus provide less additional information. The selected canopy cover footprints were then grouped into 20 classes according to the percentage of canopy cover, ensuring a broad representation of canopy cover from low to high. Samples of footprints within each class were selected, thus capturing the full range of variability in canopy cover across the study area, rather than focusing on just a narrow subset (e.g., exclusively high or low canopy cover). After applying all filtering criteria, the final dataset consisted of approximately 9500 valid footprints, which spanned the entirety of the study area, ensuring comprehensive coverage for the analysis.

2.2.2. Airborne Laser (ALS) Scanning Data

ALS data used for external validation of the forest canopy cover maps were acquired from the United States Geological Survey (USGS) for the state of Florida [47]. This dataset was part of the comprehensive Florida Peninsula LiDAR collection and Hurricane Michael supplemental collection, with data collected between November 2018 and April 2020. The ALS dataset had a resolution of 2.5 feet (0.762 m), offering high-precision elevation data for the entire state of Florida.
Flight missions captured detailed point cloud data, with attributes such as flight dates and other metadata accessible through the Florida Geographic Information Office. These flight dates were crucial for selecting the parks used for validation (Apalachicola National Forest, Green Swamp Wilderness Preserve, and Babcock Ranch Preserve). Notably, these parks had ALS data collected from the end of 2019 until April 2020, aligning closely with the GEDI footprint collection season. This minimized the temporal discrepancies and allowed a more direct comparison between the data fusion model outputs and the ALS data, ensuring that differences in canopy metrics were not affected by changes in forest structure over time.
The canopy cover metric from ALS was computed using a height threshold of 1 m on a 30 m spatial grid, using the lidR package [49] according to Equation (1), aligning with the resolution of the canopy cover model. The ALS-derived canopy cover layer was then used to generate mosaics for each external validation site.
CCALS = (NFR_{HTS}/TFR) × 100
where CCALS is the ALS-derived canopy cover (%), NFR_{HTS} is the number of first returns above the 1 m height threshold (HTS = 1), and TFR is the total number of first returns.

2.2.3. Sentinel-1 GRD Data

Sentinel-1A, a C-band Synthetic Aperture Radar (SAR) satellite launched by the European Space Agency, is equipped with dual-polarization capabilities: vertically transmitted and vertically received (VV), and vertically transmitted and horizontally received (VH). The satellite operates at a center frequency of 5.405 GHz, providing a spatial resolution between 10 and 40 m, depending on the imaging mode. With a 12-day revisit cycle, Sentinel-1A offers consistent coverage, making it ideal for temporal analyses over large areas [50].
We utilized Sentinel-1 GRD (Ground Range Detected) data, a processed data product derived from Sentinel-1A. The GRD product is radiometrically calibrated, terrain-corrected, and projected onto a map coordinate system, providing backscatter coefficients (sigma0) that are consistent and reliable for analysis. Data from the Interferometric Wide (IW) swath mode with a spatial resolution of approximately 30 m (aligning closely with the GEDI footprint size) were sourced from the GEE platform [51]. The dataset spanned the period from 1 March to 30 April 2021, ensuring full temporal coverage during the study period.
Pre-processing of the GRD data on GEE included radiometric calibration, terrain correction, and conversion to backscatter coefficients. A seamless mosaic was produced by combining all 75 available Sentinel-1 images into a composite, capturing the entire area of interest without temporal gaps. Additionally, four vegetation indices were derived from Sentinel-1 GRD data, including the Radar Vegetation Index and Co-polarization Ratios 1, 2, and 3. These indices enhanced the analysis of vegetation structure and condition, contributing to the robustness of the study (Table 1).

2.2.4. Harmonized Landsat and Sentinel-2 Dataset

The NASA Harmonized Landsat and Sentinel-2 (HLS) project integrates surface reflectance (SR) data from the Operational Land Imager (OLI) aboard Landsat 8 and the Multi-Spectral Instrument (MSI) on Sentinel-2 [57]. By combining data from these two sensors, HLS provides global land observations every 2 to 3 days at a spatial resolution of 30 m, enhancing temporal and spatial monitoring capabilities. The HLS observations are processed using sophisticated algorithms that ensure seamless integration between the OLI and MSI sensors. These algorithms include atmospheric correction, cloud masking, spatial co-registration, and spectral normalization. The HLS dataset encompasses over 10 million km2, including data from Landsat 8 (since 2013), Landsat 9, and Sentinel-2 (since 2015) [58].
For this study, we used the HLSL30 data product with 30 m spatial resolution covering the period from 1 March to 30 April 2021. This time frame was selected to align with the GEDI footprint collection period, ensuring temporal consistency during the fusion process. To maintain data quality, we applied the Dynamic World filter to focus only on forested areas and applied additional cloud filtering to include only images with less than 30% cloud cover, using quality flags provided in the HLS data product to exclude cloud-contaminated pixels. Additionally, we addressed potential temporal misalignments by selecting the images closest in time to the GEDI acquisitions within the specified period, ensuring that the spectral data matched the conditions under which the GEDI data were collected.
We applied the Dynamic World filter to focus exclusively on forested areas, and road and water masks were applied to exclude non-forested regions and mitigate the potential influence of anthropogenic and aquatic features on our model results. These preprocessing steps ensured that the final dataset was both temporally and spatially aligned with the GEDI data and suitable for analysis.
Finally, we computed thirteen vegetation indices derived from the HLS dataset to further assess vegetation structure and forest health (see Table 2 for the complete list of indices).

2.2.5. Ancillary Imagery

Critical terrain metrics such as elevation, slope, and aspect were extracted from the NASADEM_HGTv001 dataset [72] and used in this study. NASADEM, an enhancement of earlier DEMs, utilizes refined Shuttle Radar Topography Mission (SRTM) data. The SRTM project was a collaborative effort involving NASA, the National Geospatial-Intelligence Agency (NGA), the German Aerospace Centre (DLR), and the Italian Space Agency (ASI). The reprocessing of original SRTM radar data incorporated advanced algorithms and additional elevation data from ASTER and ICESat’s Geoscience Laser Altimeter System (GLAS), significantly reducing data voids and improving accuracy [57,58]. NASADEM provides global elevation data with a resolution of 1 arc-second, covering all land areas between 60°N and 56°S latitude, representing approximately 80% of Earth’s land surface [72].
The inclusion of terrain metrics such as elevation, slope, and aspect help to account for the impact of topography on vegetation growth and spatial distribution, which is essential for understanding canopy composition and structure [73]. Although Florida’s elevation range is relatively narrow (0–105 m), even slight variations can influence vegetation patterns by affecting temperature, moisture availability, and atmospheric pressure [74]. For instance, coastal and near-coastal slopes, though they may be subtle, play a role in water drainage and retention, in turn affecting vegetation density and distribution [74,75]. Additionally, aspect influences sunlight exposure and moisture retention; north-facing slopes may retain slightly more moisture than south-facing ones even in low-elevation areas [76]. These metrics allowed the model to capture terrain-driven variations in canopy cover effectively. Geographic coordinates such as the longitude and latitude of each GEDI pixel were also used as predictor variables.

2.2.6. Spatial Transformations

For each studied variable, we calculated the maximum, minimum, mean, and standard deviation using a kernel-based spatial reclassification method within a 3 × 3 pixel moving window (Table 3). Additionally, we employed the Gray-Level Co-Occurrence Matrix (GLCM) to extract statistical features from the images’ grayscale values for each band and vegetation index, following methodologies established previously [77]. Incorporating the bands, vegetation indices, ancillary data, kernel moving window, and GLCM features, our dataset expanded to comprise a total of 256 predictor variables. These variables were then extracted and associated with the corresponding GEDI footprints to effectively model canopy cover. To ensure compatibility with GEDI geolocation uncertainty [78,79], all predictive metrics were calculated to a spatial resolution of 30 × 30 m. This resolution aligned with the Sentinel-1 and HLS datasets and minimized the potential impact of spatial mismatches when integrating multi-source data.

2.3. Canopy Cover Modeling and Assessment

Random forest (RF) [81] regression modeling was employed to create canopy cover models. RF modeling was chosen for its robustness, accuracy, and transferability, characteristics that are particularly beneficial for handling large and complex datasets typical of remote sensing data [82]. This approach allowed effective handling of non-linear relationships and interactions between variables. Initially, sampling was carried out, during which 1000 samples containing spatially independent GEDI-derived canopy cover estimates were selected as the response variables, and image-derived metrics were selected as predictor variables for model calibration. The dataset was then divided into training and testing subsets, with 70% of the data allocated for training and the remaining 30% used for testing. This split helped to maximize learning from the data while ensuring the model’s generalizability to unseen data (Figure 2).
An RF model was implemented using the RandomForestRegressor from the python sklearn.ensemble [83] library (version 1.6.0), configured with 250 trees to balance model complexity and computational efficiency. To ensure that only the most relevant features were included, the SelectFromModel method was applied to the feature set to rank the features based on their contribution to reducing the model’s impurity (i.e., how well a feature contributed to reducing variance or classification error). Features that met or exceeded the mean importance score were retained, reducing dimensionality and enhancing the model’s accuracy and interpretability. After selecting the most informative features, their importance scores were recorded for further analysis.
To evaluate model robustness and quantify uncertainty, bootstrapping was conducted with 100 iterations. In each iteration, the model was trained on the stratified resampled subset of the data, with replacement, allowing for variations in the sample while preserving overall model accuracy. The model’s performance was assessed using key metrics, including the coefficient of determination (R2), mean difference (MD), and root mean squared difference (RMSD), calculated for both absolute and relative terms (Equations (2)–(6)). These metrics provided a comprehensive understanding of the model’s accuracy and its capacity to predict canopy cover across the study area.
R 2 = i = 1 n y i y ¯ y ^ i y ^ ¯ 2 i = 1 n ( y i y ¯ ) 2 i = 1 n y ^ i y ^ ¯ 2
R M S D = i = 1 n y ^ i y i 2 n
R M S D % = R M S D y ^ × 100
M D = 1 n i = 1 n y ^ i y i
M D % = M D y ^
where n is the number of observations; yi and y ^ i are the observed and predicted values for canopy cover for the i-th observation; y ¯ and y ^ ¯ are the observed and predicted mean values across all observations.
After completing all the bootstrap iterations, variable importance analysis was conducted to evaluate each predictor’s contribution to the model. During each run, variables were assessed based on their ability to enhance model accuracy, quantified by the mean decrease in impurity, which quantified how effectively a variable split the data into more homogeneous groups, reducing uncertainty or disorder in the predictions [81]. The cumulative mean decrease in impurity was then calculated for each variable across all bootstrap iterations. Final importance scores were determined by evaluating how frequently each variable was selected during the iterations, weighted by its contribution to the model’s performance. These scores were converted to percentages, ranging from 0% to 100%, to allow easier comparison and ranking. This process highlighted the most significant predictors, ensuring that the final model emphasized the variables with the greatest predictive power, thus improving both the model’s generalizability and its accuracy in predicting canopy cover.

2.4. Wall-to-Wall Canopy Cover Mapping

The previously trained RF models were initially trained as regressors to predict continuous canopy cover values. For implementation on GEE [84], these models were adapted into a classifier format, enabling canopy cover predictions at a 30 × 30 m pixel resolution across Florida. Using 100 models generated through bootstrap iterations, this classifier was applied statewide on a 30 × 30 m grid, providing detailed spatial predictions of canopy presence and facilitating precise geospatial analysis within GEE. The decision to use 100 bootstrap iterations was based on findings from previous studies, such as Rosenfeld et al. [85], which demonstrated that 100 iterations strike an optimal balance between computational efficiency and model stability. Additionally, preliminary testing in this study confirmed that increasing the number of iterations beyond 100 did not lead to significant improvements in the stability or accuracy of the model’s outputs (e.g., R2, RMSD, and MD), further supporting the use of 100 iterations as a reliable and efficient choice.
The output maps from each of the 100 models were subsequently stacked to compute the mean and standard deviation across all models. This process resulted in the creation of a final upscaled map, representing the average predicted values across all 100 models, and a standard deviation map, capturing the variability or uncertainty in these predictions. These maps, as delineated by Equations (7) and (8), represent the spatial variability and consistency of the modeled data across Florida. The mean map illustrates the central tendencies of the modeled variables, while the standard deviation map identifies areas with greater uncertainty, indicating where the model’s predictions were less consistent.
x ^ i , j = 1 M   k = 1 M x i ,   j ,   k
where xi,j,k is the value at grid cell (i,j) in the k-th map; i, j are the indices representing the position of the cell in the grid; k is the index of the map (k = 1,…,M), one for each bootstrapped model. M is the total number of bootstrap maps.
σ i , j 1 99   k = 1 M ( x i , j , k x ^ i , j ) 2
where σ i , j is the average standard deviation value for cell (i,j); xi,j,k is the value at grid cell (i,j) in the k-th map; x ^ i , j is the mean value for the grid cell (i,j) across all bootstraps. The factor 99 in the denominator corresponds to the degree of freedom.

2.5. External Validation

To evaluate the performance of our model on an independent dataset, we selected three state parks in Florida for external validation: Apalachicola National Forest (ANF), Green Swamp Wilderness Preserve (GSWP), and Babcock Ranch Preserve (BRP) (Figure 1). These sites were chosen based on the availability of ALS data collected within a similar phenological season, ensuring consistency and avoiding strong discrepancies during validation. Additionally, the sites were selected for their inherent range of canopy cover, providing a robust test of the model’s ability to accurately predict areas of both lesser and greater canopy coverage. The sites are located in Tallahassee, Lake, and Punta Gorda counties, respectively. The ALS data (refer to Section 2.2.2) from these sites were employed for map validation (see Figure 2).
To ensure comprehensive validation, we first assessed the impact of including each park individually in the validation process and then evaluated the model’s performance across all three parks combined. For the validation process, which involved comparing our model-derived maps with the ALS ground-truth data, 2000 pixels measuring 30 × 30 m each were randomly chosen from each site and assessed on a one-to-one basis. This number of pixels was selected as a balance between ensuring robust statistical representation and addressing computational constraints, as processing larger pixel counts would have exceeded the platform limitations. To quantify the model’s precision, accuracy, and agreement with the observed data, we calculated key metrics, including the R2 and absolute and relative (%) RMSD and MD.
In addition to validating our model with ALS data, we calculated the mean canopy cover for each individual park (refer to Equations (9)–(11)). Specifically, we estimated the mean value x ^ of the predicted canopy cover, which served as the central tendency of the predictions for each park. Accompanying this mean, the 95% confidence interval provided the range within which the true mean canopy cover was expected to be found. To assess the variability in these predictions, we calculated the total variance V x ^ for each park. From this variance, the standard error (SE) was computed, providing a measure of the average deviation of the estimated means from the true population mean. Additionally, the relative standard error (rSE) was calculated, relating the standard error to the mean canopy cover, offering a relative measure of uncertainty. This was particularly useful for comparing the precision of the model across different parks. These error measurements provide a comprehensive evaluation of the reliability and consistency of the predictions in different geographic contexts, enabling a detailed assessment of the model’s accuracy and precision to ensure that it can be applied effectively throughout the state’s diverse environments.
x ^ = 1 N g = 1 N 1 M k = 1 M x k , g
where x ^ represents the mean value for each park, N is the total number of observations in each park, M is the total number of bootstrap maps, g refers to the position of the grid cell, x k , g is the value at position g in the k-th bootstrap map for the respective park.
S E = V x ^
where S E is the standard error for each park and V x ^ represents the variance.
r S E % = S E x ^ × 100

2.6. Characterization of Forest Canopy over Across the State of Florida

To calculate the canopy cover percentage for each forest type, we integrated the final average canopy cover map generated by our model with the FNAI vegetation classifications [41]. This analysis utilized the vegetation type data from the Florida Department of Environmental Protection (FDEP), obtained from their statewide land use and land cover inventory [86], allowing us to accurately assign vegetation type to the entire extension of the study area. This integration enabled the precise assignment of vegetation types across the entire study area, resulting in a detailed map illustrating canopy cover percentages by forest type. Additionally, we calculated both the mean and standard deviation of canopy cover for each forest type to further analyze the distribution and variability within these categories.
A Kruskal–Wallis test [87] was performed at the 95% confidence level to assess whether there were significant differences in canopy cover among the various forest types studied. The null hypothesis (H0) assumed that the median canopy cover values across the forest types are equal, while the alternative hypothesis (H1) suggested that at least one group’s median would be different, indicating a significant variation in canopy cover among the forest types. If significant differences were detected, Dunn’s post-hoc pairwise comparisons [88] with Bonferroni correction were applied to determine which specific forest types significantly differed from one another.
For our county characterization, we clipped the mean canopy cover map to the geographic boundaries of each county to calculate the mean canopy cover per county. To estimate the standard deviation for each county, we used the average standard deviation map (refer to Equation (8)), and by extracting the standard deviation values within each county’s boundaries, we computed the variance for each county using Equation (12). The standard deviation for each county was then obtained by taking the square root of the variance (Equation (13)). This process provides a measure of the uncertainty in the canopy cover predictions at the county level.
V c o u n t y = 1 N 1 i = 1 n ( σ i σ ^ ) 2
where V c o u n t y represents the variance in standard deviation per county, N is the number of extracted values of standard deviation within the county, σ i is the standard deviation for each pixel in the county, and σ ^ is the mean standard deviation across the county.
σ c o u n t y = V c o u n t y

3. Results

3.1. Model Performance Assessment

The mean R2 of the 100 aggregated models was 0.69, indicating that, on average, 69% of the variance in canopy cover was explained by the model. The RMSD values, reflecting the average prediction errors, were 0.17 (absolute) and 44% (relative) (Figure 3). The MD values were notably low at 0.001 (absolute) and 0.34% (relative), demonstrating satisfactory bias in the predictions.
The range of R2 across the models varied from 0.62 to 0.77, indicating moderate variations in model accuracy between iterations. RMSD values oscillated between 0.15 to 0.19 (absolute) and 40.2% to 49.8% (relative), highlighting the variability in error size relative to the actual canopy cover values. Additionally, the MD fluctuated from −0.025 to 0.029 (absolute) and ranged from −6.27% to 7.79% (relative), indicating changes in the direction and magnitude of the prediction bias across different model iterations.
Longitude emerged as the most influential predictor, contributing over 95% to the model importance in the RF models (Figure 4), and was used by the RF in 83 of the 100 models. Following closely, the red and red mean predictors showed significant influence, contributing to 55% of the model variance and being selected in 53 models. Latitude also had considerable impact with a 53% contribution, while red mean had 49% importance.
Lesser but still notable contributions came from green mean and swir2 min, each accounting for 31% of the model’s variability, and green min at 30%. Other predictors varied in their influence, with importance scores ranging from 29% to 10% (e.g., green, swir2 mean, evi min, red savg, and others). For clarity and ease of visualization, Figure 4 includes only those metrics that demonstrated more than 10% in the importance scoring. It is important to note that our approach to modeling canopy cover considered a total of 260 metrics (Table 1, Table 2 and Table 3), but through feature selection, only 50 variables were chosen for the final model.

3.2. Wall-to-Wall Canopy Cover Maps for Validation Sites

The final wall-to-wall canopy cover maps revealed distinct variations in canopy cover across the three validation parks. BRP exhibited a mean canopy cover of 20.76%, indicating relatively sparse vegetation in some areas. In contrast, GSWP showed a higher mean canopy cover of 43.07%, reflecting denser foliage. ANF also displayed a high mean canopy cover of 42.40% with a lower standard deviation of 9.10%, suggesting more uniform coverage. These maps enable a comprehensive evaluation of specific regions within these state parks, offering a detailed perspective beyond the limitations of relying solely on GEDI footprints (Figure 5).
When evaluating the mean canopy cover for each individual validation area, we found that BRP had the lowest mean canopy cover at 20.8%, while ANF and GSWP had higher estimates of canopy cover at 42.4% and 43.9%, respectively (Table 4). The validation revealed that the canopy cover estimates for BRP had the lowest precision, with the highest rSE value (87.9%). In contrast, the canopy maps for ANF and GSWP exhibited greater precision, with rSE of 48.3% and 46.7%, respectively. Overall, these results underscore the variability and precision of our canopy cover estimates, with GSWP showing the most reliable results among the evaluated sites.
As shown in Table 5, we found that using all three parks for validation yielded a higher R2 value (0.70), but it also resulted in a slight increase in errors compared with BRP, with an absolute RMSD of 0.29% and an MD of −0.22%. These results reveal that incorporating a broader range of canopy cover conditions—by including all three parks—allowed the model to explain more variability in the dataset, as indicated by the higher R2. However, the increased RMSD and MD indicate that this approach introduced greater complexity into the predictions, possibly due to the model having to account for more extreme or diverse canopy conditions. When the individual parks were examined, GSWP provided the best validation results, with an R2 of 0.64, an RMSD of 0.30%, and an MD of −0.24%. This park was the most homogeneous and had the highest canopy cover compared with the other two parks, suggesting that our model performed better in areas with high canopy cover than in areas with lower or mixed canopy cover. Conversely, ANF produced the poorest results among all three parks, with the lowest R2 value (0.57), and higher RMSD and MD at 0.30% and −0.24%, respectively. Overall, the validation results indicate underestimation in canopy cover, as evidenced by the negative MD values for all individual sites and the combined all-sites validation. When considering all sites for validation, the model underestimated canopy cover by an average of −0.22, or −36.21% relative to the actual values.

3.3. Spatial Characterization of Canopy Cover Across the State of Florida

Figure 6 presents the prediction and uncertainty of canopy cover across the state of Florida. Observations indicate a higher concentration of canopy cover in the northern regions of the state, with a noticeable decrease towards the south. Notably, areas with more canopy cover also exhibit higher uncertainty, as shown by the red and purple colors on the map. In contrast, the blue areas in the south represent regions with less canopy cover but higher uncertainty, while the yellow areas indicate places with lower uncertainty associated with their canopy cover predictions.
The canopy cover in the state ranges from as low as 2% to as high as 77%. Most of the state exhibits canopy cover between 20% and 30%, followed by a substantial portion with canopy cover ranging from 40% to 54%. Regarding the standard deviation across the 100 model iterations applied to Florida, values ranged from 0.35% to 37%. Higher standard deviations were associated with regions of dense canopy cover, indicating that the model estimated a wide range of canopy cover values for these areas. Figure 6b,c depict how longitude and latitude influence canopy cover for the state. Canopy cover increases from southern to northern latitudes. Forest canopy cover also generally increases longitudinally from east to west, except in areas with less land (from 88°W to 86°W) and regions with cloud occlusion.
Across Floridan forest types, wetland areas generally exhibit higher canopy cover compared with upland forest types. Specifically, wetland hardwood forests include the most canopy cover at approximately 46%, followed closely by mixed wetlands at 45% and wetland coniferous forests at 44%. In contrast, upland forest types display lower rates of canopy cover, with mixed uplands at 42%, upland hardwoods at 41%, and upland coniferous forests at 37% (Figure 7).
The Kruskal–Wallis test revealed a significant difference in canopy cover across all forest types (χ2 = 6212.3, df = 5, p < 2.2 × 10−16). For instance, results for the upland hardwood forest (UHF) and upland mixed forest (UMF) did not differ significantly from each other (p = 1.000, both labeled “a”), indicating similar canopy cover. Upland coniferous forest (UCF) included significantly less canopy cover compared with other upland forests (p < 0.001, labeled “b”). Among wetland types, wetland coniferous (WCF) and wetland mixed forest (WMF) showed no significant difference from each other (p = 0.126, both labeled “c”), but were significantly different from all upland types. Wetland hardwood forest (WHF) had the highest rate of canopy cover, and this result was significantly different from all other forest types (p < 0.001, labeled “d”).
Furthermore, the standard deviation of canopy cover indicated considerable variability within these forest types. Wetland hardwood forest, which had the highest canopy cover, also exhibited the highest standard deviation 10%. Other forest types, such as wetland coniferous, upland hardwood, and upland coniferous, show standard deviation values ranging from 9% to 9.7%, indicating relatively high variability. Conversely, mixed wetlands and mixed uplands had the lowest standard deviation values at 6.9% and 8.3%, respectively, suggesting more uniform canopy cover within these types.
Our analysis revealed substantial variation in canopy cover across counties. The counties of Baker, Holmes, and Union lead with the greatest area of average canopy cover, each at 43%; following closely are Bradford, Columbia, Dixie, and Lafayette counties, each with canopy cover of 42%; at the lower end, Broward, Palm Beach, Miami-Dade, and Martin counties have ≤12% canopy cover. Notably, these counties with lower canopy cover are predominantly located in the southern part of the state. The remaining counties exhibit canopy cover ranging from 39% to 14% (Figure 8c).
An observable trend indicates that higher canopy cover is concentrated in the northern, northeastern, and western parts of the state, while central regions show moderate coverage and southern regions have significantly lower rates of tree cover. Additionally, counties such as Monroe, Jefferson, and Miami-Dade exhibit high variability in canopy cover, suggesting diverse landscapes within these areas. In contrast, counties like Gilchrist and Suwannee show minimal variation, indicating more uniform canopy distribution.

4. Discussion

This paper focuses on developing and testing a data fusion framework that combines GEDI-derived canopy cover samples with continuous data from HLS optical imagery, Sentinel-1 GRD radar, and other ancillary data to create a high-resolution, wall-to-wall canopy cover map for the entire state of Florida. While previous research has used similar fusion approaches using GEDI to estimate biomass [29,30,33,89,90] and tree height [37,91,92], this is the first study to adapt such methods specifically for canopy cover, marking a significant advancement in the field. The novelty of this study lies not only in the application of this data fusion technique but also in the reliable, high-accuracy canopy cover maps it produces. Canopy cover is a critical metric for the early detection of forest changes, as it is often the first visible indicator of forest alteration that can be observed remotely. This is particularly relevant in Florida, where forests are regularly impacted by hurricanes, making it essential to monitor canopy cover dynamics over time.

4.1. Canopy Cover Model Performance and Prediction Assessment

The RF model developed and utilized in this study demonstrated robust performance in predicting canopy cover, with an average R2 of 0.69 and RMSD values ranging from 0.15 to 0.19. These results are consistent with those reported by [79] and represent an improvement over earlier studies using simpler regression models (e.g., linear regression) for similar approaches [30,93]. To date, linear regression models have been the most widely used methods for AGBD estimation; however, this approach does not effectively capture the complex nonlinear relationships between forest structure/AGB and remotely sensed data [94].
Numerous studies have indicated that algorithms based on decision trees, such as RF, can better estimate forest metrics such as AGB, height, and canopy cover [79,95,96,97,98,99]. The strength of the RF method lies in its ability to accommodate large datasets with numerous predictor variables and its robustness against overfitting, making it particularly well suited for remote sensing applications where complex interactions between variables are common [81]. In this study, the RF model effectively integrated data from diverse sources, including GEDI, HLS, Sentinel-1 GRD, and ancillary datasets, demonstrating its ability to capture the majority of the variability in canopy cover across Florida.
While the model exhibited strong performance overall, moderate variability was observed in key metrics across the bootstrap iterations. For instance, R2 values ranged from 0.62 to 0.77, reflecting differences in the model’s ability to explain variance in canopy cover across subsets of data. Similarly, RMSD values ranged from 0.15 to 0.19 (absolute) and 40.2% to 49.8% (relative), highlighting variability in error magnitude relative to actual canopy cover. These fluctuations are likely to have stemmed from the diverse environmental conditions across Florida, including abrupt changes in canopy cover in wetlands or coastal areas, for example, which present additional modeling challenges in the remote sensing realm.
Despite the challenges posed by abrupt changes in vegetation and environmental variability, the narrow ranges observed in the performance metrics indicate the model’s robustness and reliability in predicting canopy cover under varying conditions. For instance, the RF model demonstrated strong performance across Florida’s diverse environmental gradients, underscoring its ability to adapt to the state’s unique and heterogeneous forest ecosystems. While this suggests potential applicability to other regions with varied ecological characteristics, further analysis is needed to confirm its ability to generalize across different landscapes. Testing the model’s spatial transferability will be essential to determine its effectiveness in large-scale environmental monitoring and management beyond the study area.
The high R2 and low RMSD values suggest that our model captured the majority of the variability in canopy cover across Florida, as highlighted earlier. This has significant implications for forest management, conservation planning, and ecological research. For example, accurate canopy cover maps can help in monitoring deforestation, assessing habitat quality, and evaluating the effectiveness of conservation interventions. These results underscore the importance of using advanced methodologies that integrate diverse data sources to improve predictive accuracy.
Optical and radar data provide complementary information about forest structure and canopy cover [100,101,102], which can enhance model robustness. Comparing our results with findings from other studies that relied solely on single data sources [103,104], those studies reported lower accuracy and higher error margins, emphasizing the potential for integrating multiple data sources, such as GEDI, Sentinel-1, and HLS, to improve model performance. However, it is important to recognize that these studies focused on different forest types and target metrics, which may account for some of the observed differences in performance.
Our model identified several key predictors of canopy cover, including longitude, red mean, red, latitude, and red min. The importance of these predictors aligns with findings from other studies that have emphasized the relevance of geographic and spectral variables in canopy cover estimation [105]. Specifically, longitude explained more than 95% of model variation, being considered important in 83% of the models. This was higher than the importance scores for spectral indices alone, in agreement with previous reports [105].
The relationship between latitude and vegetation distribution has long been recognized, dating back to Von Humboldt’s observations between 1799 and 1804, and it is strongly related to climatic zones [106,107]. The spatial and temporal coordinates of data points are often neglected in ecological studies, but it has been argued [107] that these should be included as predictors when modeling the spatial distribution of organisms. Similarly, it was found [108] that incorporating coordinate metrics such as longitude and latitude enhanced model performance in predicting forest structure.
Longitude and latitude are associated with the spatial distribution of species [106,107] and are closely linked to climatic zones and soil types [109,110]. These geographic metrics help capture environmental gradients that influence canopy cover. For instance, in our study, the trend of canopy cover distribution from south to north revealed that lower rates of canopy cover were prevalent in southern counties, while more canopy cover was observed in northern counties. This spatial pattern probably explains why latitude was one of the most important metrics, accounting for more than 53% of the model’s variation.
While the longitudinal patterns are not immediately obvious when looking at Figure 6a, there are reasons why this metric appeared in more than 85 out of 100 models. Florida is a peninsula, meaning that its vegetation distribution is highly impacted by the distance from the ocean. Coastal areas are subjected to harsher conditions such as salt spray, higher winds, and frequent disturbances like hurricanes, which can limit tree growth and reduce overall canopy density [111,112]. Additionally, coastal regions are often more developed due to urbanization and tourism [113], further decreasing canopy cover. Conversely, inland areas generally experience more stable conditions that are conducive to denser and more extensive forest cover. Thus, longitude is likely to serve as a proxy for these varying environmental and anthropogenic influences across the state.
Unexpectedly, vegetation indices (e.g., NDVI, NDWI) were not the most important metrics; instead, coordinate metrics and spectral bands were. The red, red mean, and red min metrics emerged as significant predictors of canopy cover. Spectral indices, particularly those in the red band, are critical in remote sensing for assessing vegetation health and density. In particular, the red band is sensitive to chlorophyll content and can indicate the presence and condition of vegetation [114]. Studies have shown that red band reflectance is inversely related to vegetation health; healthier vegetation absorbs more red light due to a higher chlorophyll concentration [99,114]. This relationship makes the red band a valuable indicator for modeling canopy cover. The red mean metric captures the average reflectance in the red band, providing a measure of overall vegetation health across an area. Meanwhile, red min indicates the minimum reflectance value, highlighting areas with the densest and healthiest vegetation, which is crucial for accurate estimation of canopy cover. Consequently, these spectral bands might also directly reflect canopy characteristics more effectively than composite indices, leading to their higher importance in the model.
Our findings are consistent with those of other studies that have highlighted the significance of geographic and spectral variables in canopy cover modeling. For example, studies [108,109] have emphasized the role of geographic metrics such as longitude and latitude in predicting vegetation patterns. Jongman et al. [106] similarly reported that spectral bands, particularly in the red region, were essential for accurate estimation of canopy cover.
The variability in canopy cover estimation performance across validation sites can be linked to site-specific factors, including forest types and structural characteristics. For example, GSWP is dominated by wetland forests with high canopy cover and a relatively homogenous structure [115], while Babcock Ranch Preserve features a mix of pine flatwoods and cypress swamps, representing moderate canopy cover [116]. In contrast, ANF is ecologically diverse, with landscapes ranging from longleaf pine savannas to hardwood swamps, making it the most structurally heterogeneous among the sites [117,118].
Our model performed best at GSWP, where the homogenous vegetation and higher rate of canopy cover probably contributed to lower errors and higher R2 values. Conversely, ANF showed the poorest performance, with lower R2 values and higher errors. This result aligns with the ecological complexity of ANF, where diverse vegetation types and canopy structures pose greater challenges for accurate predictions. Additionally, ANF has been heavily impacted by disturbances such as hurricanes (e.g., Hurricane Michael in 2018), which caused significant canopy loss and altered forest composition, further complicating the model’s performance [118].
These insights not only confirm the validity of our approach but also emphasize the importance of site-specific considerations in canopy cover modeling, especially in regions with diverse forest types. The incorporation of external ALS data for validation further underscores the innovative nature of our study, as validating a statewide canopy cover map across such ecologically diverse areas is a novel and critical contribution to the field. This approach enhances confidence in the model’s generalizability while providing a roadmap for similar studies in other regions with complex forest landscapes.

4.2. Spatial Characterization of Canopy Cover

The spatial distribution of canopy cover across the state shows a clear trend from south to north. Southern counties, such as Miami-Dade and Monroe, generally have lower mean canopy cover percentages (11–17%), while northern counties exhibit higher canopy cover (35–43%). This gradient can be attributed to climatic and soil differences, with northern Florida having cooler, wetter conditions and more fertile soils conducive to dense forest growth. According to the Köppen climate classification [119], southern Florida’s climate is classified as Aw (equatorial with dry winter) and Am (equatorial with monsoon), while northern Florida’s climate is classified as Cfa (warm temperate, fully humid with hot summer), contributing to the differences in vegetation patterns.
In particular, Miami-Dade County is characterized by flat topography, sandy soils, a shallow groundwater table, and well-developed canal systems [120]. These factors contribute to the lower canopy cover observed in the region. The flat topography and sandy soils are less supportive of dense forest growth, while the shallow groundwater table and canal systems alter the natural hydrology, impacting vegetation patterns.
The variability of canopy cover within counties is particularly notable in Monroe, Jefferson, Miami-Dade, Madison, Palm Beach, and Martin, as indicated by high standard deviation values. This variability can be attributed to the heterogeneous landscapes in these counties. For instance, southern counties exhibit contrasting environments such as beach areas with little vegetation alongside sparse forests. In northern counties like Jefferson and Madison, high cloud cover in HLS images may have introduced noise, leading to greater variability in canopy cover estimates.
Land use practices and history of disturbance significantly influence canopy cover distribution [121,122,123]. Northern counties tend to have more stable forest ecosystems, partly due to experiencing fewer hurricanes compared with the southern regions, which are more frequently affected by these [124], impacting forest cover dynamics. Additionally, conservation practices vary across the state [125], with some regions implementing more effective land management strategies that promote canopy cover.
Urban development patterns also play a crucial role, as more densely populated areas typically exhibit lower rates of canopy cover due to land conversion for residential and commercial use [122]. In Florida, population distribution varies significantly across the state. According to data from the U.S. Census Bureau [126], southern counties such as Miami-Dade, Broward, and Palm Beach have significantly higher population densities compared with the more rural northern counties. Central Florida, including Orange County (home to Orlando) and surrounding counties like Seminole, Osceola, and Lake, has moderate population densities but is experiencing rapid growth due to urbanization.
Moreover, our model also indicates that wetland forests generally exhibit more canopy cover compared with upland forests. This higher rate of canopy cover in wetlands can be attributed to their unique ecological conditions, including abundant water supply and fertile soils [127], supporting dense vegetation growth. These areas are also protected by law and are often less disturbed by human activities [128], allowing for more mature forest development. On the other hand, upland forests may experience more significant water stress, particularly during dry seasons [129], which is a primary reason for their having less canopy cover. Additionally, their well-drained and less nutrient-dense soils further contribute to reduced vegetation density [130]. Upland forest types, such as longleaf pine savannas, are also adapted to regular fire regimes, which can periodically reduce canopy cover [110].
The spatial characterization of canopy cover across Florida underscores the value of having detailed, county-level canopy cover information. This granularity enables a more nuanced understanding of regional forest dynamics and the factors that drive canopy cover variability, such as climate, soil type, land use, and disturbance history. Accurate county-level data enables targeted conservation and management efforts, as stakeholders can tailor strategies to the specific ecological and socio-economic conditions of each region [131]. For example, identifying areas with less canopy cover in densely populated southern counties could inform urban greening initiatives, while understanding the higher canopy in wetland forests highlights areas of high conservation value. By providing insights into both broad trends and localized variations, county-level canopy cover information becomes a crucial tool for effective forest management, biodiversity preservation, and urban planning throughout the state. This approach aligns with the needs of forest managers, conservationists, and policymakers who require precise data to make informed decisions at both local and statewide levels.

4.3. Limitations and Future Work

The comparison of our model’s performance and the identified key predictors with the results from existing studies highlights the effectiveness of integrating GEDI, Sentinel-1 GRD, and HLS data. However, despite these strengths, our model tended to underestimate canopy cover, as indicated by the negative MD. In our case, the MD of −36.15% suggests that the model tended to predict lower rates of canopy cover than were observed in the ALS dataset. This underestimation is a common issue in remote sensing models, particularly those using RF [132].
This underestimation may have stemmed from a combination of factors. First, optical and radar sensors have certain limitations, one of which is saturation [97,133,134]. Specifically, spectral saturation occurs when spectral reflectance values are no longer sensitive to increases in biomass or leaf area index, resulting in low estimation accuracy [135]. This phenomenon has been demonstrated in several studies [136,137,138,139,140] and is reflected in our results. Despite considerable research efforts to address saturation in high-density vegetation, it remains a significant challenge in the remote sensing community [64,139,141]. However, studies have shown that the fusion of different types of sensors can significantly mitigate the saturation problem [139,142,143,144]. In our study, the integration of LiDAR, SAR, and optical data substantially improved the accuracy of our canopy cover estimates, suggesting that without this data fusion, the issue of saturation would have been much more pronounced. LiDAR, in particular, stands out because unlike optical sensors, it is not prone to saturation, enabling it to capture fine-scale details across a wide range of canopy densities [139]. This characteristic allowed LiDAR to effectively penetrate dense canopies, providing valuable structural information that enhanced the precision of our canopy cover assessments.
In addition to the challenges of underestimation and sensor limitations, several other potential sources of error may have influenced our canopy cover predictions. One important factor is geographical variability. Environmental differences between northern and southern Florida can affect model accuracy, particularly in areas with heterogeneous landscapes, such as southern counties, where abrupt, small-scale changes in canopy cover present added challenges. Another significant source of error is geolocation uncertainty. The geolocation accuracy of GEDI footprints is estimated at around 7 m at the 1-sigma level [92]. While this uncertainty was not directly modeled in our analysis, we attempted to reduce its impact by using a spatial resolution of 30 m. However, differences in resolution between ALS and GEDI data, along with our own canopy cover maps, also contributed to discrepancies during validation. Despite the use of a 30 m spatial resolution, variations in data collection and canopy cover computation between ALS and GEDI introduced potential inaccuracies, especially since the ALS data were processed with minimal modeling, making such differences more noticeable when compared with our maps.
Seasonal variations and phenological changes can further exacerbate these errors, as collecting data within a narrow time frame does not entirely eliminate the discrepancies between predicted and actual canopy cover. For example, even small differences in vegetation growth stages between the ALS and GEDI datasets could have influenced canopy measurements, as foliage density, leaf area, and canopy structure fluctuate seasonally. These phenological changes, particularly in rapidly growing or deciduous species, can cause significant variations in canopy cover between the datasets [145,146], making it difficult to align the model’s predictions with ground-truth observations. Additionally, cloud cover in some HLS images, particularly in northern counties, may have reduced the quality of the data and introduced noise, leading to greater variability in the model’s predictions. These combined factors underscore the complexity of accurately estimating canopy cover in dynamic landscapes. Differences in seasonal conditions, phenology, and data quality due to cloud interference can introduce additional sources of error, limiting the model’s ability to predict canopy cover with full precision.
While our model effectively integrates multiple data sources and performs well in explaining variance in canopy cover, it is crucial to address the identified sources of error to enhance accuracy. Future work should focus on refining data integration methods, particularly to better capture high-density canopy areas, and on mitigating the impacts of geolocation uncertainty, resolution differences, seasonal variations, and cloud cover. Despite our model’s competitive performance metrics, the observed underestimation bias underscores the need for these refinements in order to achieve greater accuracy and robustness in canopy cover prediction. Furthermore, exploring advanced machine learning techniques such as deep learning for improving prediction accuracy has shown promise in recent studies [147]. Incorporating temporal data could also provide insights into seasonal variations [148] and long-term trends in canopy cover, further enhancing the model’s utility for dynamic forest monitoring. By addressing these aspects, future models can achieve higher precision and provide more reliable data for forest management and conservation strategies.
Overall, our study contributes significantly to the field of ecological modeling by providing a robust framework for canopy cover estimation and highlighting the critical role of data fusion in enhancing the accuracy and utility of environmental models. By integrating GEDI-derived canopy cover samples with optical imagery, radar, and ancillary data, our framework not only enhances canopy cover mapping at finer resolutions but also offers a realistic pathway toward better forest management and decision-making. By applying our model, we can more accurately calculate the impacts of yearly hurricane events on Florida’s forests, assessing changes in vegetation and carbon stock over time.
Another key strength of this study lies in the rigorous external validation using ALS data, ensuring the robustness of the model and increasing its reliability and applicability to unseen areas. Additionally, by incorporating spatial predictors such as geographic coordinates alongside spectral and structural data, we have demonstrated how combining diverse remote sensing variables can significantly improve vegetation mapping. This methodology also highlights the importance of accounting for environmental gradients, which are particularly pronounced in Florida due to its unique geographic and ecological characteristics.
Florida’s forests are characterized by a remarkable diversity of ecosystems ranging from upland to wetland forests, presenting unique challenges and opportunities for forest monitoring. Our study provides valuable insights into the spatial distribution of canopy cover across these forest types, offering the potential to identify areas that are more or less resilient to disturbances such as hurricanes. These findings have practical implications for forest management, including prioritizing reforestation efforts and mitigating the impacts of canopy loss. Furthermore, the ability to map canopy cover annually establishes a baseline for monitoring hurricane-induced damage over time, contributing to adaptive management strategies and long-term forest sustainability.

5. Conclusions

In this research, we investigated the efficacy of a data fusion framework using LiDAR, SAR, and optical data to create a comprehensive, wall-to-wall map of canopy cover for the state of Florida. By incorporating GEDI, HLS, and Sentinel-1 GRD data, we successfully addressed the spatial resolution limitations of GEDI alone, achieving a detailed and extensive canopy cover map for Florida, which had previously been unavailable. This integration proved to be highly effective, marking a significant advancement in canopy cover mapping. Our findings demonstrate the effectiveness of using geographic coordinate metrics and spectral bands as key predictors for capturing environmental gradients and vegetation health, which are essential for accurate estimation of canopy cover. Notably, we observed distinct spatial patterns, with higher canopy cover in northern Florida and lower canopy cover in the southern regions. This variability reflects the diverse ecological conditions across the state and offers valuable insights for targeted conservation and forest management efforts. Identifying and addressing potential sources of error, such as geolocation uncertainty and resolution differences between datasets, will be critical for improving future models and enhancing their predictive accuracy. In summary, our model approach holds significant potential for the conservation of Florida’s forests. The framework can be adapted to predict canopy cover for different years, including post-hurricane scenarios or in the wake of other environmental disturbances like droughts. This capability makes it an effective tool for assessing forest area loss and planning conservation strategies in response to environmental changes.

Author Contributions

Conceptualization: M.B.S., C.A.S. and I.T.B.; methodology, M.B.S., C.A.S., I.T.B., and D.V.; software, M.B.S., M.A.K. and I.T.B.; validation, M.B.S. and K.M.B.; formal analysis, M.B.S. and I.T.B.; investigation, M.B.S., C.A.S. and I.T.B.; resources, C.A.S.; data curation, M.B.S. and C.A.S.; writing—original draft preparation, M.B.S.; writing—review and editing, M.B.S., C.A.S., I.T.B., S.J.P., D.V., R.V.L., A.T.H., W.M.H., K.D.R., J.X., M.A.K., C.K., P.H., A.C.d.S., G.M.d.S., K.M.B., and G.R.C.; visualization, M.B.S.; supervision, C.A.S. and I.T.B.; project administration, C.A.S.; funding acquisition, C.A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the intramural research program of the U.S. Department of Agriculture, National Institute of Food and Agriculture, Rapid Response to Extreme Weather Events Across Food and Agricultural Systems (Grant # 2023-68016-39039). We also acknowledge funding from NASA grants (ICESat-2, 80NSSC23K0941), Carbon Monitoring System (CMS, grant 80NSSC23K1257), and Commercial Smallsat Data Scientific Analysis (CSDSA, grant 80NSSC24K0055).

Data Availability Statement

ALS data are available for download from [47]; GEDI, Sentinel 1, and HLS data are available for download from [83].

Acknowledgments

We would like to thank the College of Agricultural and Life Sciences for awarding the first author a full PhD scholarship (CALS 2022-2026), and the School of Forest, Fisheries, and Geomatics Sciences, as well as the University of Florida Institute of Food and Agricultural Sciences (UF/IFAS), for providing essential support and infrastructure. We also extend our gratitude to the Forest Biometrics, Remote sensing and Artificial Intelligence Laboratory (Silva Lab) team members for their invaluable support and contributions throughout this work.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Pan, Y.; Birdsey, R.A.; Fang, J.; Houghton, R.; Kauppi, P.E.; Kurz, W.A.; Phillips, O.L.; Shvidenko, A.; Lewis, S.L.; Canadell, J.G.; et al. A Large and Persistent Carbon Sink in the World’s Forests. Science 2011, 333, 988–993. [Google Scholar] [CrossRef] [PubMed]
  2. Wear, D.N.; Greis, J.G. The Southern Forest Futures Project: Summary Report. General Technical Report-Southern Research Station; SRS-168; USDA Forest Service: Washington, DC, USA, 2012. [Google Scholar]
  3. Jenkins, C.N.; Van Houtan, K.S.; Pimm, S.L.; Sexton, J.O. US protected lands mismatch biodiversity priorities. Proc. Natl. Acad. Sci. USA 2015, 112, 5081–5086. [Google Scholar] [CrossRef]
  4. Chambers, J.Q.; Fisher, J.I.; Zeng, H.; Chapman, E.L.; Baker, D.B.; Hurtt, G.C. Hurricane Katrina’s Carbon Footprint on U.S. Gulf Coast Forests. Science 2007, 318, 1107. [Google Scholar] [CrossRef] [PubMed]
  5. Kim, D.; Millington, A.C.; Lafon, C.W. Disturbance after Disturbance: Combined Effects of Two Successive Hurricanes on Forest Community Structure. Ann. Am. Assoc. Geogr. 2020, 110, 571–585. [Google Scholar] [CrossRef]
  6. Gang, C.; Pan, S.; Tian, H.; Wang, Z.; Xu, R.; Bian, Z.; Pan, N.; Yao, Y.; Shi, H. Satellite observations of forest resilience to hurricanes along the northern Gulf of Mexico. For. Ecol. Manag. 2020, 472, 118243. [Google Scholar] [CrossRef]
  7. Balaguru, K.; Xu, W.; Chang, C.-C.; Leung, L.R.; Judi, D.R.; Hagos, S.M.; Wehner, M.F.; Kossin, J.P.; Ting, M. Increased U.S. coastal hurricane risk under climate change. Sci. Adv. 2023, 9, eadf0259. [Google Scholar] [CrossRef] [PubMed]
  8. Jones, M.W.; Smith, A.J.; Betts, R.; Canadell, J.G.; Prentice, I.C.; Le Quéré, C. Climate Change Increases the Risk of Wildfires: January 2020. ScienceBrief. Available online: https://ueaeprints.uea.ac.uk/id/eprint/77982/ (accessed on 18 July 2023).
  9. Pokhrel, Y.; Felfelani, F.; Satoh, Y.; Boulange, J.; Burek, P.; Gädeke, A.; Gerten, D.; Gosling, S.N.; Grillakis, M.; Gudmundsson, L.; et al. Global terrestrial water storage and drought severity under climate change. Nat. Clim. Change 2021, 11, 226–233. [Google Scholar] [CrossRef]
  10. Patacca, M.; Lindner, M.; Lucas-Borja, M.E.; Cordonnier, T.; Fidej, G.; Gardiner, B.; Hauf, Y.; Jasinevičius, G.; Labonne, S.; Linkevičius, E.; et al. Significant increase in natural disturbance impacts on European forests since 1950. Glob. Change Biol. 2023, 29, 1359–1376. [Google Scholar] [CrossRef] [PubMed]
  11. Lugo, A.E. Effects of Extreme Disturbance Events: From Ecesis to Social–Ecological–Technological Systems. Ecosystems 2020, 23, 1726–1747. [Google Scholar] [CrossRef]
  12. Pan, Y.; Birdsey, R.A.; Phillips, O.L.; Jackson, R.B. The Structure, Distribution, and Biomass of the World’s Forests. Annu. Rev. Ecol. Evol. Syst. 2013, 44, 593–622. [Google Scholar] [CrossRef]
  13. Nadkarni, N.M.; McIntosh, A.C.S.; Cushing, J.B. A framework to categorize forest structure concepts. For. Ecol. Manag. 2008, 256, 872–882. [Google Scholar] [CrossRef]
  14. Disney, M.; Lewis, P.; Saich, P. 3D modelling of forest canopy structure for remote sensing simulations in the optical and microwave domains. Remote Sens. Environ. 2006, 100, 114–132. [Google Scholar] [CrossRef]
  15. Leitold, V.; Morton, D.C.; Martinuzzi, S.; Paynter, I.; Uriarte, M.; Keller, M.; Ferraz, A.; Cook, B.D.; Corp, L.A.; González, G. Tracking the Rates and Mechanisms of Canopy Damage and Recovery Following Hurricane Maria Using Multitemporal Lidar Data. Ecosystems 2022, 25, 892–910. [Google Scholar] [CrossRef]
  16. Senf, C.; Seidl, R.; Poulter, B. Post-disturbance canopy recovery and the resilience of Europe’s forests. Glob. Ecol. Biogeogr. 2021, 31, 25–36. [Google Scholar] [CrossRef]
  17. Epstein, M.D.; Seielstad, C.A.; Moran, C.J. Impact and recovery of forest cover following wildfire in the Northern Rocky Mountains of the United States. Fire Ecol. 2024, 20, 56. [Google Scholar] [CrossRef]
  18. Turner, M.G.; Gardner, R.H. Landscape Ecology in Theory and Practice: Pattern and Process; Springer: New York, NY, USA, 2015. [Google Scholar] [CrossRef]
  19. Chen, G.; Singh, K.K.; Lopez, J.; Zhou, Y. Tree canopy cover and carbon density are different proxy indicators for assessing the relationship between forest structure and urban socio-ecological conditions. Ecol. Indic. 2020, 113, 106279. [Google Scholar] [CrossRef]
  20. Cui, L.; Jiao, Z.; Zhao, K.; Sun, M.; Dong, Y.; Yin, S.; Li, Y.; Chang, Y.; Guo, J.; Xie, R.; et al. Retrieval of Vertical Foliage Profile and Leaf Area Index Using Transmitted Energy Information Derived from ICESat GLAS Data. Remote Sens. 2020, 12, 2457. [Google Scholar] [CrossRef]
  21. Nasiri, V.; Deljouei, A.; Moradi, F.; Sadeghi, S.M.M.; Borz, S.A. Land Use and Land Cover Mapping Using Sentinel-2, Landsat-8 Satellite Images, and Google Earth Engine: A Comparison of Two Composition Methods. Remote Sens. 2022, 14, 1977. [Google Scholar] [CrossRef]
  22. Cohen, W.B.; Goward, S.N. Landsat’s Role in Ecological Applications of Remote Sensing. BioScience 2004, 54, 535–545. [Google Scholar] [CrossRef]
  23. Wulder, M.A.; White, J.C.; Goward, S.N.; Masek, J.G.; Irons, J.R.; Herold, M.; Cohen, W.B.; Loveland, T.R.; Woodcock, C.E. Landsat continuity: Issues and opportunities for land cover monitoring. Remote Sens. Environ. 2008, 112, 955–969. [Google Scholar] [CrossRef]
  24. Urbazaev, M.; Thiel, C.; Cremer, F.; Dubayah, R.; Migliavacca, M.; Reichstein, M.; Schmullius, C. Estimation of forest aboveground biomass and uncertainties by integration of field measurements, airborne LiDAR, and SAR and optical satellite data in Mexico. Carbon Balance Manag. 2018, 13, 5. [Google Scholar] [CrossRef] [PubMed]
  25. Cui, L.; Jiao, Z.; Dong, Y.; Sun, M.; Zhang, X.; Yin, S.; Ding, A.; Chang, Y.; Guo, J.; Xie, R. Estimating Forest Canopy Height Using MODIS BRDF Data Emphasizing Typical-Angle Reflectances. Remote Sens. 2019, 11, 2239. [Google Scholar] [CrossRef]
  26. Zhao, K.; Suarez, J.C.; Garcia, M.; Hu, T.; Wang, C.; Londo, A. Utility of multitemporal lidar for forest and carbon monitoring: Tree growth, biomass dynamics, and carbon flux. Remote Sens. Environ. 2018, 204, 883–897. [Google Scholar] [CrossRef]
  27. Dubayah, R.; Blair, J.B.; Goetz, S.; Fatoyinbo, L.; Hansen, M.; Healey, S.; Hofton, M.; Hurtt, G.; Kellner, J.; Luthcke, S.; et al. The Global Ecosystem Dynamics Investigation: High-resolution laser ranging of the Earth’s forests and topography. Sci. Remote Sens. 2020, 1, 100002. [Google Scholar] [CrossRef]
  28. Sun, M.; Cui, L.; Park, J.; García, M.; Zhou, Y.; Silva, C.A.; He, L.; Zhang, H.; Zhao, K. Evaluation of NASA’s GEDI lidar observations for estimating biomass in temperate and tropical forests. Forests 2022, 13, 1686. [Google Scholar] [CrossRef]
  29. Duncanson, L.; Neuenschwander, A.; Hancock, S.; Thomas, N.; Fatoyinbo, T.; Simard, M.; Silva, C.A.; Armston, J.; Luthcke, S.B.; Hofton, M.; et al. Biomass estimation from simulated GEDI, ICESat-2 and NISAR across environmental gradients in Sonoma County, California. Remote Sens. Environ. 2020, 242, 111779. [Google Scholar] [CrossRef]
  30. Silva, C.A.; Duncanson, L.; Hancock, S.; Neuenschwander, A.; Thomas, N.; Hofton, M.; Fatoyinbo, L.; Simard, M.; Marshak, C.Z.; Armston, J.; et al. Fusing simulated GEDI, ICESat-2 and NISAR data for regional aboveground biomass mapping. Remote Sens. Environ. 2021, 253, 112234. [Google Scholar] [CrossRef]
  31. USGS; NASA. Harmonized Landsat and Sentinel-2 (HLS) product. Available online: https://hls.gsfc.nasa.gov/ (accessed on 15 August 2024).
  32. Di Tommaso, S.; Wang, S.; Lobell, D.B. Combining GEDI and Sentinel-2 for wall-to-wall mapping of tall and short crops. Environ. Res. Lett. 2021, 16, 125002. [Google Scholar] [CrossRef]
  33. Shendryk, Y. Fusing GEDI with earth observation data for large area aboveground biomass mapping. Int. J. Appl. Earth Obs. Geoinf. 2022, 115, 103108. [Google Scholar] [CrossRef]
  34. European Space Agency. Sentinel-1 User Handbook; European Space Agency: Paris, France, 2012. [Google Scholar]
  35. Kaplan, G.G.; Avdan, U. SENTINEL-1 and SENTINEL-2 Data Fusion for Wetlands Mapping: Balikdami, Turkey. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. 2018, 42, 729–734. [Google Scholar] [CrossRef]
  36. Heckel, K.; Urban, M.; Schratz, P.; Mahecha, M.D.; Schmullius, C. Predicting forest cover in distinct ecosystems: The potential of multi-source Sentinel-1 and -2 data fusion. Remote Sens. 2020, 12, 302. [Google Scholar] [CrossRef]
  37. Tamiminia, H.; Salehi, B.; Mahdianpari, M.; Goulden, T. State-wide forest canopy height and aboveground biomass map for New York with 10 m resolution, integrating GEDI, Sentinel-1, and Sentinel-2 data. Ecol. Inform. 2024, 79, 102404. [Google Scholar] [CrossRef]
  38. Lang, N.; Jetz, W.; Schindler, K.; Wegner, J.D. A high-resolution canopy height model of the Earth. Nat. Ecol. Evol. 2023, 7, 1778–1789. [Google Scholar] [CrossRef]
  39. Guo, J.; Xu, Q.; Zeng, Y.; Liu, Z.; Zhu, X.X. Nationwide urban tree canopy mapping and coverage assessment in Brazil from high-resolution remote sensing images using deep learning. ISPRS J. Photogramm. Remote Sens. 2023, 198, 1–15. [Google Scholar] [CrossRef]
  40. Stovin, V.R.; Jorgensen, A.; Clayden, A. Street Trees and Stormwater Management. Arboric. J. 2008, 30, 297–310. [Google Scholar] [CrossRef]
  41. Florida Natural Areas Inventory. Guide to the Natural Communities of Florida. 2010. Available online: http://www.fnai.org (accessed on 18 September 2024).
  42. Coyle, D.R.; Klepzig, K.D.; Koch, F.H.; Morris, L.A.; Nowak, J.T.; Oak, S.W.; Otrosina, W.J.; Smith, W.D.; Gandhi, K.J.K. A review of southern pine decline in North America. For. Ecol. Manag. 2015, 349, 134–148. [Google Scholar] [CrossRef]
  43. Nelson, G. The Trees of Florida: A Reference and Field Guide; Pineapple Press Inc.: Sarasota, FL, USA, 2011; pp. 1–428. ISBN 978-1561644759. [Google Scholar]
  44. Black, R.J. Florida Climate Data; University of Florida Cooperative Extension Service, Institute of Food and Agriculture Sciences, EDIS: Gainesville, FL, USA, 1993. [Google Scholar]
  45. Main, M.B.; Allen, G.M. The Florida environment: An overview. EDIS 2007, 19, 1–4. [Google Scholar] [CrossRef]
  46. Venter, Z.S.; Barton, D.N.; Chakraborty, T.; Simensen, T.; Singh, G. Global 10 m land use land cover datasets: A comparison of dynamic world, world cover, and ESRI land cover. Remote Sens. 2022, 14, 4101. [Google Scholar] [CrossRef]
  47. Florida Geographic Information Office (FGIo). FL Peninsular Hx Michael Supplemental LiDAR. 2023. Available online: https://www.floridagio.gov/datasets/FGIO::fl-peninsular-hx-michael-supplemental-lidar-2020-1/about (accessed on 8 January 2024).
  48. Bohling, G. Introduction to Geostatistics and Variogram Analysis; Kansas Geological Survey: Lawrence, KS, USA, 2005. [Google Scholar]
  49. Roussel, J.-R.; Auty, D.; Coops, N.C.; Tompalski, P.; Goodbody, T.R.H.; Meador, A.S.; Bourdon, J.-F.; De Boissieu, F.; Achim, A. lidR: An R package for analysis of airborne laser scanning (ALS) data. Remote Sens. Environ. 2020, 251, 112061. [Google Scholar] [CrossRef]
  50. European Space Agency. Sentinel-1 Data Access and Products. Available online: https://sentinel.esa.int/web/sentinel/copernicus/sentinel-1 (accessed on 18 September 2024).
  51. Google Earth Engine. Sentinel-1 Algorithms. Available online: https://developers.google.com/earth-engine/guides/sentinel1 (accessed on 24 October 2024).
  52. Haas, J.; Ban, Y. Sentinel-1A SAR and Sentinel-2A MSI data fusion for urban ecosystem service mapping. Remote Sens. Appl. Soc. Environ. 2017, 8, 41–53. [Google Scholar] [CrossRef]
  53. Agapiou, A. Estimating proportion of vegetation cover at the vicinity of archaeological sites using Sentinel-1 and -2 data, supplemented by crowdsourced OpenStreetMap geodata. Appl. Sci. 2020, 10, 4764. [Google Scholar] [CrossRef]
  54. Soudani, K.; Delpierre, N.; Berveiller, D.; Hmimina, G.; Vincent, G.; Morfin, A.; Dufrêne, E. Potential of C-band synthetic aperture radar Sentinel-1 time-series for the monitoring of phenological cycles in a deciduous forest. Int. J. Appl. Earth Obs. Geoinf. 2021, 104, 102505. [Google Scholar] [CrossRef]
  55. Nasirzadehdizaji, R.; Sanli, F.B.; Abdikan, S.; Cakir, Z.; Sekertekin, A.; Ustuner, M. Sensitivity analysis of multi-temporal Sentinel-1 SAR parameters to crop height and canopy coverage. Appl. Sci. 2019, 9, 655. [Google Scholar] [CrossRef]
  56. Schlund, M.; Erasmi, S. Sentinel-1 time series data for monitoring the phenology of winter wheat. Remote Sens. Environ. 2020, 246, 111814. [Google Scholar] [CrossRef]
  57. Claverie, M.; Ju, J.; Masek, J.G.; Dungan, J.L.; Vermote, E.F.; Roger, J.-C.; Skakun, S.V.; Justice, C. The harmonized Landsat and Sentinel-2 surface reflectance data set. Remote Sens. Environ. 2018, 219, 145–161. [Google Scholar] [CrossRef]
  58. Claverie, M.; Masek, J.G.; Ju, J.; Dungan, J.L. Harmonized Landsat-8 Sentinel-2 (HLS) Product User’s Guide; National Aeronautics and Space Administration (NASA): Washington, DC, USA, 2017. [Google Scholar]
  59. Rouse, J.W.; Haas, R.H.; Schell, J.A.; Deering, D.W. Monitoring vegetation systems in the Great Plains with ERTS. NASA Spec. Publ. 1974, 351, 309. [Google Scholar]
  60. Huete, A.; Didan, K.; Miura, T.; Rodriguez, E.P.; Gao, X.; Ferreira, L.G. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 2002, 83, 195–213. [Google Scholar] [CrossRef]
  61. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295–309. [Google Scholar] [CrossRef]
  62. Qi, J.; Chehbouni, A.; Huete, A.R.; Kerr, Y.H.; Sorooshian, S. A modified soil adjusted vegetation index. Remote Sens. Environ. 1994, 48, 119–126. [Google Scholar] [CrossRef]
  63. Zhang, X.; Shang, K.; Cen, Y.; Shuai, T.; Sun, Y. Estimating ecological indicators of karst rocky desertification by linear spectral unmixing method. Int. J. Appl. Earth Obs. Geoinf. 2014, 31, 86–94. [Google Scholar] [CrossRef]
  64. Tucker, C.J. Red and photographic infrared linear combinations for monitoring vegetation. Remote Sens. Environ. 1979, 8, 127–150. [Google Scholar] [CrossRef]
  65. Ji, L.; Zhang, L.; Wylie, B. Analysis of dynamic thresholds for the normalized difference water index. Photogramm. Eng. Remote Sens. 2009, 75, 1307–1317. [Google Scholar] [CrossRef]
  66. Bausch, W.C.; Halvorson, A.D.; Cipra, J. Quickbird satellite and ground-based multispectral data correlations with agronomic parameters of irrigated maize grown in small plots. Biosyst. Eng. 2008, 101, 306–315. [Google Scholar] [CrossRef]
  67. Gitelson, A.A. Wide dynamic range vegetation index for remote quantification of biophysical characteristics of vegetation. J. Plant Physiol. 2004, 161, 165–173. [Google Scholar] [CrossRef] [PubMed]
  68. Ceccato, P.; Gobron, N.; Flasse, S.; Pinty, B.; Tarantola, S. Designing a spectral index to estimate vegetation water content from remote sensing data: Part 1. Remote Sens. Environ. 2002, 82, 188–197. [Google Scholar] [CrossRef]
  69. Vincini, M.; Frazzi, E.; D’alessio, P. A broad-band leaf chlorophyll vegetation index at the canopy scale. Precis. Agric. 2008, 9, 303–319. [Google Scholar] [CrossRef]
  70. Drury, S.A. Image Interpretation in Geology; Allen & Unwin: London, UK, 1987. [Google Scholar]
  71. Camps-Valls, G.; Campos-Taberner, M.; Moreno-Martínez, A.; Walther, S.; Duveiller, G.; Cescatti, A.; Mahecha, M.D.; Muñoz-Marí, J.; García-Haro, F.J.; Guanter, L.; et al. A unified vegetation index for quantifying the terrestrial biosphere. Sci. Adv. 2021, 7, eabc7447. [Google Scholar] [CrossRef]
  72. NASA JPL. NASADEM Merged DEM Global 1 Arc Second V001. 2020. Available online: https://lpdaac.usgs.gov/products/nasadem_hgtv001/ (accessed on 24 September 2024).
  73. Li, Y.; Zeng, H.; Xiong, J.; Miao, G. Influence of topography on UAV LiDAR-based LAI estimation in subtropical mountainous secondary broadleaf forests. Forests 2023, 15, 17. [Google Scholar] [CrossRef]
  74. Slocum, M.G.; Platt, W.J.; Beckage, B.; Panko, B.; Lushine, J.B. Decoupling natural and anthropogenic fire regimes: A case study in Everglades National Park, Florida. Nat. Areas J. 2007, 27, 41–55. [Google Scholar] [CrossRef]
  75. Fipps, G.; Skaggs, R.W. Influence of slope on subsurface drainage of hillsides. Water Resour. Res. 1989, 25, 1717–1726. [Google Scholar] [CrossRef]
  76. Hylander, K. Aspect modifies the magnitude of edge effects on bryophyte growth in boreal forests. J. Appl. Ecol. 2005, 42, 518–525. [Google Scholar] [CrossRef]
  77. Haralick, R.M.; Shanmugam, K.; Dinstein, I.H. Textural features for image classification. IEEE Trans. Syst. Man Cybern. 1973, SMC-3, 610–621. [Google Scholar] [CrossRef]
  78. Xu, Y.; Ding, S.; Chen, P.; Tang, H.; Ren, H.; Huang, H. Horizontal geolocation error evaluation and correction on full-waveform LiDAR footprints via waveform matching. Remote Sens. 2023, 15, 776. [Google Scholar] [CrossRef]
  79. Jin, S.; Su, Y.; Gao, S.; Hu, T.; Liu, J.; Guo, Q. The transferability of Random Forest in canopy height estimation from multi-source remote sensing data. Remote Sens. 2018, 10, 1183. [Google Scholar] [CrossRef]
  80. Wang, X.; Kruger, U.; Irwin, G.W. Process monitoring approach using fast moving window PCA. Ind. Eng. Chem. Res. 2005, 44, 5691–5702. [Google Scholar] [CrossRef]
  81. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  82. Gao, X.; Wen, J.; Zhang, C. An improved random forest algorithm for predicting employee turnover. Math. Probl. Eng. 2019, 2019, 4140707. [Google Scholar] [CrossRef]
  83. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  84. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  85. Rosenfeld, J.P.; Ward, A.; Meijer, E.H.; Yukhnenko, D. Bootstrapping the P300 in diagnostic psychophysiology: How many iterations are needed? Psychophysiology 2017, 54, 366–373. [Google Scholar] [CrossRef] [PubMed]
  86. Florida Department of Environmental Protection (FDEP). Statewide Land Use and Land Cover of Florida. Available online: https://geodata.dep.state.fl.us/datasets/statewide-land-use-land-cover (accessed on 1 March 2024).
  87. Kruskal, W.H.; Wallis, W.A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 1952, 47, 583–621. [Google Scholar] [CrossRef]
  88. Dunn, O.J. Multiple comparisons using rank sums. Technometrics 1964, 6, 241–252. [Google Scholar] [CrossRef]
  89. Chen, L.; Ren, C.; Bao, G.; Zhang, B.; Wang, Z.; Liu, M.; Man, W.; Liu, J. Improved object-based estimation of forest aboveground biomass by integrating LiDAR data from GEDI and ICESat-2 with multi-sensor images in a heterogeneous mountainous region. Remote Sens. 2022, 14, 2743. [Google Scholar] [CrossRef]
  90. Francini, S.; D’amico, G.; Vangi, E.; Borghi, C.; Chirici, G. Integrating GEDI and Landsat: Spaceborne LiDAR and four decades of optical imagery for the analysis of forest disturbances and biomass changes in Italy. Sensors 2022, 22, 2015. [Google Scholar] [CrossRef]
  91. Wang, C.; Elmore, A.J.; Numata, I.; Cochrane, M.A.; Lei, S.; Hakkenberg, C.R.; Li, Y.; Zhao, Y.; Tian, Y. A framework for improving wall-to-wall canopy height mapping by integrating GEDI LiDAR. Remote Sens. 2022, 14, 3618. [Google Scholar] [CrossRef]
  92. Qi, W.; Lee, S.-K.; Hancock, S.; Luthcke, S.; Tang, H.; Armston, J.; Dubayah, R. Improved forest height estimation by fusion of simulated GEDI LiDAR data and TanDEM-X InSAR data. Remote Sens. Environ. 2019, 221, 621–634. [Google Scholar] [CrossRef]
  93. Wu, X.; Shen, X.; Cao, L.; Wang, G.; Cao, F. Assessment of individual tree detection and canopy cover estimation using unmanned aerial vehicle-based light detection and ranging (UAV-LiDAR) data in planted forests. Remote Sens. 2019, 11, 908. [Google Scholar] [CrossRef]
  94. Li, Y.; Li, M.; Li, C.; Liu, Z. Forest aboveground biomass estimation using Landsat 8 and Sentinel-1A data with machine learning algorithms. Sci. Rep. 2020, 10, 9952. [Google Scholar] [CrossRef]
  95. Ahmed, O.S.; Franklin, S.E.; Wulder, M.A.; White, J.C. Characterizing stand-level forest canopy cover and height using Landsat time series, samples of airborne LiDAR, and the Random Forest algorithm. ISPRS J. Photogramm. Remote Sens. 2015, 101, 89–101. [Google Scholar] [CrossRef]
  96. Blackard, J.A.; Finco, M.V.; Helmer, E.H.; Holden, G.R.; Hoppus, M.L.; Jacobs, D.M.; Lister, A.J.; Moisen, G.G.; Nelson, M.D.; Riemann, R.; et al. Mapping U.S. forest biomass using nationwide forest inventory data and moderate resolution information. Remote Sens. Environ. 2008, 112, 1658–1677. [Google Scholar] [CrossRef]
  97. Carreiras, J.M.; Vasconcelos, M.J.; Lucas, R.M. Understanding the relationship between aboveground biomass and ALOS PALSAR data in the forests of Guinea-Bissau (West Africa). Remote Sens. Environ. 2012, 121, 426–442. [Google Scholar] [CrossRef]
  98. Fagua, J.C.; Jantz, P.; Rodriguez-Buritica, S.; Duncanson, L.; Goetz, S.J. Integrating LiDAR, multispectral and SAR data to estimate and map canopy height in tropical forests. Remote Sens. 2019, 11, 2697. [Google Scholar] [CrossRef]
  99. Zhang, N.; Chen, M.; Yang, F.; Yang, C.; Yang, P.; Gao, Y.; Shang, Y.; Peng, D. Forest height mapping using feature selection and machine learning by integrating multi-source satellite data in Baoding City, North China. Remote Sens. 2022, 14, 4434. [Google Scholar] [CrossRef]
  100. Joshi, N.; Mitchard, E.T.; Woo, N.; Torres, J.; Moll-Rocek, J.; Ehammer, A.; Collins, M.; Jepsen, M.R.; Fensholt, R. Mapping dynamics of deforestation and forest degradation in tropical forests using radar satellite data. Environ. Res. Lett. 2015, 10, 034014. [Google Scholar] [CrossRef]
  101. Camarretta, N.; Harrison, P.A.; Bailey, T.; Potts, B.; Lucieer, A.; Davidson, N.; Hunt, M. Monitoring forest structure to guide adaptive management of forest restoration: A review of remote sensing approaches. New For. 2020, 51, 573–596. [Google Scholar] [CrossRef]
  102. Ygorra, B.; Frappart, F.; Wigneron, J.; Moisy, C.; Catry, T.; Baup, F.; Hamunyela, E.; Riazanoff, S. Monitoring loss of tropical forest cover from Sentinel-1 time-series: A CuSum-based approach. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102532. [Google Scholar] [CrossRef]
  103. Asner, G.P.; Mascaro, J.; Muller-Landau, H.C.; Vieilledent, G.; Vaudry, R.; Rasamoelina, M.; Hall, J.S.; van Breugel, M. A universal airborne LiDAR approach for tropical forest carbon mapping. Oecologia 2012, 168, 1147–1160. [Google Scholar] [CrossRef] [PubMed]
  104. Saatchi, S.S.; Harris, N.L.; Brown, S.; Lefsky, M.; Mitchard, E.T.A.; Salas, W.; Zutta, B.R.; Buermann, W.; Lewis, S.L.; Hagen, S.; et al. Benchmark map of forest carbon stocks in tropical regions across three continents. Proc. Natl. Acad. Sci. USA 2011, 108, 9899–9904. [Google Scholar] [CrossRef]
  105. Hadi; Korhonen, L.; Hovi, A.; Rönnholm, P.; Rautiainen, M. The accuracy of large-area forest canopy cover estimation using Landsat in boreal region. Int. J. Appl. Earth Obs. Geoinf. 2016, 53, 118–127. [Google Scholar] [CrossRef]
  106. Jongman, R.H.G.; Ter Braak, C.J.; Van Tongeren, O.F. Data Analysis in Community and Landscape Ecology; Cambridge University Press: Cambridge, UK, 1995. [Google Scholar]
  107. Legendre, P.; Fortin, M.J. Spatial pattern and ecological analysis. Vegetatio 1989, 80, 107–138. [Google Scholar] [CrossRef]
  108. Silveira, E.M.; Radeloff, V.C.; Martinuzzi, S.; Pastur, G.J.M.; Bono, J.; Politi, N.; Lizarraga, L.; Rivera, L.O.; Ciuffoli, L.; Rosas, Y.M.; et al. Nationwide native forest structure maps for Argentina based on forest inventory data, SAR Sentinel-1 and vegetation metrics from Sentinel-2 imagery. Remote Sens. Environ. 2023, 285, 113391. [Google Scholar] [CrossRef]
  109. Hengl, T.; De Jesus, J.M.; Heuvelink, G.B.M.; Gonzalez, M.R.; Kilibarda, M.; Blagotić, A.; Shangguan, W.; Wright, M.N.; Geng, X.; Bauer-Marschallinger, B.; et al. SoilGrids250m: Global gridded soil information based on machine learning. PLoS ONE 2017, 12, e0169748. [Google Scholar] [CrossRef]
  110. Kruckeberg, A.R. An essay: The stimulus of unusual geologies for plant speciation. Syst. Bot. 1986, 11, 455–463. [Google Scholar] [CrossRef]
  111. Ewel, J.J.; Myers, R.L. Ecosystems of Florida; University of Central Florida Press: Orlando, FL, USA, 1990. [Google Scholar]
  112. Whittaker, R.H. Gradient analysis of vegetation. Biol. Rev. 1967, 42, 207–264. [Google Scholar] [CrossRef] [PubMed]
  113. Petrişor, A.-I.; Hamma, W.; Nguyen, H.D.; Randazzo, G.; Muzirafuti, A.; Stan, M.-I.; Tran, V.T.; Aştefănoaiei, R.; Bui, Q.-T.; Vintilă, D.-F.; et al. Degradation of coastlines under the pressure of urbanization and tourism: Evidence on the change of land systems from Europe, Asia and Africa. Land 2020, 9, 275. [Google Scholar] [CrossRef]
  114. Hunt, E.R., Jr.; Daughtry, C.S.T.; Eitel, J.U.H.; Long, D.S. Remote sensing leaf chlorophyll content using a visible band index. Agron. J. 2011, 103, 1090–1099. [Google Scholar] [CrossRef]
  115. Nordheim-Shelt, B. Effects of Anthropogenic Activity on the Green Swamp Preserve Ecosystem (Order No. 10259534). Dissertations & Theses A&I. 2017. Available online: https://login.lp.hscl.ufl.edu/login?url=https://www.proquest.com/dissertations-theses/effects-anthropogenic-activity-on-green-swamp/docview/1894204899/se-2 (accessed on 1 October 2024).
  116. Florida Fish and Wildlife & Conservation Commission. Babcock Ranch Preserve. 2022. Available online: https://myfwc.com/recreation/cooperative/babcock-ranch-preserve/ (accessed on 1 October 2024).
  117. Trager, M.D.; Drake, J.B.; Jenkins, A.M.; Petrick, C.J. Mapping and Modeling Ecological Conditions of Longleaf Pine Habitats in the Apalachicola National Forest. J. For. 2018, 116, 304–311. [Google Scholar] [CrossRef]
  118. Zampieri, N.E.; Pau, S.; Okamoto, D.K. The impact of Hurricane Michael on longleaf pine habitats in Florida. Sci. Rep. 2020, 10, 8483. [Google Scholar] [CrossRef] [PubMed]
  119. Kourtzanidis, M.; Grieser, J.; Beck, C.; Rudolf, B.; Rubel, F. World map of the Köppen-Geiger climate classification updated. Meteorol. Z. 2006, 15, 259–263. [Google Scholar]
  120. Yan, J.; Smith, K.R. Simulation of integrated surface water and ground water systems—Model formulation 1. J. Am. Water Resour. Assoc. 1994, 30, 879–890. [Google Scholar] [CrossRef]
  121. Bonilla-Moheno, M.; Aide, T.M.; Clark, M.L. The influence of socioeconomic, environmental, and demographic factors on municipality-scale land-cover change in Mexico. Reg. Environ. Change 2012, 12, 543–557. [Google Scholar] [CrossRef]
  122. Fox, J.; Vogler, J.B. Land-use and land-cover change in montane mainland Southeast Asia. Environ. Manag. 2005, 36, 394–403. [Google Scholar] [CrossRef] [PubMed]
  123. Shrestha, S.; Shrestha, U.B.; Bawa, K. Socio-economic factors and management regimes as drivers of tree cover change in Nepal. PeerJ 2018, 6, e4855. [Google Scholar] [CrossRef]
  124. Orr, D.W.; Ogden, J.C. The impact of Hurricane Andrew on the ecosystems of South Florida. Conserv. Biol. 1992, 6, 488–490. Available online: https://www.jstor.org/stable/2386357 (accessed on 18 August 2024).
  125. Pannozzo, P. Florida Local Government Conservation Planning: Variability, Drivers, and Policy Implications. Doctoral Dissertation, University of Central Florida, Orlando, FL, USA, 2013. [Google Scholar]
  126. U.S. Census Bureau. Population Density Data for U.S. Counties. Available online: https://data.census.gov/ (accessed on 1 October 2024).
  127. Carse, G.B.; Foss, J.H. Florida: Its Climate, Soil, Productions, and Agricultural Capabilities; BoD–Books on Demand: Norderstedt, Germany, 2023. [Google Scholar]
  128. Gramling, G.F., III. Wetland regulation and wildlife habitat protection: Proposals for Florida. Harv. Envtl. L. Rev. 1984, 8, 365. [Google Scholar]
  129. Oren, R.; Zimmermann, R.; Terbough, J. Transpiration in Upper Amazonia floodplain and upland forests in response to drought-breaking rains. Ecology 1996, 77, 968–973. [Google Scholar] [CrossRef]
  130. National Park Service. Upland Vegetation Zone. Natl. Park Serv. 2015. Available online: https://www.nps.gov/biso/learn/nature/uplandplants.htm (accessed on 10 November 2024).
  131. Cordell, S.; Questad, E.J.; Asner, G.P.; Kinney, K.M.; Thaxton, J.M.; Uowolo, A.; Brooks, S.; Chynoweth, M.W. Remote sensing for restoration planning: How the big picture can inform stakeholders. Restor. Ecol. 2017, 25, S147–S154. [Google Scholar] [CrossRef]
  132. Breidenbach, J.; Ivanovs, J.; Kangas, A.; Nord-Larsen, T.; Nilsson, M.; Astrup, R. Improving living biomass C-stock loss estimates by combining optical satellite, airborne laser scanning, and NFI data. Can. J. For. Res. 2021, 51, 1472–1485. [Google Scholar] [CrossRef]
  133. Kumar, L.; Mutanga, O. Remote sensing of above-ground biomass. Remote Sens. 2017, 9, 935. [Google Scholar] [CrossRef]
  134. Lu, D.; Chen, Q.; Wang, G.; Liu, L.; Li, G.; Moran, E. A survey of remote sensing-based aboveground biomass estimation methods in forest ecosystems. Int. J. Digit. Earth 2016, 9, 63–105. [Google Scholar] [CrossRef]
  135. Zhao, P.; Lu, D.; Wang, G.; Wu, C.; Huang, Y.; Yu, S. Examining spectral reflectance saturation in Landsat imagery and corresponding solutions to improve forest aboveground biomass estimation. Remote Sens. 2016, 8, 469. [Google Scholar] [CrossRef]
  136. Askar, N.; Nuthammachot, N.; Phairuang, W.; Wicaksono, P.; Sayektiningsih, T. Estimating aboveground biomass on private forest using Sentinel-2 imagery. J. Sens. 2018, 2018, 6745629. [Google Scholar] [CrossRef]
  137. Kasischke, E.S.; Goetz, S.; Hansen, M.C.; Ozdogan, M.; Rogan, J.; Ustin, S.L.; Woodcock, C.E. Remote Sensing for Natural Resource Management and Environmental Monitoring, 3rd ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2014. [Google Scholar]
  138. Steininger, M.K. Satellite estimation of tropical secondary forest above-ground biomass: Data from Brazil and Bolivia. Int. J. Remote Sens. 2000, 21, 1139–1157. [Google Scholar] [CrossRef]
  139. Mutanga, O.; Masenyama, A.; Sibanda, M. Spectral saturation in the remote sensing of high-density vegetation traits: A systematic review of progress, challenges, and prospects. ISPRS J. Photogramm. Remote Sens. 2023, 198, 297–309. [Google Scholar] [CrossRef]
  140. Pandit, S.; Tsuyuki, S.; Dube, T. Estimating above-ground biomass in sub-tropical buffer zone community forests, Nepal, using Sentinel-2 data. Remote Sens. 2018, 10, 601. [Google Scholar] [CrossRef]
  141. Mutanga, O.; Adam, E.; Cho, M.A. High density biomass estimation for wetland vegetation using WorldView-2 imagery and random forest regression algorithm. Int. J. Appl. Earth Obs. Geoinf. 2012, 18, 399–406. [Google Scholar] [CrossRef]
  142. Almeida, D.R.A.D.; Stark, S.C.; Chazdon, R.; Nelson, B.W.; César, R.G.; Meli, P.; Gorgens, E.B.; Duarte, M.M.; Valbuena, R.; Moreno, V.S.; et al. The effectiveness of lidar remote sensing for monitoring forest cover attributes and landscape restoration. For. Ecol. Manag. 2019, 438, 34–43. [Google Scholar] [CrossRef]
  143. Arjasakusuma, S.; Swahyu Kusuma, S.; Phinn, S. Evaluating variable selection and machine learning algorithms for estimating forest heights by combining lidar and hyperspectral data. ISPRS Int. J. Geo-Inf. 2020, 9, 507. [Google Scholar] [CrossRef]
  144. Moeckel, T.; Safari, H.; Reddersen, B.; Fricke, T.; Wachendorf, M. Fusion of ultrasonic and spectral sensor data for improving the estimation of biomass in grasslands with heterogeneous sward structure. Remote Sens. 2017, 9, 98. [Google Scholar] [CrossRef]
  145. Suttle, K.B.; Thomsen, M.A.; Power, M.E. Species interactions reverse grassland responses to changing climate. Science 2007, 315, 640–642. [Google Scholar] [CrossRef]
  146. Yang, L.H.; Rudolf, V.H.W. Phenology, ontogeny and the effects of climate change on the timing of species interactions. Ecol. Lett. 2010, 13, 1–10. [Google Scholar] [CrossRef] [PubMed]
  147. Zhu, X.X.; Tuia, D.; Mou, L.; Xia, G.-S.; Zhang, L.; Xu, F.; Fraundorfer, F. Deep learning in remote sensing: A comprehensive review and list of resources. IEEE Geosci. Remote Sens. Mag. 2017, 5, 8–36. [Google Scholar] [CrossRef]
  148. Keane, R.E.; McKenzie, D.; Falk, D.A.; Smithwick, E.A.H.; Miller, C.; Kellogg, L.-K.B. Representing climate, disturbance, and vegetation interactions in landscape models. Ecol. Model. 2015, 309–310, 33–47. [Google Scholar] [CrossRef]
Figure 1. Map of the United States highlighting the study area in the state of Florida. The map also indicates the locations of the validation sites: (1) Apalachicola National Forest, (2) Green Swamp Wilderness Preserve, and (3) Babcock Ranch Preserve.
Figure 1. Map of the United States highlighting the study area in the state of Florida. The map also indicates the locations of the validation sites: (1) Apalachicola National Forest, (2) Green Swamp Wilderness Preserve, and (3) Babcock Ranch Preserve.
Remotesensing 17 00320 g001
Figure 2. A workflow of the proposed methodology for canopy cover modeling using GEDI, Sentinel-1HLS, and ancillary data.
Figure 2. A workflow of the proposed methodology for canopy cover modeling using GEDI, Sentinel-1HLS, and ancillary data.
Remotesensing 17 00320 g002
Figure 3. Histogram with density curves of model performance across 100 bootstrap runs. The plots show the distributions of relative and absolute RMSD, relative and absolute MD, and R2 values. Each plot includes an average performance line (solid black line) and standard deviation indicators (orange dotted lines) to illustrate the variability in model performance.
Figure 3. Histogram with density curves of model performance across 100 bootstrap runs. The plots show the distributions of relative and absolute RMSD, relative and absolute MD, and R2 values. Each plot includes an average performance line (solid black line) and standard deviation indicators (orange dotted lines) to illustrate the variability in model performance.
Remotesensing 17 00320 g003
Figure 4. Bar plot of the importance of input predictors in the RF regression models using Sentinel-1C, HLS, and ancillary data for the metrics that demonstrated more than 10% in the importance scoring.
Figure 4. Bar plot of the importance of input predictors in the RF regression models using Sentinel-1C, HLS, and ancillary data for the metrics that demonstrated more than 10% in the importance scoring.
Remotesensing 17 00320 g004
Figure 5. Canopy cover maps from GEDI footprint data and final wall-to-wall maps. The maps on the left (a1a3) illustrate the spatial distribution of canopy cover at the GEDI footprint level and (b1b3) the corresponding model-derived estimates across the validation sites. (c1c3) The density plots on the right depict the distribution of canopy cover percentages for each validation area.
Figure 5. Canopy cover maps from GEDI footprint data and final wall-to-wall maps. The maps on the left (a1a3) illustrate the spatial distribution of canopy cover at the GEDI footprint level and (b1b3) the corresponding model-derived estimates across the validation sites. (c1c3) The density plots on the right depict the distribution of canopy cover percentages for each validation area.
Remotesensing 17 00320 g005
Figure 6. (a) Final wall-to-wall map for the state of Florida, illustrating the prediction and uncertainty for each pixel. The yellow gradient indicates more canopy cover with low uncertainty, the blue gradient represents less canopy cover with high uncertainty, and the red/purple gradient displays areas with more canopy cover and high uncertainty. Note: transparent/black areas on the map correspond to lakes, rivers, or cloud presence; (b) spatial distribution of canopy cover as a percentage along the latitude gradient of Florida; (c) spatial distribution of canopy cover as a percentage along the longitude gradient of Florida.
Figure 6. (a) Final wall-to-wall map for the state of Florida, illustrating the prediction and uncertainty for each pixel. The yellow gradient indicates more canopy cover with low uncertainty, the blue gradient represents less canopy cover with high uncertainty, and the red/purple gradient displays areas with more canopy cover and high uncertainty. Note: transparent/black areas on the map correspond to lakes, rivers, or cloud presence; (b) spatial distribution of canopy cover as a percentage along the latitude gradient of Florida; (c) spatial distribution of canopy cover as a percentage along the longitude gradient of Florida.
Remotesensing 17 00320 g006
Figure 7. Map showing the spatial distribution of canopy cover across different forest types (FNAI 2010) throughout the state, with bar plot illustrating the mean canopy cover percentage for each forest type and error bars representing the standard deviation. Forest types with different letters indicate statistically significant differences in canopy cover according to Dunn’s test (p < 0.05). The forest types include wetland hardwood forest (WHF), wetland mixed forest (WMF), wetland coniferous forest (WCF), upland mixed forest (UMF), upland hardwood forest (UHF), and upland coniferous forest (UCF).
Figure 7. Map showing the spatial distribution of canopy cover across different forest types (FNAI 2010) throughout the state, with bar plot illustrating the mean canopy cover percentage for each forest type and error bars representing the standard deviation. Forest types with different letters indicate statistically significant differences in canopy cover according to Dunn’s test (p < 0.05). The forest types include wetland hardwood forest (WHF), wetland mixed forest (WMF), wetland coniferous forest (WCF), upland mixed forest (UMF), upland hardwood forest (UHF), and upland coniferous forest (UCF).
Remotesensing 17 00320 g007
Figure 8. (a) Map showing the average canopy cover for each county in Florida; (b) map illustrating the standard deviation of canopy cover for each county in Florida; (c) bar plot of canopy cover by county, displaying the 5 counties with the highest canopy cover and the 5 counties with the lowest canopy cover, with error bars representing the standard deviation for each county; (d) bar plot showing the standard deviation of canopy cover for the 5 counties with the highest standard deviation and the 5 counties with the lowest standard deviation.
Figure 8. (a) Map showing the average canopy cover for each county in Florida; (b) map illustrating the standard deviation of canopy cover for each county in Florida; (c) bar plot of canopy cover by county, displaying the 5 counties with the highest canopy cover and the 5 counties with the lowest canopy cover, with error bars representing the standard deviation for each county; (d) bar plot showing the standard deviation of canopy cover for the 5 counties with the highest standard deviation and the 5 counties with the lowest standard deviation.
Remotesensing 17 00320 g008
Table 1. Sentinel-1 GRD SAR bands and the corresponding vegetation indices, including their respective acronyms and associated wavelengths or equations.
Table 1. Sentinel-1 GRD SAR bands and the corresponding vegetation indices, including their respective acronyms and associated wavelengths or equations.
MetricAcronymEquation/WavelengthReference
Sentinel-1 GRD
Vertical–vertical; vertical–horizontal polarizationσVV and σVH5.405 GHz[52]
Radar Vegetation IndexRVI σ VV σ VV + σ VH × σ VV σ VH [53]
Co-polarization RatioCOPOL V V V H [54]
Co-polarization Ratio 2COPOL 2 V V V H V V + V H [55]
Co-polarization Ratio 3COPOL 3 V H V V [56]
Table 2. HLS spectral bands and corresponding vegetation indices, including their respective acronyms and associated wavelengths or equations.
Table 2. HLS spectral bands and corresponding vegetation indices, including their respective acronyms and associated wavelengths or equations.
MetricAcronymEquation/WavelengthReference
HLS
Blue, green, red, near-infrared, short-wave infrared, short-wave infrared 2B, G, R, NIR, SWIR1, SWIR20.490 µm, 0.560 µm, 0.665 µm, 0.842 µm, 1.610 µm, 2.190 µm[57]
Normalized Difference Vegetation IndexNDVI N I R R N I R + R [59]
Enhanced Vegetation IndexEVI 2.5 × NIR R NIR + 6 × R 7.5 × B + 1 [60]
Soil-Adjusted Vegetation IndexSAVI 1.5 × NIR R NIR + R + 0.5 [61]
Modified -Adjusted Vegetation IndexMSAVI2 2 × NIR + 1 2 × NIR + 2 2 8 × NIR R 2 [62]
Linear Spectral UnmixingLSU-[63]
SoilFSOIL 0.140 × B + 0.160 × G + 0.220 × R + 0.390 × NIR
+ 0.450 × SWIR 1 × 0.270 × SWIR 2
WaterFWATER 0.070 × B × 0.039 × G + 0.023 × R + 0.031 × N I R + 0.011 × S W I R 1 + 0.007 × S W I R 2
VegetationFVEG 0.086 × B + 0.062 × G + 0.043 × R + 0.247 × N I R + 0.109 × S W I R 1 + 0.039 × S W I R 2
Simple Ratio IndexSRI N I R R [64]
Normalized Difference Water IndexNDWI G + N I R G N I R [65]
Green Chlorophyll IndexGCI NIR G 1 [66]
Wide Dynamic Range Vegetation
Index
WDRVI 0.1 × N I R 0.2 × R 0.1 × NIR + 0.2 × R + 0.5 [67]
Global Vegetation Moisture IndexGVMI N I R + 0.1 S W I R 1 0.02 N I R + 0.1 + S W I R 1 + 0.02 [68]
Chlorophyll Vegetation indexCVI NIR × R G 2 [69]
Clay Minerals RatioCMR S W I R 1 S W I R 2 [70]
Kernel-Normalized Difference Vegetation IndexKNDVI tanh N I R R 2 σ 2 [71]
Table 3. Spatial transformations applied in this study, along with their corresponding acronyms and mathematical equations.
Table 3. Spatial transformations applied in this study, along with their corresponding acronyms and mathematical equations.
Spatial TransformationAcronymEquationReference
GLCM
Angular Second MomentMetric + “_ASM” i j p i , j 2 [77]
ContrastMetric + “_CONT” i j p i , j i - j 2
CorrelationMetric + “_COR” i j ( ij ) p ( i , j ) - μ x μ y / σ x σ y
VarianceMetric + “_VAR” i j i μ 2 p i , j
Inverse Difference MomentMetric + “_IDM” i j ( 1 / ( 1 + i - j 2 ) ) p ( i , j )
Sum VarianceMetric + “_SVAR” i = 2 2 N g ( i s e n t ) 2 p x + y ( i )
Sum AverageMetric + “_SVG” i = 2 2 N g i p x + y ( i )
Sum EntropyMetric + “_SENT” i = 2 2 N g p x y ( i ) l o g { p x + y ( i ) }
EntropyMetric + “_ENT” i j p i , j log p i , j
Difference VarianceMetric + “_DVAR”Variance of p x y
Difference EntropyMetric + “_DENT” i = 0 N g 1 p x y i l o g P x y i
Informal Measures of Correlation 1Metric + “_IMCORR1” ( HXY - HXY 1 ) / max { XH , XY }
Informal Measures of Correlation 2Metric + “_IMCORR2” 1 exp 2 H X Y 2 H X Y 0.5
Maximum Correlation CoefficientMetric + “_MAXCOR” k p i , k p j , k p x i p y k
DissimilarityMetric + “_DISS” i j i - j p ( i , j )
InertiaMetric + “_INERTIA” i j i - j 2 p i , j
ShadeMetric + “_SHADE” i = 0 N g - 1 i + j - μ x - μ y 3 p ( i , j )
Cluster ProminenceMetric + “_PROM” i = 0 N g - 1 i + j - μ x - μ y 4 p ( i , j )
Moving window
MaximumMetric + “_MAX” max i , j win F ( i , j ) [80]
MinimumMetric + “_MIN” min i , j win F ( i , j )
MeanMetric + “_MEAN” 1 / n i , j win F ( i , j )
Standard DeviationMetric + “_STD” 1 / n i , j win ( F ( i , j ) - μ ( x , y ) ) 2
Where P(i,j) represents the probability of co-occurrence of pixel pairs with gray levels i and j within a specified window. This is derived from the normalized GLCM, where each entry (i,j) reflects the likelihood of gray levels i and j appearing together in a given spatial relationship. i,j are indices corresponding to the gray tone levels in the GLCM, and also serve as coordinates in the matrix, where each entry at position (i,j) indicates the co-occurrence probability for gray tones i and j. Ng is the number of distinct gray levels in the quantized image, which is derived from reducing the original image’s gray value range to a fixed number of levels for analysis. The mean (μ) and standard deviation (σ) of P(i,j) summarize the average co-occurrence probability and its variability, respectively [80]. px(i) represents the probability of gray level i appearing as the first value in the co-occurring pixel pairs; py(j) represents the probability of gray level j appearing as the second value in the co-occurring pixel pairs; HX and HY are entropies of px and py; px + y is the sum distribution, representing the probability of the sum of gray levels i + j; px − y refers to the difference distribution, representing the probability of the difference between gray levels i − j; HXY is the joint entropy of the GLCM; and t quantifies the randomness or uncertainty in the entire co-occurrence matrix. HXY1 is the cross-entropy based on row and column marginal probabilities; HXY2 is defined as the cross-entropy based on the joint marginal probabilities of px + y; p refers to the marginal probabilities of gray levels; and Sent is the sum entropy, measuring the randomness of the sum of gray levels [76]. The notation “i, j ∈ win” denotes that the indices i and j are within a specified window “win” in an image. The term “win” specifies the boundary or the extent of this local neighborhood, and the operations (like max, min, or average) are conducted exclusively over the pixels that fall within this window. F(i,j) represents a function applied to each pixel (or pixel pair) located at position (i,j) in the image.
Table 4. Summary of canopy cover (%) and associated errors for each validation site. The reported canopy cover includes ±2 times the standard error to indicate the 95% confidence interval. GSWP = Green Swamp Wilderness Preserve; BRP = Babcock Ranch Preserve; and ANF = Apalachicola National Forest.
Table 4. Summary of canopy cover (%) and associated errors for each validation site. The reported canopy cover includes ±2 times the standard error to indicate the 95% confidence interval. GSWP = Green Swamp Wilderness Preserve; BRP = Babcock Ranch Preserve; and ANF = Apalachicola National Forest.
SiteEstimated Canopy
Cover Mean (%) ± Standard Error
Standard Error (%)Relative Standard Error to the Mean (%)
BRP20.8 ± 36.618.387.9
ANF42.4 ± 41.020.548.3
GSWP43.9 ± 40.920.546.7
Table 5. Results of external validation using ALS data. GSWP = Green Swamp Wilderness Preserve; BRP = Babcock Ranch Preserve; ANF = Apalachicola National Forest; all sites = GSWP + BRP + ANF.
Table 5. Results of external validation using ALS data. GSWP = Green Swamp Wilderness Preserve; BRP = Babcock Ranch Preserve; ANF = Apalachicola National Forest; all sites = GSWP + BRP + ANF.
SiteR2RMSDMD
Abs. %Rel. %Abs. %Rel. %
GSWP0.640.300.30−0.24−0.24
BRP0.6143.0542.12−34.21−34.71
ANF0.570.260.29−0.17−0.22
All sites0.7064.1547.31−44.04−36.15
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Schlickmann, M.B.; Bueno, I.T.; Valle, D.; Hammond, W.M.; Prichard, S.J.; Hudak, A.T.; Klauberg, C.; Karasinski, M.A.; Brock, K.M.; Rocha, K.D.; et al. Statewide Forest Canopy Cover Mapping of Florida Using Synergistic Integration of Spaceborne LiDAR, SAR, and Optical Imagery. Remote Sens. 2025, 17, 320. https://doi.org/10.3390/rs17020320

AMA Style

Schlickmann MB, Bueno IT, Valle D, Hammond WM, Prichard SJ, Hudak AT, Klauberg C, Karasinski MA, Brock KM, Rocha KD, et al. Statewide Forest Canopy Cover Mapping of Florida Using Synergistic Integration of Spaceborne LiDAR, SAR, and Optical Imagery. Remote Sensing. 2025; 17(2):320. https://doi.org/10.3390/rs17020320

Chicago/Turabian Style

Schlickmann, Monique Bohora, Inacio Thomaz Bueno, Denis Valle, William M. Hammond, Susan J. Prichard, Andrew T. Hudak, Carine Klauberg, Mauro Alessandro Karasinski, Kody Melissa Brock, Kleydson Diego Rocha, and et al. 2025. "Statewide Forest Canopy Cover Mapping of Florida Using Synergistic Integration of Spaceborne LiDAR, SAR, and Optical Imagery" Remote Sensing 17, no. 2: 320. https://doi.org/10.3390/rs17020320

APA Style

Schlickmann, M. B., Bueno, I. T., Valle, D., Hammond, W. M., Prichard, S. J., Hudak, A. T., Klauberg, C., Karasinski, M. A., Brock, K. M., Rocha, K. D., Xia, J., Vieira Leite, R., Higuchi, P., da Silva, A. C., Maximo da Silva, G., Cova, G. R., & Silva, C. A. (2025). Statewide Forest Canopy Cover Mapping of Florida Using Synergistic Integration of Spaceborne LiDAR, SAR, and Optical Imagery. Remote Sensing, 17(2), 320. https://doi.org/10.3390/rs17020320

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop