Next Article in Journal
Mapping Windthrow Severity as Change in Canopy Cover in a Temperate Eucalypt Forest
Previous Article in Journal
Wildfire Severity to Valued Resources Mitigated by Prescribed Fire in the Okefenokee National Wildlife Refuge
Previous Article in Special Issue
Assessment of FY-3E GNOS II Radio Occultation Data Using an Improved Three-Cornered Hat Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Influence of the Spatial Co-Registration Error on the Estimation of Growing Stock Volume Based on Airborne Laser Scanning Metrics

by
Marek Lisańczuk
,
Krzysztof Mitelsztedt
and
Krzysztof Stereńczak
*
Department of Geomatics, Forest Research Institute, Sękocin Stary, 3 Braci Leśnej Street, 05-090 Raszyn, Poland
*
Author to whom correspondence should be addressed.
Submission received: 26 November 2024 / Revised: 12 December 2024 / Accepted: 13 December 2024 / Published: 17 December 2024

Abstract

:
Remote sensing (RS)-based forest inventories are becoming increasingly common in forest management. However, practical applications often require subsequent optimisation steps. One of the most popular RS-based forest inventory methods is the two-phase inventory with regression estimator, commonly referred to as the area-based approach (ABA). There are many sources of variation that contribute to the overall performance of this method. One of them, which is related to the core aspect of this method, is the spatial co-registration error between ground measurements and RS data. This error arises mainly from the imperfection of the methods for positioning the sample plots under the forest canopy. In this study, we investigated how this positioning accuracy affects the area-based growing stock volume (GSV) estimation under different forest conditions and sample plot radii. In order to analyse this relationship, an artificial co-registration error was induced in a series of simulations and various scenarios. The results showed that there were minimal differences in ABA inventory performance for displacements below 4 m for all stratification groups except for deciduous sites, where sub-metre plot positioning accuracy was justified, as site- and terrain-related factors had some influence on GSV estimation error (r up to 0.4). On the other hand, denser canopy and spatially homogeneous stands mitigated the negative aspects of weaker GNSS positioning capabilities under broadleaved forest types. In the case of RMSE, the results for plots smaller than 400 m2 were visibly inferior. The BIAS behaviour was less strict in this regard. Knowledge of the actual positioning accuracy as well as the co-registration threshold required for a particular stand type could help manage and optimise fieldwork, as well as better distinguish sources of statistical uncertainty.

1. Introduction

In many modern inventory projects, remote sensing (RS) data are used to obtain spatial information about the forest environment. The application of RS methods offers extensive possibilities for environmental surveys. The most notable are spatially continuous data for each area of interest (AOI) within the surveyed region and the ability to efficiently map large and hardly accessible sites [1]. As RS technology and the corresponding methods are constantly being developed and adapted for forestry purposes, many issues concerning optimisation steps are being studied.
One of the most widely used RS forest inventory methods is based on the principles of the two-phase survey with regression estimator [2]. The core aspect of this approach is to establish a relationship (estimator/model) between the traditional ground samples (also referred to as second-stage sampling plots) and the RS metrics (first phase) within a corresponding spatial extent. In the next stage, the validated model is used to estimate the variable of interest for each AOI, e.g., forest stand. Therefore, it seems to be of utmost importance to ensure the best possible spatial link between ground (reference) and RS data and minimise this source of uncertainty that could propagate the inventory error in the later stages.
One of the tasks carried out during the fieldwork concerns the positioning of the sample plots. Information about the XY coordinates of the plots is needed to register their attributes with the corresponding space on the RS layers. Such positioning is mainly performed with the help of GNSS (Global Navigation Satellite System) measurements. A very common positioning technology is real-time kinematics (RTK), which is recognised for its balance between reliability and ease of data acquisition for surveying applications and navigation [3]. This technique uses instantaneous correction data and applies it to its own raw measurements. Positioning updates come either from independent GNSS signal acquisition sources, such as fixed, country-specific triangulated stations (e.g., ASG-EUPOS in Poland) or from mobile reference base stations. RTK technology usually operates in two modes: fixed—when corrections are available at the time of measurement; otherwise, the acquisition solution is called float. The first mode is known to provide precise and more stable results and allows fast measurements [4]. However, under forest-specific conditions, such as signal interference from trees and their canopy, as well as in remote, off-grid sites, a scarcity of GNSS signal often impedes reliable fixes. Under such conditions, static occupation is an alternative. In contrast to RTK, static acquisition can be time-consuming, i.e., 10–30 min per plot, depending on receiver type and site characteristics. Such a duration is usually required to achieve sub-metre horizontal positioning error [5,6]. Yet, in some specific circumstances related to acquisition conditions, the accuracy can deteriorate to over 1 or even 2 m [4,6,7,8], e.g., if there is a direct obstacle, such as the direct proximity of the tree trunk and the receiver antenna [3]. The design of the receiver also has an influence on the positioning error under tree canopies. Kaartinen et al. [9] have shown that dual-frequency geodetic receivers are very sensitive to satellite visibility, whereas modern single-frequency devices can be more independent in this aspect.
Optimisation of GNSS measurement sessions could also have practical importance for forest survey crews. Knowledge of both current and site-specific allowable positioning errors could help to further optimise survey work in forest inventory campaigns. Some GNSS receivers are equipped with an indicator for the actual positioning precision. It is commonly calculated as the average difference between consecutive data loggings during static occupations. Such precision level partially corresponds to the accuracy of the current positioning session. Furthermore, Hussain et al. [10] developed an Adaptive Environment Navigation (AEN) system that is capable of recognising different types of multipath GNSS signal environments (such as forests) during measurement sessions. Such environments can adversely affect the ability to register satellite signals [11]. In addition, Feng et al. [12] developed a model that predicts positioning errors. Therefore, the need to describe site-specific thresholds for plot positioning accuracy seems to be well justified.
The positioning of the plots could influence the overall efficiency of the inventory campaign. Mauro et al. [13] state that accurate determination of plot locations can be crucial in the development of robust predictive models based on high-resolution RS data. Janssen et al. [14] have shown that adjusted co-registration can improve the estimation of timber volume based on regression predictions in the two-phase ALS inventory. These improvements are greater when dGNSS (differential corrections) are not available. They reported a 31% reduction in root mean square error (RMSE) for timber volume and a 10% reduction in RMSE for basal area estimation when the original co-registration changes were applied to the raw positioning data. If available, differential corrections in conjunction with IMU sensors can also contribute to better positioning results, which can achieve sub-metre accuracy under forest canopy even when positioning in motion [9].
The size of the sample plot area, in combination with the stand spatial structure, also appears to have an influence on co-registration importance. Hernández-Stefanoni et al. [15] and Frazer et al. [16] reported that larger sample areas are less affected by the effect of co-registration error. However, they used a relatively small sample (57 units) or simulated data only. Gobakken and Næsset [17] reported that in dense, mature coniferous stands, the differences in volume estimates in two-phase sampling inventories did not exceed 10% in most cases, even with a positioning error of 5 m, although with fewer trees, positioning accuracy proved to be more important. However, no terrain characteristics nor deciduous sites were involved in that study.
Certain discrepancies in the results of the above-mentioned papers, as well as their relatively narrow scope, formed the basis for this study. Some of the differences could have their origin in site/stand-specific characteristics. It was assumed that forest inventory estimates in homogeneous stands are less affected by positioning and co-registration errors than in stands with a more complex structure. Therefore, the authors investigated how airborne laser scanning (ALS)-based growing stock volume (GSV) estimation reliability attenuates sample plot positioning error under different forest conditions and for various sizes of sample plot areas. Therefore, the aim was to determine the importance of plot positioning accuracy for GSV estimation errors in a two-phase sampling design. Analyses were conducted at two levels, i.e., for the entire sample and for individual plots, including stratification by specific forest and terrain conditions.

2. Materials and Methods

2.1. Study Area and Ground Survey

In order to capture different site conditions, two completely different forest districts were included in the analysis. The first, Supraśl, located in the lowlands of north-east Poland, consists of even-aged single-layer coniferous stands (75% share of Scots pine, European larch and Norway spruce). The second (Gorlice) is located in the mountainous strip in south-eastern Poland and consists of uneven-aged, multi-layered stands with a 54% share of broadleaf species (mainly European beech) and silver fir among the conifer species. Table 1 compares the quantitative descriptions of selected forest districts.
A set of 500 circular sample plots with a radius of 12.62 m was distributed in an even grid of 350 m over each study area. The heights and breast height diameters (DBH) were captured for all trees exceeding 7 cm in DBH, using conventional measuring devices, i.e., rangefinders and callipers. The age of the stands was calculated on the basis of previous inventories, as most of the stands had been planted. In case of doubt regarding the age of the tree layers, an increment borer was used to assess this attribute. Finally, the volume of each tree was estimated using allometric equations commonly used in Polish forestry [18]. The main variable of interest in this study was the growing stock volume (GSV), defined as the sum of the volumes of individual trees within a plot divided by the plot area. GSV was used as a benchmark indicator for the analyses as it is one of the most important and common variable of interest estimated in most forest inventories [19]. It is also a traditional indicator of timber resources, carbon stock (thanks to its close relationship with aboveground biomass), management efficiency and sustainability in the forest sector [20,21].
Table 1. Summary statistics of topography and forest attributes across the study areas.
Table 1. Summary statistics of topography and forest attributes across the study areas.
VariableMeanStandard DeviationCV [%]
DistrictLowlandsMountainsLowlandsMountainsLowlandsMountains
GSV [m3/ha]4073761551703845
Tree height [m]21173.981848
DBH [cm]212211145164
Age [years]535724264545
Trees dens. [n/ha]7625893753684963
Slope [degrees]4.813.62.84.85835
TRI *0.0330.190.0190.075837
* Terrain Rugged Index: mean of the absolute differences between the height value of a DTM cell and its 8 surrounding cells. Averaged at plot level [22].

2.2. GNSS Positioning

The coordinates of the plot’s origins were determined by static GNSS measurements with Topcon HiPer V receivers. The receiving module was placed at a height of 4.65 m above the ground on 85% of the plots and at a height of above 2 m on 97% of the plots. Static measurements of at least 20 min duration were taken for each origin. Raw observations were then adjusted using the post-processing kinematic (PPK) method by applying corrections from ASG-EUPOS, a Polish GNSS fixed position correction network. The theoretical accuracy of the averaged positions of plots’ origins was reported to be less than one metre [23].

2.3. ALS Data

Airborne laser scanning missions were flown during the same season as the fieldwork. Point clouds with an average density of 10–13 pulses/m2 were obtained. The mean error of XY points coordinates was less than 20 cm, and the mean vertical error was below 15 cm. The overall accuracy of the point cloud classification was over 95%. Digital terrain models (DTMs) were interpolated based on the ALS point clouds to target a resolution of 1 m. The maximum elevation error of the DTM was less than 0.3 m (0.08 m on average). The DTM was used for two reasons: first, to provide terrain-related variables such as slope and Terrain Rugged Index (TRI) [22] for each sample area. Secondly, to normalise the heights of the LiDAR points so that meaningful and comparable ALS forest metrics could be calculated for each plot.

2.4. Stratification

The forest area under investigation is characterised by diverse forest conditions, i.e., some stands are very homogeneous, while others have complex structures. There is also a variety of tree species groups and a non-uniform terrain. Assuming that all trees in the exemplary stand had been the same, growing on perfectly uniform terrain, the effect of co-registration error would have been of minor importance. In reality, however, this is never the case. For this reason, a list of factors was selected that could have a hypothetical influence on the co-registration accuracy (Table 2). The results for nominal data type/factors were presented in terms of RMSE and bias, both for non-stratified samples as well as for specific groups with similar site characteristics, i.e., lowlands, mountains, coniferous, deciduous, mixed. The results for continuous data types/factors were presented in terms of correlations between GSV estimation change due to registration error and given factor, e.g., slope or tree density. Five sample plot area variants were analysed in this study: 500, 400, 300, 200 and 100 m2. Scenarios with smaller plots were not planned because such radii rarely occur in forest stand inventories and, as Mauro et al. [13] have shown, only plots with a radius of more than 10 m led to an insignificant positioning error. However, with respect to data clarity, in some figures, only three plot sizes were shown, i.e., 500 m2 (original), 400 m2 (commonly applied in Polish forestry) and 300 m2. Furthermore, Stereńczak et al. [24] and Lisańczuk et al. [25] reported that plots smaller than 300 m2 are not suitable for forest inventory. The design of this study resulted in 990 scenarios. There were 2,930,247 unique plot estimates. All calculations took approximately 14 days using a single-thread Intel Core i9-13900KF processor.

2.5. GSV Estimation

The ALS metrics (Table 3, Appendix A) were computed for the sample plots using the lidR package [28,29] for R programming language for statistical computing. The variables were derived in 6 variants: all, first only and last only echoes, with and without a 2 m cut-off from the ground. To account for possible non-linearity among the potential predictors, the ALS metrics were analysed both in their original forms as well as in their linearly transformed equivalents (using log, power and square transformations). Table 3 contains a juxtaposition of calculated variables along with their importance according to the Random Forest (RF) estimator. These are standard metrics commonly applied in ALS-based forest inventories and described in [30,31], as well as so-called ‘basket metrics’ described in [32,33] that characterise the internal vertical structure of forests.
In the next step, an RF model was trained to estimate the GSVs for each. The following values for hyperparameters were set as a result of the tuning analysis: number of trees—501, number of predictors at each node—7, node size—5. We could not detect any substantial influence of these parameters on RF-based predictions as long as the number of trees grown was set to a relatively high value, i.e., above 300, to keep the variance of the results stable [34]. Leave-one-out cross-validation (LOOCV) was used for the following reasons: (1) to ensure that exactly the same plots were present in each scenario, in order to exclude the part of the random effect caused by different samples, (2) to ensure large test sample size for the RF regression and (3) to avoid overfitting.

2.6. Monte Carlo Simulations

The main concept of the research was based on a series of Monte Carlo simulations. The displacement of the plots (positioning drift/shift/co-registration effect) was simulated in 1 m increments from the original position of the plots up to a distance of 10 m. The displacement directions were random and independent for each plot. Second, the ALS metrics were recalculated for the newly determined plot positions in each simulation. Next, the GSV estimates were derived for each plot based on the output of the RF model trained on the shifted dataset in a given scenario. Finally, the new GSV estimates were juxtaposed with the reference values from the ground inventory. In order to evaluate the magnitude of the co-registration effect, a commonly applied statistical modelling kind of errors were calculated, i.e., RMSE and bias (Table 4, Table 5 and Table 6), as well as plot level cumulative error frequencies (Figure 1).
For each scenario, the simulation was repeated 30 times [35,36] to account for the randomness of the displacement direction and to establish a balance between the reliability of the results [37,38] and computation time. All data preparation, processing and analysis were performed in the R programming language for statistical computing, using packages such as lidR [28,29] for ALS metrics, terra [39] for terrain-related variables, spatstat [40,41] for spatial distributions of trees, random Forest [42,43] for regression and BAMMtools [44] for stratification.

3. Results

The results were presented at three levels of generalisation. First, we show the gradient of the estimation errors (Table 4, Table 5 and Table 6) over the course of changing positioning shifts and the different plot sizes for the entire sample as well as for stratified samples. These tables show the maximum errors from 30 iterations to account for the worst scenarios. Next, we take a look at the distribution of error dynamics at the individual plot level (Figure 1), as the ALS forest inventories also allow for small area (single stand) estimation for every AOI within the scanned region. These figures (curves) show the mean frequencies to illustrate general trends and the distribution of error variation in GSV estimation. Such generalisation was possible because the variation between each iteration was small enough (typically 1–5%), probably due to the large sample size. Nevertheless, one should bear in mind that presented trend lines may have some narrow confidence intervals. Finally, Table 7 presents the correlations between the changes in GSV estimation due to co-registration error and specific site/terrain characteristics. More detailed results explanations can be found next to particular tables/figures.
Figure 1. Distribution of GSV estimation error due to co-registration shift and plot area.
Figure 1. Distribution of GSV estimation error due to co-registration shift and plot area.
Remotesensing 16 04709 g001
Table 4 presents the dynamics of the GSV estimation error, expressed as mean %RMSE (from 30 iterations), in the dimensions of positioning accuracy and changing plot sizes. The error gradient follows a logical distribution, i.e., larger displacement and smaller plots–larger error. The error rates were presented in a gradient span of 5 percentage points (pp) for easier interpretation and comparison between particular stratification groups. As can be seen, the lowland and coniferous groups have the lowest values of %RMSE, following fairly similar trends. For these strata, even 4–6 m positioning drift can result in %RMSE of less than 20%, provided that 500 m2 plots are used for model calibration. The highest error rates were observed for mountains and deciduous tree stands, where only 500 m2 plots with very high positioning accuracy, i.e., <1 m, ensured RMSE of below 30%.
Apart from the relative error values, Table 5 shows a decrease in the RMSE compared to the estimation performed with the original sample plots. The most important result indicates that except for deciduous forests, the deterioration of GSV estimation strength oscillates at around 1–3 pp, up to a position shift of 3–5 m for 400/500 m2 sample plots. After this range, the error derivatives gain significant declivity (dark grey areas).
Table 6 shows how the systematic error depends on the radius of the plots and the accuracy of the co-registration. The most visible aspect concerns slightly biased estimates for mountain plots (from 6 to 12%). The BIAS occurring in the non-stratified scenario was most likely caused by the mountainous part of the sample. A sharp BIAS transition is visible in the lowland and especially in the deciduous stratum when the plot size is changed from 200 to 100 m2. Apart from the extreme scenarios, the BIAS issue was irrelevant in deciduous, mixed and coniferous stands as well as in the lowlands. The acceleration of this type of error was also visibly slower across the analysed factors than in the case of %RMSE. In some cases, the analysed shift range was even too small to detect its negative impact on the accuracy of the estimates (see mixed or lowland groups 400/500 m2).
ALS-assisted forest inventories not only help to infer the mean values and total values of the entire sampling population but also enable estimation for small, non-sampled in the second-phase areas [2,45]. Therefore, it is also important to show the changes in the distribution of the GSV estimation error along the analysed factors at a sample plot level, as shown in Figure 1. In the interest of clarity of the results, only selected variants were presented. Nevertheless, the designed graphs show the stepwise error progression of the obtained results so that missing scenarios can be easily visually interpolated between the presented trends. The positioning and plot size factors had roughly the same effect on the GSV estimates, regardless of the stratification layers (Figure 1). All curves have a logarithmic flow, whose steepness depends mainly on the shift value, i.e., the lower the co-registration error, the steeper the curve, meaning that more plots were relatively unaffected by the effects of the controlled experiment. Some differences can be seen in the initial parts of the graphs. For example, when considering a 10% threshold for the estimation difference, 80% of all 400 m2 sample plots were still below this threshold, even with a position error of 3 m (Figure 1—blue dashed line). However, when only lowland areas were analysed (Figure 1), the proportion of sample plots meeting the same criteria increased to 85%. Consequently, if a 20% deviation in GSV estimation was an acceptable value for small areas, even a 5 m shift in plot positions would not result in 90% of the 500 m2 sample areas being above this margin of error (except in the mountains). Again, the mountain and deciduous forests were slightly more susceptible to induced effects than the other stratification groups analysed.
In the initial values of the sources of error analysed, the positioning of the plots had a greater influence on the GSV estimates than the area of the plots (colour curves become more linear with the displacement distance than with the area of the plots). However, this trend becomes more floating above about a 5 m positioning shift. In addition, the deterioration in GSV estimation performance was non-linear when moving from 500 m2 plots to 300 m2 plots. The GSV differences were greater when moving from 400 to 300 m2 than when moving from 500 to 400 m2.
Table 7 was designed in order to trace analysed dependencies on the factors represented by continuous data. Among tested site characteristics, tree density (TD) was found to be the strongest error-causing factor (Pearson correlation coefficient up to r = −0.43) for plots with mixed species groups. This means that the more trees there are on the site, the lower the proportion of GSV estimation errors caused by plot co-registration and vice versa. Moreover, this relationship generally becomes stronger the larger the plot radius is (increase from r = −0.25 to −0.43 for 100 and 500 m2 plots for mixed stands and from r = −0.32 to r = −0.40 for lowlands, respectively).
Canopy Height Diversity (CHD) was positively correlated with the error magnitude: r = 0.42 for 400 m2 in the lowlands, r > 0.3 for other stratification groups except the mountain region. Again, this relationship became tighter with the increase in the area of sample plots. However, more spacious plots can contain more trees, thus making the CHD factor more reliable on bigger plots. Trees horizontal distribution (THD) had no effect on the estimation error for deciduous stands (r < 0.1) and little influence on coniferous sites (r up to −0.28), indicating that the clustered horizontal spatial distribution of trees had some effect on GSV estimation errors. Inversely, terrain-related factors such as slope and rugged index (TRI) had no influence on the estimation error rates for all strata, with the exception of deciduous sites, where there was a slight positive correlation (r up to 0.27).

4. Discussion

The results obtained corresponded to the expectations arising from the logical assumptions. In general, the larger the position shift, the higher the RMSE values (Table 4). Conversely, the smaller the plot area, the greater the error. If estimates for small areas/individual stands are of interest, the positioning accuracy of the plots should not be worse than 3–4 m, as the RMSE acceleration appears to increase significantly near this threshold (Table 5). Within this buffer, the error values only change by about 1–3 pp. As McGaughey et al. [46] have shown, the mentioned accuracy can be achieved even with a GNSS receiver of the mapping class under a dense forest canopy within 15 min of static occupation.
As far as the area of the plots is concerned, the establishment of plots with an area of less than 400 m2 may, according to the available results, entail the risk of increased random error. On the other hand, according to Hernández-Stefanoni et al. [15], the influence of co-registration error could be further reduced by increasing the size of sample plots. According to that study, the size of the plot strongly attenuates the influence of co-registration error between ground and ALS data. They claim that in structurally less complex stands, plots of 1000 m2 can provide robust ALS estimates of aboveground biomass, even with a shift of up to 10 m. From the perspective of Polish forest conditions, where most stands are coniferous, single-layered and even-aged, this information appears to be of high practical value. Moreover, as shown in Table 4, further improvement of RMSE due to plots bigger than 500 m2 seems plausible. On the other hand, the transition towards increased biodiversity has been a continuing trend in European forestry in recent decades, so the need for accurate positioning/co-registration seems to become relevant. However, the results of this study do not necessarily support this premise. As shown in Table 5, even in stands with mixed species groups, the RMSE does not increase by more than 2 pp at a positioning accuracy of 4 m.
An interesting insight might be drawn by tracing the relative insensitivity of the systematic error to positioning displacement within the majority of the analysed scenarios (Table 6). Therefore, the issue concerning accurate co-registration of ALS-ground data does not seem to be relevant when the determination of population means and totals are the main objective of the inventory. Nevertheless, Hawryło et al. [47] obtained nearly the same results for coniferous (Scots pine) stands, i.e., RMSE = 24.2%, BIAS = −2.2%, at a single stand level, even with positioning uncertainty ranging from several to 15 m. Again, if the population mean and totals are the sole aim of the inventory, the application of ALS can be omitted—this is, however, rarely the case. Moreover, as has been shown in [41,42,48], the inclusion of RS data aids in reducing sample size and may help to improve estimates for Pinus sylvestris stands in mountainous regions where, as shown in Table 4 and Table 5, accurate positioning appears to be more important.
Polish experience with ALS-based two-phase forest inventory shows GSV estimation error rates between 16 and 25% RMSE [47,49]. In other countries, reported errors in ABA-ALS forest stock inventories range between 10 and 30% RMSE [50]. In the US National Forest Inventory (NFI), an error level of ±5% is required for total values [51], and for ALS-based local forest inventories, deviations of more than 10% can be flagged for more detailed analysis [52]. Comparing the reported error values with the results obtained in this study (Table 4), it seems possible to achieve similar performance levels without sub-metre positioning accuracy for most site types, provided that the sample is stratified. Fitting separate RF models contributed to an overall reduction in RMSE for lowland conifer sites (which are the most common in Poland). A stratified sampling design does not appear to require fixed positioning accuracy, as some groups were less susceptible to the drift effect than others. Depending on the proportion of different stand types in surveyed forest districts, this could allow further economic optimisation of inventory campaigns. Similar findings were emphasised by Hernández-Stefanoni et al. [15]. Nevertheless, in order to further validate these results, a post-stratification test would be required, which, however, was beyond the scope of this study. Although some promising findings resulting from the application of post-stratification can be found [53,54,55,56].
Næsset et al. [57], Næsset [7] and Andersen et al. [8] reported a maximum GNSS positioning error of 1.29–2.21 m when using similar GNSS techniques under forest canopy. Adding this initial positioning error and the artificially induced drift effect, which together cause the RMSE increase shown, one could assume that further improvement of GNSS positioning (and/or co-registration) would allow even lower error rates in GSV estimation. Such an assumption, based on the results of this study, could be an interesting case for further investigation, especially with regard to precision forestry, where accurate determination of forest attributes at different spatial scales is required [58]. Nevertheless, such an analysis is subject to a certain degree of uncertainty. We would like to emphasise that in Table 4, Table 5 and Table 6, the maximum errors, i.e., the results from the worst simulation per scenario, were given. Therefore, the worsening drift effect was even smaller in most cases. On the other hand, 30 iterations may not be sufficient to detect extremes in the variance of the specific stand-internal conditions. This limitation is all the more serious, the larger the position shift is since, with a constant number of simulations, less overlap is covered by the simulated plot positions, which in turn contributes to increased variation in ALS metrics values at larger displacements.
As already mentioned, the great advantage of the ALS inventory lies in the possibility of estimating forest attributes for small areas that were not sampled during the ground measurements. Therefore, it is of utmost importance to track the changes in GSV estimation caused by errors in positioning and co-registration at the sample plot level (Figure 1). The analysed effect looks nearly the same for all stratification groups. The only practically visible differences are found in the initial areas of the diagrams, where the error thresholds are lowest. We can observe that about half of the plots did not suffer from poor co-registration, i.e., more than a 10% difference in GSV estimation, regardless of the stratification group. However, this promising result may depend on the presence or absence of outliers, which can have a significant impact on inferences and statistical conclusions based on ALS-based forest inventories [59]. In this study, however, the authors did not perform an analysis regarding the treatment of outliers. Therefore, one can expect even more favourable results once careful data examination is conducted.
It is also worth considering that the drift/displacement effect is initially caused by positioning errors, but the final match of ground and RS layers depends on the co-registration efforts. Moreover, the potential of RS techniques other than ALS (e.g., terrestrial laser scanning [60]) combined with the development of sophisticated algorithms for co-registration (e.g., TreeNet) enables high spatial linking performance using mapping-grade GNSS receivers [61] or oven multi-constellation dual-frequency smartphones [45]. As a curiosity, Ogundipe et al. [62] reported a horizontal GNSS positioning accuracy of 3 m using a consumer-grade tablet under the forest canopy. According to the results presented in our study, this level of positioning ensured that the difference in GSV estimation relative to the very accurate positioning, i.e., the original position of the plots, was within 10% GSV difference for 70% of the sample plots and within 5% GSV difference for 50% of the plots.
In this study, a certain dependence of the GSV estimation error on the spatial distribution of the trees was found due to the shift in co-registration (Table 7). In general, more regularly spaced trees of a uniform canopy mitigate the effect of co-registration error on GSV estimates to a limited extent (r up to about 0.4), with a slight variation between stratification groups and with a gentle upward trend along the increment of the plot area. Lee et al. [63] confirmed that forest attributes related to the spatial location of trees degrade the horizontal GNSS positioning accuracy more than the variables describing their size. Brach et al. [11], on the other hand, found that the volume of trees (which is somewhat related to their size) is the most important factor in this regard. However, it is plausible that less precise positioning could be compensated by denser and more homogeneous stands.
The authors did not detect any substantial influence of terrain-related factors for all stratification groups, with the exception of hardwood stands and small plots with mixed species groups, where the correlation reached up to r ≈ 0.3. This can be explained by the fact that the DTM could be somewhat less accurate under broadleaved stands due to the denser canopies and thus weaker potential of signal ground penetration capabilities of the ALS approach. This result could be an indication of the importance of a robust DTM. It is likely that sparser LiDAR ground points resulted in a less accurate interpolation of the terrain. This is another premise that favours leaf-off ALS campaigns over those with foliage when accurate GSV estimation and terrain reconstructions are of importance [64,65].

5. Conclusions

The following conclusions can be drawn from this study. A sub-metre co-registration accuracy is important only for sample plots located in deciduous stands. For other stratification groups investigated, plots localization error of up to 4 m only marginally affect the performance of GSV estimation in the two-phase ALS forest inventory. We recommend that 500 m2 sample plots be established (at least 400 m2), as smaller units can significantly increase errors in GSV estimation. Further investigation is required as the trends observed suggest that estimation performance may increase when plot areas extend beyond the analysed area. Accurate DTM is important to exclude the negative influence of terrain factors, especially for foliage-covered sites. Weaker GNSS positioning capabilities under dense forest canopy could be compensated by the homogeneous structure of the sites. The differences in the results between certain stratification groups and error types make it possible to optimise the GNSS ground measurements according to the required efficiency and the objectives of the respective forest inventory.

Author Contributions

Conceptualisation, M.L. and K.S.; methodology, M.L.; software, M.L.; validation, M.L., K.S. and K.M.; formal analysis, M.L.; investigation, M.L.; resources, K.S.; data curation, M.L. and K.M.; writing—original draft preparation, M.L. and K.M.; writing—review and editing, KS.; visualisation, M.L.; supervision, K.S.; project administration: K.S.; funding acquisition, K.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by (1) the National Centre for Research and Development in Poland under the BIOSTRATEG programme (grant agreement number BIOSTRATEG1/267755/4/NCBR/2015), project REMBIOFOR ‘Remote sensing-based assessment of woody biomass and carbon storage in forests’, and (2) the State Forests National Forest Holding, project no. EO.271.2.12.2019 ‘Extension of the forest management inventory method using the results of the REMBIOFOR project’ (int. no. 500463).

Data Availability Statement

Data used in the research will be available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. ALS Metrics Used for Random Forest Based GSV Estimation

ALS MetricImportance [%IncMSE] *Definition
log_zq70.23545Natural logarithm of the 70th height quantile from the last echoes
zq70.23299The 70th height quantile from the last echoes
zmean.12494Mean height return from the first echoes
zmean.32322Mean height return from the all echoes above 2 m over the ground
log_zmean.32045Natural logarithm of mean height return from all echoes above 2 m over the ground
log_zq70.31742Natural logarithm of the 70th height quantile from all echoes above 2 m over the ground
log_zmean.21607Natural logarithm of mean height return from the last echoes
zq70.31599The 70th height quantile from all echoes above 2 m over the ground
zmean.21498Mean height return from the last echoes
zsd.2877Standard deviation of height returns from last echoes
zq25.3535The 25th height quantile from all echoes above 2 m over the ground
log_zq25.3513Natural logarithm of the 25th height quantile from all echoes above 2 m over the ground
log_iskew.2448Natural logarithm of the skewness of points’ intensity distribution from the last echoes
iskew.2443The skewness of points’ intensity distribution from the last echoes
log_zq20.3373Natural logarithm of the 20th height quantile from all echoes above 2 m over the ground
log_zkurt.1361Natural logarithm of the kurtosis of points’ height distribution from the first echoes
log_p3th.3356Natural logarithm of percentage of 3rd echoes from all echoes above 2 m over the ground
log_X.7341The ratio between the number of points above 3rd height threshold (as described in [26,27]), to all echoes above 2 m
pzabove2335Percentage of all returns above 2 m from all echoes
log_zq10.1316Natural logarithm of the 20th height quantile from the first echoes above 2 m over the ground
p2_zkurt.1311Power (square) transformation of the kurtosis of points’ height distribution from the first echoes
zkurt.1302Kurtosis of points’ height distribution from the first echoes
p2_zq10.1288Power (square) transformation of the 20th height quantile from the first echoes above 2 m over the ground
log_zsd.4288Natural logarithm of the standard deviation of height returns from first echoes above 2 m
zq20.3287The 20th height quantile from all echoes above 2 m over the ground
pzabovezmean.1270Percentage of returns above zmean from the first echoes
log_zq15239Natural logarithm of the 15th height quantile from all echoes above 2 m over the ground
zq10.123610th height quantile from first echoes above 2 m over the ground
log_pzabove2.1232Natural logarithm of percentage of returns above 2 m from the first echoes
log_zsd.5231Natural logarithm of the standard deviation of height returns from last echoes above 2 m
p2_zq15230Power (square) transformation of the 15th height quantile from all echoes
zq20.222520th height quantile from last echoes
log_zq25.2223Natural logarithm of 25th height quantile from last echoes
log_pzabovezmean.2221Natural logarithm of percentage of returns above zmean from the last echoes
zq1521415th height quantile from all echoes
pzabovezmean.5208Percentage of returns above zmean from last echoes above 2 m
pzabovezmean.4203Percentage of returns above zmean from first echoes above 2 m
pzabove2.1203Percentage of all returns above 2 m from first echoes
zsd.3195Standard deviation of height returns from all echoes above 2 m
log_pzabovezmean.3188Natural logarithm of percentage of returns above zmean from all echoes above 2 m
zq25.2179The 25th height quantile from last echoes
log_zq20.2177Natural logarithm of the 20th height quantile from last echoes
zq5.1176The 5th height quantile from first echoes
p2_zq5.1154Power (square) transformation of the 5th height quantile from first echoes
log_zkurt.5143Natural logarithm of kurtosis of points’ height distribution from the last echoes above 2 m
log_zkurt.4136Natural logarithm of kurtosis of points’ height distribution from first echoes above 2 m
pzabovezmean129Percentage of returns above zmean from all echoes
log_zq5.1114Natural logarithm of the 5th height quantile from first echoes
log_p2th.3109Natural logarithm of percentage of 2nd echoes from all echoes above 2 m over the ground
log_iskew.365Natural logarithm of the skewness of points’ intensity distribution from all echoes above 2 m
iskew.342Skewness of points’ intensity distribution from all echoes above 2 m

References

  1. Hou, Z.; Xu, Q.; Tokola, T. Use of ALS, Airborne CIR and ALOS AVNIR-2 Data for Estimating Tropical Forest Attributes in Lao PDR. ISPRS J. Photogramm. Remote Sens. 2011, 66, 776–786. [Google Scholar] [CrossRef]
  2. Köhl, M.; Magnussen, S.; Marchetti, M. Sampling Methods, Remote Sensing and GIS Multiresource Forest Inventory; Springer: Berlin/Heidelberg, Germany, 2010; ISBN 978-3-642-06898-0. [Google Scholar]
  3. Bakuła, M.; Oszczak, S.; Pelc-Mieczkowska, R. Performance of RTK Positioning in Forest Conditions: Case Study. J. Surv. Eng. 2009, 135, 125–130. [Google Scholar] [CrossRef]
  4. Grala, N.; Brach, M. Analysis of GNSS Receiver Accuracy in the Forest Environment. Ann. Geomat. 2009, 7, 41–45. [Google Scholar]
  5. Valbuena, R.; Mauro, F.; Rodriguez-Solano, R.; Manzanera, J.A. Accuracy and Precision of GPS Receivers under Forest Canopies in a Mountainous Environment. Span. J. Agric. Res. 2010, 8, 1047–1057. [Google Scholar] [CrossRef]
  6. Brach, M. Analiza dokładności wyznaczania współrzędnych wybranymi odbiornikami GNSS w środowisku leśnym. Sylwan 2012, 156, 47–56. [Google Scholar]
  7. Næsset, E. Effects of Differential Single- and Dual-Frequency GPS and GLONASS Observations on Point Accuracy under Forest Canopies. Photogramm. Eng. Remote Sens. 2001, 67, 1021–1026. [Google Scholar]
  8. Andersen, H.-E.; Clarkin, T.; Winterberger, K.; Strunk, J. An Accuracy Assessment of Positions Obtained Using Survey- and Recreational-Grade Global Positioning System Receivers across a Range of Forest Conditions within the Tanana Valley of Interior Alaska. West. J. Appl. For. 2009, 24, 128–136. [Google Scholar] [CrossRef]
  9. Kaartinen, H.; Hyyppä, J.; Vastaranta, M.; Kukko, A.; Jaakkola, A.; Yu, X.; Pyörälä, J.; Liang, X.; Liu, J.; Wang, Y.; et al. Accuracy of Kinematic Positioning Using Global Satellite Navigation Systems under Forest Canopies. Forests 2015, 6, 3218–3236. [Google Scholar] [CrossRef]
  10. Hussain, A.; Ahmed, A.; Magsi, H.; Tiwari, R. Adaptive GNSS Receiver Design for Highly Dynamic Multipath Environments. IEEE Access 2020, 8, 172481–172497. [Google Scholar] [CrossRef]
  11. Brach, M.; Stereńczak, K.; Bolibok, L.; Kwaśny, Ł.; Krok, G.; Laszkowski, M. Impacts of Forest Spatial Structure on Variation of the Multipath Phenomenon of Navigation Satellite Signals. Folia For. Pol. 2019, 61, 3–21. [Google Scholar] [CrossRef]
  12. Feng, T.; Chen, S.; Feng, Z.; Shen, C.; Tian, Y. Effects of Canopy and Multi-Epoch Observations on Single-Point Positioning Errors of a GNSS in Coniferous and Broadleaved Forests. Remote Sens. 2021, 13, 2325. [Google Scholar] [CrossRef]
  13. Mauro, F.; Valbuena, R.; Manzanera, J.A.; García-Abril, A. Influence of Global Navigation Satellite System Errors in Positioning Inventory Plots for Tree-Height Distribution studiesThis Article Is One of a Selection of Papers from Extending Forest Inventory and Monitoring over Space and Time. Can. J. For. Res. 2011, 41, 11–23. [Google Scholar] [CrossRef]
  14. Janssen, S.; Pretzsch, H.; Bürgi, A.; Ramstein, L.; Gallus Bont, L. Improving the Accuracy of Timber Volume and Basal Area Prediction in Heterogeneously Structured and Mixed Forests by Automated Co-Registration of Forest Inventory Plots and Remote Sensing Data. For. Ecol. Manag. 2023, 532, 120795. [Google Scholar] [CrossRef]
  15. Hernández-Stefanoni, J.L.; Reyes-Palomeque, G.; Castillo-Santiago, M.Á.; George-Chacón, S.P.; Huechacona-Ruiz, A.H.; Tun-Dzul, F.; Rondon-Rivera, D.; Dupuy, J.M. Effects of Sample Plot Size and GPS Location Errors on Aboveground Biomass Estimates from LiDAR in Tropical Dry Forests. Remote Sens. 2018, 10, 1586. [Google Scholar] [CrossRef]
  16. Frazer, G.W.; Magnussen, S.; Wulder, M.A.; Niemann, K.O. Simulated Impact of Sample Plot Size and Co-Registration Error on the Accuracy and Uncertainty of LiDAR-Derived Estimates of Forest Stand Biomass. Remote Sens. Environ. 2011, 115, 636–649. [Google Scholar] [CrossRef]
  17. Gobakken, T.; Næsset, E. Assessing Effects of Positioning Errors and Sample Plot Size on Biophysical Stand Properties Derived from Airborne Laser Scanner Data. Can. J. For. Res. 2009, 39, 1036–1052. [Google Scholar] [CrossRef]
  18. Bruchwald, A.; Rymer-Dudzinska, T.; Dudek, A.; Michalak, K.; Wroblewski, L.; Zasada, M. Wzory Empiryczne Do Okreslania Wysokosci i Piersnicowej Liczby Ksztaltu Grubizny. Sylwan 2000, 144, 5–13. [Google Scholar]
  19. Tonolli, S.; Dalponte, M.; Vescovo, L.; Rodeghiero, M.; Bruzzone, L.; Gianelle, D. Mapping and Modeling Forest Tree Volume Using Forest Inventory and Airborne Laser Scanning. Eur. J. Forest Res. 2011, 130, 569–577. [Google Scholar] [CrossRef]
  20. Mourelatou, A. Environmental Indicator Report 2017: In Support to the Monitoring of the Seventh Environment Action Programme; Publications Office: Luxembourg, 2017; ISBN 978-92-9213-926-1. [Google Scholar]
  21. Lee, J.; Phua, M. Estimation of Stand Volume of Conifer Forest: A Bayesian Approach Based on Satellite-based Estimate and Forest Register Data. For. Sci. Technol. 2010, 6, 7–17. [Google Scholar] [CrossRef]
  22. Wilson, M.F.J.; O’Connell, B.; Brown, C.; Guinan, J.C.; Grehan, A.J. Multiscale Terrain Analysis of Multibeam Bathymetry Data for Habitat Mapping on the Continental Slope. Mar. Geod. 2007, 30, 3–35. [Google Scholar] [CrossRef]
  23. Brach, M. Rapid Static Positioning Using a Four System GNSS Receivers in the Forest Environment. Forests 2022, 13, 45. [Google Scholar] [CrossRef]
  24. Stereńczak, K.; Lisańczuk, M.; Parkitna, K.; Mitelsztedt, K.; Mroczek, P.; Miścicki, S. The Influence of Number and Size of Sample Plots on Modelling Growing Stock Volume Based on Airborne Laser Scanning. Drewno 2018, 61, 5–22. [Google Scholar] [CrossRef]
  25. Lisańczuk, M.; Mitelsztedt, K.; Parkitna, K.; Krok, G.; Stereńczak, K.; Wysocka-Fijorek, E.; Miścicki, S. Influence of Sampling Intensity on Performance of Two-Phase Forest Inventory Using Airborne Laser Scanning. For. Ecosyst. 2020, 7, 65. [Google Scholar] [CrossRef]
  26. Horn, B.K.P. Horn Hill Shading and the Reflectance Map. Proc. IEEE 1981, 69, 14–47. [Google Scholar] [CrossRef]
  27. Hopkins, B.; Skellam, J.G. A New Method for Determining the Type of Distribution of Plant Individuals. Ann. Bot. 1954, 18, 213–227. [Google Scholar] [CrossRef]
  28. Roussel, J.-R.; Auty, D.; Coops, N.C.; Tompalski, P.; Goodbody, T.R.H.; Meador, A.S.; Bourdon, J.-F.; De Boissieu, F.; Achim, A. lidR: An R Package for Analysis of Airborne Laser Scanning (ALS) Data. Remote Sens. Environ. 2020, 251, 112061. [Google Scholar] [CrossRef]
  29. Roussel, J.-R.; Auty, D. lidR: Airborne LiDAR Data Manipulation and Visualization for Forestry Applications 2016, Version 4.1.1; R Foundation: Vienna, Austria, 2016. [Google Scholar]
  30. Woods, M.; Lim, K.; Treitz, P. Predicting Forest Stand Variables from LiDAR Data in the Great Lakes St. Lawrence Forest of Ontario. For. Chron. 2008, 84, 827–839. [Google Scholar] [CrossRef]
  31. Lucas, C.; Bouten, W.; Koma, Z.; Kissling, W.D.; Seijmonsbergen, A.C. Identification of Linear Vegetation Elements in a Rural Landscape Using LiDAR Point Clouds. Remote Sens. 2019, 11, 292. [Google Scholar] [CrossRef]
  32. Næsset, E. Predicting Forest Stand Characteristics with Airborne Scanning Laser Using a Practical Two-Stage Procedure and Field Data. Remote Sens. Environ. 2002, 80, 88–99. [Google Scholar] [CrossRef]
  33. Gobakken, T.; Næsset, E.; Nelson, R.; Bollandsås, O.M.; Gregoire, T.G.; Ståhl, G.; Holm, S.; Ørka, H.O.; Astrup, R. Estimating Biomass in Hedmark County, Norway Using National Forest Inventory Field Plots and Airborne Laser Scanning. Remote Sens. Environ. 2012, 123, 443–456. [Google Scholar] [CrossRef]
  34. Probst, P.; Boulesteix, A.-L. To Tune or Not to Tune the Number of Trees in Random Forest? arXiv 2017, arXiv:1705.05654. [Google Scholar]
  35. Hogg, R.V.; Tanis, E.A.; Zimmerman, D.L. Probability and Statistical Inference, 9th ed.; Pearson: Boston, MA, USA, 2015; ISBN 978-0-321-92327-1. [Google Scholar]
  36. Edwards, A.W.F.R.A. Fischer, Statistical Methods for Research Workers, First Edition (1925). In Landmark Writings in Western Mathematics 1640–1940; Elsevier: Amsterdam, The Netherlands, 2005; pp. 856–870. ISBN 978-0-444-50871-3. [Google Scholar]
  37. Mascha, E.J.; Vetter, T.R. Significance, Errors, Power, and Sample Size: The Blocking and Tackling of Statistics. Anesth. Analg. 2018, 126, 691–698. [Google Scholar] [CrossRef] [PubMed]
  38. Fraenkel, J.R.; Wallen, N.E. How to Design and Evaluate Research in Education, 7th ed.; McGraw-Hill: New York, NY, USA, 2009; ISBN 978-0-07-352596-9. [Google Scholar]
  39. Hijmans, R.J. Terra: Spatial Data Analysis 2020, Version 1.7-78; R Foundation: Vienna, Austria, 2020. [Google Scholar]
  40. Baddeley, A.; Rubak, E.; Turner, R. Spatial Point Patterns: Methodology and Applications with R; Champan & Hall/CRC Interdisciplinary Statistics Series; CRC Press/Taylor & Francis Group: Boca Raton, FL, USA; London, UK; New York, NY, USA, 2016; ISBN 978-1-4822-1020-0. [Google Scholar]
  41. Baddeley, A.; Turner, R. Spatstat: An R Package for Analyzing Spatial Point Patterns. J. Stat. Soft. 2005, 12, 1–42. [Google Scholar] [CrossRef]
  42. Liaw, A.; Wiener, M. Classification and Regression by RandomForest. R News. 2001, 2/3, 18–22. Available online: https://journal.r-project.org/articles/RN-2002-022/RN-2002-022.pdf (accessed on 1 December 2024).
  43. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  44. Rabosky, D.L.; Grundler, M.; Anderson, C.; Title, P.; Shi, J.J.; Brown, J.W.; Huang, H.; Larson, J.G. BAMMtools: An R package for the analysis of evolutionary dynamics on phylogenetic trees. Methods Ecol. Evol. 2014, 5, 701–707. [Google Scholar] [CrossRef]
  45. McRoberts, R.E.; Tomppo, E.O.; Næsset, E. Advances and Emerging Issues in National Forest Inventories. Scand. J. For. Res. 2010, 25, 368–381. [Google Scholar] [CrossRef]
  46. McGaughey, R.J.; Ahmed, K.; Andersen, H.-E.; Reutebuch, S.E. Effect of Occupation Time on the Horizontal Accuracy of a Mapping-Grade GNSS Receiver under Dense Forest Canopy. Photogramm. Eng. Remote Sens. 2017, 83, 861–868. [Google Scholar] [CrossRef]
  47. Hawryło, P.; Francini, S.; Chirici, G.; Giannetti, F.; Parkitna, K.; Krok, G.; Mitelsztedt, K.; Lisańczuk, M.; Stereńczak, K.; Ciesielski, M.; et al. The Use of Remotely Sensed Data and Polish NFI Plots for Prediction of Growing Stock Volume Using Different Predictive Methods. Remote Sens. 2020, 12, 3331. [Google Scholar] [CrossRef]
  48. Pascual, C.; Mauro, F.; García-Abril, A.; Manzanera, J.A. Applications of ALS (Airborne Laser Scanning) Data to Forest Inventory. Experiences with Pine Stands from Mountainous Environments in Spain. IOP Conf. Ser. Earth Environ. Sci. 2019, 226, 012001. [Google Scholar] [CrossRef]
  49. Parkitna, K.; Krok, G.; Miścicki, S.; Ukalski, K.; Lisańczuk, M.; Mitelsztedt, K.; Magnussen, S.; Markiewicz, A.; Stereńczak, K. Modelling Growing Stock Volume of Forest Stands with Various ALS Area-Based Approaches. For. Int. J. For. Res. 2021, 94, 630–650. [Google Scholar] [CrossRef]
  50. Goodbody, T.R.H.; Coops, N.C.; White, J.C. Digital Aerial Photogrammetry for Updating Area-Based Forest Inventories: A Review of Opportunities, Challenges, and Future Directions. Curr. For. Rep. 2019, 5, 55–75. [Google Scholar] [CrossRef]
  51. Cao, Q.; Dettmann, G.T.; Radtke, P.J.; Coulston, J.W.; Derwin, J.; Thomas, V.A.; Burkhart, H.E.; Wynne, R.H. Increased Precision in County-Level Volume Estimates in the United States National Forest Inventory with Area-Level Small Area Estimation. Front. For. Glob. Change 2022, 5, 769917. [Google Scholar] [CrossRef]
  52. Laes, D.; Reutebuch, S.E.; McGaughey, R.J.; Mitchell, B. Guidelines to Estimate Forest Inventory Parameters; US Forest Service: Salt Lake City, UT, USA, 2011.
  53. Chen, M.; Qiu, X.; Zeng, W.; Peng, D. Combining Sample Plot Stratification and Machine Learning Algorithms to Improve Forest Aboveground Carbon Density Estimation in Northeast China Using Airborne LiDAR Data. Remote Sens. 2022, 14, 1477. [Google Scholar] [CrossRef]
  54. Hauglin, M.; Rahlf, J.; Schumacher, J.; Astrup, R.; Breidenbach, J. Large Scale Mapping of Forest Attributes Using Heterogeneous Sets of Airborne Laser Scanning and National Forest Inventory Data. For. Ecosyst. 2021, 8, 65. [Google Scholar] [CrossRef]
  55. Banaś, J.; Drozd, M.; Zięba, S.; Bujoczek, L. Improving Effectiveness of Forest Inventory by Stratified Sampling. Sylwan 2017, 161, 804–811. [Google Scholar]
  56. Haakana, H.; Heikkinen, J.; Katila, M.; Kangas, A. Efficiency of Post-Stratification for a Large-Scale Forest Inventory—Case Finnish NFI. Ann. For. Sci. 2019, 76, 9. [Google Scholar] [CrossRef]
  57. Næsset, E.; Bjerke, T.; Bvstedal, O.; Ryan, L. Contributions of Differential GPS and GLONASS Observations to Point Accuracy under Forest Canopies. Photogramm. Eng. Remote Sens. 2000, 66, 403–407. [Google Scholar]
  58. Holopainen, M.; Vastaranta, M.; Hyyppä, J. Outlook for the Next Generation’s Precision Forestry in Finland. Forests 2014, 5, 1682–1694. [Google Scholar] [CrossRef]
  59. Knott, J.A.; Liknes, G.C.; Giebink, C.L.; Oh, S.; Domke, G.M.; McRoberts, R.E.; Quirino, V.F.; Walters, B.F. Effects of Outliers on Remote Sensing-assisted Forest Biomass Estimation: A Case Study from the United States National Forest Inventory. Methods Ecol. Evol. 2023, 14, 1587–1602. [Google Scholar] [CrossRef]
  60. Krok, G.; Kraszewski, B.; Stereńczak, K. Zastosowanie Naziemnego Skanowania Laserowego w Inwentaryzacji Lasu—Przegląd Wybranych Zagadnień (Application of Terrestrial Laser Scanning in Forest Inventory—An Overview of Selected Issues). Leśne Pr. Badaw. 2020, 81, 175–194. [Google Scholar] [CrossRef]
  61. Abdi, O.; Uusitalo, J.; Pietarinen, J.; Lajunen, A. Evaluation of Forest Features Determining GNSS Positioning Accuracy of a Novel Low-Cost, Mobile RTK System Using LiDAR and TreeNet. Remote Sens. 2022, 14, 2856. [Google Scholar] [CrossRef]
  62. Ogundipe, O.; Ince, S.; Bonenberg, L. GNSS Positioning Under Forest Canopy. In Proceedings of the ENC-GNSS 2014, Rotterdam, The Netherlands, 15–17 April 2014. [Google Scholar]
  63. Lee, T.; Bettinger, P.; Merry, K.; Cieszewski, C. The Effects of Nearby Trees on the Positional Accuracy of GNSS Receivers in a Forest Environment. PLoS ONE 2023, 18, e0283090. [Google Scholar] [CrossRef] [PubMed]
  64. Davison, S.; Donoghue, D.N.M.; Galiatsatos, N. The Effect of Leaf-on and Leaf-off Forest Canopy Conditions on LiDAR Derived Estimations of Forest Structural Diversity. Int. J. Appl. Earth Obs. Geoinf. 2020, 92, 102160. [Google Scholar] [CrossRef]
  65. White, J.C.; Arnett, J.T.T.R.; Wulder, M.A.; Tompalski, P.; Coops, N.C. Evaluating the Impact of Leaf-on and Leaf-off Airborne Laser Scanning Data on the Estimation of Forest Inventory Attributes with the Area-Based Approach. Can. J. For. Res. 2015, 45, 1498–1513. [Google Scholar] [CrossRef]
Table 2. Descriptive statistics for stratification factors.
Table 2. Descriptive statistics for stratification factors.
StatisticsFACTORS—Continuous Data TypesStratification Groups
Slope (1)TRI (2)TD (3)THD (4)CHD (5)Land TypeSpecies Groups
minimal10.02200.061.1Lowlands:
496 plots (49.8%)
 
Mountains:
500 plots (50.2%)
Coniferous plots:
460 (46.2%)
 
Mixed species groups:
304 (30.5%)
 
Deciduous plots:
232 (23.3%)
maximal30.60.45354015.015.4
mean9.00.136761.266.8
SD6.10.093810.752.4
CV68%69%56%59%35%
(1) In degrees, computed according to Horn algorithm [26]. (2) TRI—Terrain Rugged Index: mean of the absolute differences between the elevation value of 1 m DTM cell and its 8 surrounding cells. Averaged at plot level [22]. (3) TD—Trees density (trees/ha). (4) THD—Trees Horizontal Distribution—index according to Hopkins–Skellam Test [27] using 500 Monte Carlo simulations. 0 ≈ clustered, 1 ≈ random, 1 >≈ regular. (5) CHD—Canopy Height Diversity: standard deviation of trees height on sample plots (m).
Table 3. ALS metrics: ten most important predictors **.
Table 3. ALS metrics: ten most important predictors **.
VariableRF Importance [%IncMSE] *Definition
log_zq70.23545Natural logarithm of the 70th height quantile from the last echoes
zq70.23299The 70th height quantile from the last echoes
zmean.12494Mean height return from the first echoes
zmean.32322Mean height return from all echoes above 2 m over the ground
log_zmean.32045Natural logarithm of mean height return from all echoes above 2 m over the ground
log_zq70.31742Natural logarithm of the 70th height quantile from all echoes above 2 m over the ground
log_zmean.21607Natural logarithm of mean height return from the last echoes
zq70.31599The 70th height quantile from all echoes above 2 m over the ground
zmean.21498Mean height return from the last echoes
zsd.2877Standard deviation of height returns from last echoes
* Increase in mean squared error after variable removal. ** The full list of ALS metrics used is provided in Appendix A.
Table 4. RMSE (%) gradients in the course of positioning accuracy and plot size change.
Table 4. RMSE (%) gradients in the course of positioning accuracy and plot size change.
Non-Stratified Sample (N = 996) Lowlands (N = 496) Mountains (N = 500)
Plot Size (m2) Plot Size (m2) Plot Size (m2)
Shift [m]100200300400500 100200300400500 100200300400500
04737292623 3726211917 5846373129
14838302624 3827222018 6047373230
25038302725 3928222018 6048373231
35139312725 4129242119 6248383331
45341322726 4431242220 6350393332
55442332826 4631252320 6551403431
65543332927 4633262320 6552413433
75644343027 4733272521 6753423433
85745353028 4835282622 6754423634
95846363128 4936292622 6855433634
105946373129 5036302623 6955443635
Coniferous (N = 460) Deciduous (N = 232) Mixed (N = 304)
Plot Size (m2) Plot Size (m2) Plot Size (m2)
Shift [m]100200300400500 100200300400500 100200300400500
03928221917 5753383228 5136282223
14029222018 6154393332 5237292323
24230232118 6255403332 5338302424
34331252219 6456403433 5540302424
44632262220 6756403334 5640312525
54734282321 6957413534 5840322626
64936282421 7058433534 5843332726
75136302622 7159423535 6043342928
85138312723 7060433635 5943353027
95439322623 7360433735 5945363129
105440332724 7361443736 6145373130
error ranges (%):15–20 21–25 26–30 31–35 36–40 >40
Table 5. RMSE change (pp: percentage points) in relation to estimations based on original plots.
Table 5. RMSE change (pp: percentage points) in relation to estimations based on original plots.
Non-Stratified Sample (N = 996) Lowlands (N = 496) Mountains (N = 500)
Plot Size (m2) Plot Size (m2) Plot Size (m2)
Shift [m]100200300400500 100200300400500 100200300400500
02414620 199420 2917820
12514731 219420 3118931
22615732 2211531 3219932
32716732 2312631 3320942
42917842 2713742 35221043
53118953 2814853 36221153
631201053 2915963 37241354
733201174 30161084 38241364
833211275 31171185 38251475
935221375 32191285 39261575
1035231376 33191396 40261576
Coniferous (N = 460) Deciduous (N = 232) Mixed (N = 304)
Plot Size (m2) Plot Size (m2) Plot Size (m2)
Shift [m]100200300400500 100200300400500 100200300400500
02211520 29251040 2813600
12312531 32261153 3014700
22513631 34271254 3115811
32614842 36281254 3317822
42915952 38281256 3317822
530171164 40281275 35181043
631191174 41301476 36211154
734191395 43301476 37201265
834201396 42311586 37211275
936211596 44311586 36221386
10372315107 45321597 38231487
percentage points: 1 2 3 4 5 >5
Table 6. BIAS (%) gradients in the course of positioning accuracy and plot size change.
Table 6. BIAS (%) gradients in the course of positioning accuracy and plot size change.
Non-Stratified Sample (N = 996) Lowlands (N = 496) Mountains (N = 500)
Plot Size (m2) Plot Size (m2) Plot Size (m2)
Shift [m]100200300400500 100200300400500 100200300400500
046544 20100 57766
146554 41110 68766
257554 31110 68876
368655 41110 89877
478655 62110 710877
579655 62110 7111087
6710755 62111 8111087
7910755 72110 9111088
8911865 82111 9131187
91111855 82211 9131187
101011866 92211 8121088
Coniferous (N = 460) Deciduous (N = 232) Mixed (N = 304)
Plot Size (m2) Plot Size (m2) Plot Size (m2)
Shift [m]100200300400500 100200300400500 100200300400500
022001 60101 10100
132111 81211 21110
243111 81211 22110
353211 92311 22110
463221 93212 22110
575311 103312 33210
675321 124312 32210
7106421 123322 42200
8106522 144432 42210
9139522 174323 41210
10128622 144443 31210
error magnitude (%):0 1 2 3 4 ≥5
Table 7. Correlation between GSV estimation change (from 0 to 10 m shift) and the range of analysed factors.
Table 7. Correlation between GSV estimation change (from 0 to 10 m shift) and the range of analysed factors.
AllPlot Size (m2) ConiferousPlot Size (m2)
Factor100200300400500 Factor100200300400500
CHD0.120.220.260.280.31 CHD0.080.280.270.320.33
Slope0.060.100.080.060.09 Slope0.070.140.190.150.18
TD−0.26−0.33−0.35−0.29−0.32 TD−0.31−0.3−0.36−0.28−0.3
THD−0.16−0.18−0.17−0.15−0.17 THD−0.2−0.28−0.27−0.25−0.25
TRI0.070.100.080.070.10 TRI0.060.150.190.160.18
LowlandsPlot Size (m2) DeciduousPlot Size (m2)
Factor100200300400500 Factor100200300400500
CHD0.230.390.380.420.41 CHD0.240.180.170.330.31
Slope−0.03−0.010.00−0.03−0.03 Slope0.250.260.270.220.25
TD−0.32−0.37−0.38−0.4−0.4 TD−0.24−0.4−0.37−0.39−0.39
THD−0.21−0.25−0.27−0.26−0.28 THD0.06−0.02−0.02−0.040.03
TRI−0.02−0.010.00−0.03−0.03 TRI0.250.260.270.220.25
MountainsPlot Size (m2) MixPlot Size (m2)
Factor100200300400500 Factor100200300400500
CHD0.140.160.210.210.23 CHD0.070.210.380.260.35
Slope0.120.110.060.080.06 Slope0.280.190.050.090.10
TD−0.19−0.3−0.28−0.22−0.26 TD−0.25−0.3−0.37−0.37−0.43
THD−0.11−0.1−0.07−0.05−0.06 THD−0.21−0.23−0.25−0.18−0.22
TRI0.110.120.060.080.07 TRI0.270.180.060.090.10
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lisańczuk, M.; Mitelsztedt, K.; Stereńczak, K. The Influence of the Spatial Co-Registration Error on the Estimation of Growing Stock Volume Based on Airborne Laser Scanning Metrics. Remote Sens. 2024, 16, 4709. https://doi.org/10.3390/rs16244709

AMA Style

Lisańczuk M, Mitelsztedt K, Stereńczak K. The Influence of the Spatial Co-Registration Error on the Estimation of Growing Stock Volume Based on Airborne Laser Scanning Metrics. Remote Sensing. 2024; 16(24):4709. https://doi.org/10.3390/rs16244709

Chicago/Turabian Style

Lisańczuk, Marek, Krzysztof Mitelsztedt, and Krzysztof Stereńczak. 2024. "The Influence of the Spatial Co-Registration Error on the Estimation of Growing Stock Volume Based on Airborne Laser Scanning Metrics" Remote Sensing 16, no. 24: 4709. https://doi.org/10.3390/rs16244709

APA Style

Lisańczuk, M., Mitelsztedt, K., & Stereńczak, K. (2024). The Influence of the Spatial Co-Registration Error on the Estimation of Growing Stock Volume Based on Airborne Laser Scanning Metrics. Remote Sensing, 16(24), 4709. https://doi.org/10.3390/rs16244709

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop