Next Article in Journal
The Impact of Farmland Transfer on Urban–Rural Integration: Causal Inference Based on Double Machine Learning
Previous Article in Journal
Facilitating or Hindering? The Impact of Low-Carbon Pilot Policies on Socio-Ecological Resilience in Resource-Based Cities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Source Apportionment and Analysis of Potentially Toxic Element Sources in Agricultural Soils Based on the Positive Matrix Factorization and Geo-Detector Models

1
State Key Laboratory of Environmental Criteria and Risk Assessment, Chinese Research Academy of Environmental Sciences, Beijing 100012, China
2
Technical Centre for Soil, Agriculture and Rural Ecology and Environment, Ministry of Ecology and Environment, Beijing 100012, China
3
School of Earth Sciences and Resources, China University of Geosciences, Beijing 100083, China
4
Guangxi Institute of Geological Survey, Nanning 530023, China
5
School of Sciences, China University of Geosciences, Beijing 100083, China
6
Institute of Karst Geology, Chinese Academy of Geological Sciences, Guilin 541004, China
*
Authors to whom correspondence should be addressed.
Submission received: 14 December 2024 / Revised: 8 January 2025 / Accepted: 11 January 2025 / Published: 13 January 2025
(This article belongs to the Section Land, Soil and Water)

Abstract

:
The potentially toxic element pollution of agricultural soils has become a significant environmental threat to food safety and human health. Accurately identifying sources of potentially toxic element pollution is key to developing effective pollution prevention and control measures. In this study, regional potentially toxic element pollution of the soils in the Nanliujiang River Basin was analyzed using the positive matrix factorization (PMF) model and the geo-detector model. First, topsoil samples from the study area were collected to analyze eight potentially toxic elements in the soil, including As, Cd, Cr, Cu, Hg, Ni, Pb, and Zn. The PMF model was used to conduct source apportionment of the potentially toxic element data and identify the primary pollution sources and their contribution rates. Then, the geo-detector model was used to analyze the key factors affecting the spatial distribution of the potentially toxic elements and the influence of natural and human factors on the distribution of the potentially toxic elements. There are four potentially toxic element pollution sources of the agricultural soil in the study area: geological background, agricultural activities, industrial discharge, and river irrigation. The geological background contributed the most. The main factors affecting the spatial distribution of potentially toxic elements included agricultural activities, industrial discharge, and river irrigation. This integrated method can analyze the formation of potentially toxic element pollution in depth from the perspectives of source apportionment and spatial differentiation and provide a scientific basis and decision support for preventing and controlling potentially toxic element pollution in agricultural soils. This study provides a new method and scientific basis for identifying and preventing potentially toxic element pollution sources in agricultural soil and can guide the formulation of targeted soil pollution control measures.

1. Introduction

Potentially toxic elements (PTEs) in agricultural soil, particularly cadmium (Cd) and lead (Pb), pose significant threats to human health and the ecological environment due to their resistance to degradation and propensity for bioaccumulation in living organisms [1,2]. Not only do these potentially toxic elements degrade soil quality and reduce crop yield and quality [3], but they also enter the human body through the food chain, causing various diseases, including nervous system damage, kidney diseases, and cardiovascular diseases [1,4,5]. The pollution of soil by potentially toxic elements can lead to secondary contamination of water and the atmosphere, thereby exacerbating the ecological crisis [6,7]. Therefore, in-depth research on the source distribution of potentially toxic elements in soil is needed to ensure food safety and maintain ecological balance.
It is crucial to accurately identify sources of potentially toxic element pollution to develop effective prevention and control measures. Targeted measures can be taken to reduce or eliminate pollution sources once the source of potentially toxic elements has been traced. This way, the spread of potentially toxic element pollution can be effectively curbed. Potentially toxic element traceability in soil is challenging, and a series of different source apportionment methods have been developed. The main methods include the isotope tracer and multivariate statistical methods [8,9,10]. The isotope tracer method can be used to infer the origin and migration path of potentially toxic elements by measuring their composition and isotope proportions [9]. The multivariate statistical analysis method analyzes the correlation between the potentially toxic elements in the soil and their relationship with various environmental factors to identify possible pollution sources [8,10]. Although soil potentially toxic element source analysis methods can reveal the source and distribution of potentially toxic elements, these methods still have some limitations [8,9,10]. Although isotope tracers are very accurate, the analyses are complex and expensive, complicating their general application in large-scale studies. Sample quantity and weak interpretability result in inaccurate multivariate statistical analysis results.
The positive matrix factorization (PMF) and geo-detector models hold significant advantages for potentially toxic element tracing in soil. The PMF model is an effective factor analysis model that uses the content data of potentially toxic elements in the soil to determine the main pollution sources and their contribution rates through weight calculation and the least squares method. This method can manage complex multi-source pollution problems and is highly flexible and adaptable [11,12]. The geo-detector model uses geospatial information to detect and analyze the key factors affecting the spatial distribution of potentially toxic elements and to explore the influence of natural and human factors on the distribution of potentially toxic elements. This method intuitively reveals the spatial differentiation characteristics of potentially toxic element distribution and its driving mechanisms by calculating the contribution of the independent variables to the dependent variables [13,14,15].
The Nanliujiang River, located in the southeastern part of the Guangxi Zhuang Autonomous Region, is the largest river in Guangxi that empties into the sea. The region’s abundant water resources, the flat river basin, and the alluvial plain provide a convenient basis for regional industrial and agricultural development. However, potentially toxic element pollution associated with industrial development occurs in some areas of the Nanliujiang River Basin [16].
Due to the severity of potentially toxic element pollution in the Nanliujiang River Basin, this study aimed to determine the sources of pollution from potentially toxic elements in the soil by using the positive matrix factorization (PMF) and the geo-detector models. This study attempts to conduct an in-depth analysis of potentially toxic element pollution from source apportionment and spatial differentiation perspectives. This will provide a scientific basis and decision support for preventing and controlling potentially toxic element pollution in agricultural soils. This study will also provide a new method and scientific basis for identifying soil potentially toxic element pollution sources and preventing soil contamination and can guide authorities in the formulation of targeted soil pollution control measures.

2. Materials and Methods

2.1. Overview of the Study Area

The Nanliujiang River originates from the southern foot of the Darong Mountain at Beiliu City, Guangxi, and flows from north to south. It has a total length of 287 km and a basin area of 9320 km2 (Figure 1). The whole basin is located south of the Tropic of Cancer and is part of the typical subtropical monsoon climate and tropical marine climate, with an average temperature of 21.5–22.4 °C and an average annual rainfall of 1400–1760 mm [17].
The geological strata of the basin are exposed. Apart from the Quaternary system developed along the river system, the Permian and Triassic granites are widely exposed, covering about a third of the region’s total area. Second, the Neogene, Cretaceous, and Silurian clastic rocks are widely distributed from the east side of Yulin City to the middle reaches of Shahe Town and then to the lower reaches of Wujia Town. The area of exposed carbonate rock is small and mainly located in the northwestern part of Xingye County and the southern part of Beiliu City [18,19].
Figure 2 shows the land use type in the Nanliujiang River Basin. The total forest area is 4748 km2, accounting for 50.79%; the cultivated land area is 3005 km2 (32.15%); the area of the orchards and other gardens is 969 km2 (10.36%); the area of the rivers and lakes is 316 km2 (3.38%); and the area of the cities, towns, and villages is 297 km2 (3.18%). A small proportion of coastal beaches and other utilization types cover the rest of the land uses in the basin. Chemical, metallurgical, and other industrial factories that cause metal pollution and put pressure on the agricultural soil environment occur throughout the region.

2.2. Sampling and Analyses

The Guangxi Geological Survey Institute has carried out multi-objective geochemical surveys over 22,059 km2 of the Beibu Gulf region of Guangxi and collected 7327 topsoil samples [18,19]. This paper’s study area covers 2257 topsoil samples (0–20 cm) in the southern part of the Beibu Gulf Region survey area.
A grid system was used to determine the sampling localities and ensure the spatial uniformity of the sampling points. The usual sampling depth of topsoil samples is 0–20 cm, with a basic sampling density of 1 point/km2, and that of deep soil samples is usually 150–180 cm, with a basic sampling density of 1 point per 4 km2. The surface plant residues and other debris were first removed before collecting the topsoil (0–20 cm). To ensure uniformity, the soil was sampled vertically from the surface to a depth of 20 cm. To avoid surface pollution when collecting deep soils, a 10–50 cm long soil column was used to collect soil samples below the specified starting depth continuously. Each soil sample’s original weight had to be at least 1 kg.
The specific process for sampling and analysis is as follows:
(1) A 5.0 g powder sample was weighed and pressed under 35 tons of pressure. Eleven elements, including Cr, Pb, and Zn, were determined via X-ray fluorescence spectrometry (XRF).
(2) A 0.2000 g sample was weighed and then processed with HC1, HNO3, HF, and HClO4. Thereafter, the processed sample was transferred to a 25 mL colorimetric tube for constant volume measurement. An Inductively Coupled Plasma Optical Emission Spectrometer (ICP-OES, iCAP 7400 Radial, Thermo, Waltham, MA, USA) was used to determine 3 elements, including Cu and Ni. The solution indicated above was diluted 10 times, and the Cd content was determined using inductively coupled plasma mass spectrometry (ICP-MS, iCAP Q, Thermo, USA).
(3) A 0.5000 g sample was weighed and decomposed with aqua regia. The solution was then reduced using potassium borohydride. The Hg content was determined using the atomic fluorescence method (AFS). The solution was pre-reduced by reducing the masking agent (thiourea-ascorbic acid) and then reducing this with potassium borohydride. The arsenic content was determined using the atomic fluorescence method (AFS).
The samples were then analyzed according to the standard requirements of DZ/T0258-2014 [20], the national first-class soil geochemical reference materials GBW07401-GBW07408 and GBW07423-GBW07430 (GSS1-GSS12), the effective state reference material GBWO7412-GBW07417, and trace gold reference substances GBW07228, GBW07229, and GBW07243-GBW07248 and other series of samples for analysis data quality monitoring. The logarithmic deviation (∆lgC) or the average relative error (RE%) between the measured mean and the standard value of each measured element was used to test the accuracy of the method. To evaluate the precision of the analytical method, the relative standard deviation (RSD%) between the mean value and the standard value was calculated.

2.3. PMF Model Construction

The positive matrix factorization (PMF) model proposed by Paatero and Tapper and endorsed by the EPA is a robust factor analysis tool [11,21]. It decomposes data into source contribution and composition matrices, aiding in the identification of primary pollution sources and their respective contribution rates. The PMF model rationale assumes that a soil sample X can be represented as an n × m matrix, where “n” represents the number of samples and “m” represents the chemical composition [11,21]. The matrix X can be further decomposed into a pollution source contribution matrix and a pollution source composition spectral matrix. The method calculates the error of each chemical component in the pollutant by weight and then uses the least squares method to determine the main pollution source of the pollutant and its contribution rate. The least squares method is a multivariate factor analysis tool.
The basic principle of the PMF model is to factorize the original matrix X (n × m) and decompose it into a factor contribution matrix G (n × p), a factor component matrix F (p × m), and a residual matrix E (n × m) as follows (Equation (1)):
E n m = X n m k = 1 p G n p F p m
where Xnm represents m chemical components in n samples; p is the number of sources apportioned; Gnp is the source contribution matrix; and Fpm is the source contribution matrix. The elements in the matrix Gnp and Fpm are all positive values, meaning they are non-negative constraints, and all parameters in the calculation process outlined above are dimensionless. The PMF defines an objective function Q and minimizes the value of this objective function as follows (Equation (2)):
Q   E = i = 1 m j = 1 n E i j / σ i j 2
where Eij represents the residual of the ith chemical component in the jth sample, and σij represents the uncertainty of the ith chemical component in the jth sample. All parameters in the calculation outlined above are dimensionless.
When the concentration of the chemical component is less than or equal to the corresponding method detection limit (MDL), the uncertainty ( U ) is calculated according to Equation (3) as follows:
U = 5 / 6 × MDL
When the concentration of the chemical component is greater than the corresponding MDL, U is calculated according to Equation (4) as follows:
U = s × c 2 + 0.5 × MDL 2
where U is the uncertainty; s is the percentage of error; c is the measured potentially toxic element content, with a unit of mg·kg−1; and the MDL is the method detection limit.

2.4. Geographical Detector

A geographical detector (geo-detector) is a method developed by Wang et al. to explore the influence of geographical spatial zoning factors on disease risk [22]. The factor detector, a sub-module, was used in this study. The factor detector measures the contribution of independent variables to dependent variables by calculating the ratio of the sum of the variance of each independent variable and that of the dependent variables after classification. It is used to detect the spatial differentiation of dependent variables and the ability of each independent variable to explain the degree of influence of the dependent variables, which is measured using the q value as follows:
S S W = h = 1 l N h σ h 2                                 h = 1 , 2 , 3 , , l
S S T = σ h 2
q = 1 S S W S S T
where h = 1, 2, …, l represents the number of classifications of the independent variable X; Nh represents the number of units in the partition h; and σ h 2 represents the variance of the variable Y in partition h. The SSW and SST represent the sum of the variances for all classes of the independent variable X and the total variance within the region, respectively. The range of q is [0,1]. The greater the value of q, the greater the influence of the independent variable X on the dependent variable Y.

2.5. Analytical Statistical Method

Considering the overall situation of the study area and based on the overall analysis of the literature, 4 potential factors that may affect the distribution of soil PTE contents were considered. Of these, the natural factors include the distance from the riverbank and the parent material of the soil, and human factors include the number of factories and land use types. Microsoft Excel 2016 (Microsoft Corporation, Redmond, WA, USA) and SPSS22.0 (International Business Machines Corporation, Amonk, NY, USA) were used to perform basic statistical analyses; ArcGIS10.8 (Environmental Systems Research Institute Inc., RedLands, CA, USA) was used to analyze the spatial distribution characteristics and topographic parameters of the potentially toxic elements. The other graphics were plotted in Origin (2022) (OriginLab, Northampton, MA, USA) and CorelDRAW 2019 (Corel Corporation, Ottawa, ON, Canada).

3. Results and Discussion

3.1. Content Characteristics of Cd and Other Elements in Topsoil

Table 1 shows the descriptive statistics of the eight potentially toxic elements in the topsoil of Nanliujiang. The statistics include the minimum value, maximum value, mean value, median, quantile (25th and 75th), standard deviation, coefficient of variation, and MDL. The median values of the eight elements were 9.17, 0.13, 43.00, 14.40, 0.10, 11.20, 31.20, and 45.50 mg/kg, respectively. The large gap between the minimum, 75th quantile, and maximum indicates the presence of outliers (i.e., maximum values) in the dataset of the study area, resulting in a positively skewed distribution of all elements (Figure 3). The coefficients of variation of As, Cd, and Hg were relatively high (>100%), indicating that human factors interfered with their concentrations. The moderate spatial variability of Cr, Cu, Ni, Pb, and Zn indicates that these elements came from similar sources and are relatively less disturbed by human activities.
The Spearman rank correlation coefficient was used to quantitatively measure the strength of the relationship between variable pairs of eight potentially toxic elements (Figure 4). Histograms and normal curves indicating the normality of each chemical substance showed that not all the variables followed the standard normal distribution (K-S test < 0.05), with a long right-skewed distribution. Therefore, the Spearman rank correlation coefficient based on non-parametric statistics was selected for correlation analysis.
Generally, a significant strong correlation suggests that these variables have similar chemical properties. This means that they may be controlled by a common source. In addition to the strong positive correlation among Cr, Cu, Pb, and Zn, these PTEs belong to transition metals with similar chemical properties, mainly derived from natural sources such as bedrock and parent materials. Cd and As, however, correlated poorly with the other elements. Most studies have recorded significant agricultural disturbance effects on As and Cd [12], and the distinction between natural factors and human factors on these two elements must be further analyzed in this study.

3.2. Spatial Distribution of Components

The spatial distribution of eight potentially toxic elements in the topsoil of Nanliujiang is shown in Figure 5. The distribution of As and Cd differed from the other elements. The As and Cd concentrations increased near Yulin City (Figure 5a,b). The influence of natural factors and agricultural activities was observed in the regional distribution of manganese rocks and dry cultivated land in the Liujiang Formation. The Cr and Hg concentrations were similar in the Quaternary sediments, indicating that human factors had a significant influence on the topsoil in Nanliujiang.
Furthermore, the concentrations of Cr, copper (Cu), nickel (Ni), lead (Pb), and zinc (Zn) are higher in the north-central region and lower in the southeast. This geospatial division aligns with the boundary of the granites, as shown in Figure 5. The natural mineralization of these granites contributes to the enrichment of Ni, Pb, and Zn in the topsoil of Nanliujiang.

3.3. Source Apportionment of Potentially Toxic Elements in the PMF Model

The PMF model was used to further identify and distinguish the potentially toxic element sources in the topsoil of Nanliujiang. Figure 6 provides an overview of the basic factors, including the contribution percentages and species concentrations. Although the PMF model has advantages that other receptor models cannot match, the determination of the optimal number of factors is still challenging. By adjusting the number of input factors from three to six in this study, we found that when the number of factors was four, the Q value was at a minimum and stable, providing the optimal solution (Figure 6).
The contribution rate of As in Factor 1 was far higher than the other potentially toxic elements, reaching as high as 81.59%. This is because As exists in different chemical forms, such as As3+ and As5+, and because its solubility is much higher than other potentially toxic elements. Human factors, such as agricultural chemical inputs, often control the As concentration. Previous studies have indicated that long-term cultivation methods, which include the application of chemical fertilizers and pesticides, can cause the accumulation of potentially toxic elements, such as As, in soil [23,24]. These relatively high As loads and the high concentrations in cultivated land further validate the results of the PMF model [23,24]. The coefficient of variation of As (101.7%) was also higher than that of the other seven potentially toxic elements, indicating that it was more significantly affected by human activities. The high contribution area of Factor 1 was mainly concentrated in the central and eastern parts of the study area (Figure 7a), and the contribution scores gradually decreased along other directions, similar to the distribution of river channel density. Factor 1, therefore, might represent irrigation water sources characterized by river channel distribution or other agricultural inputs.
Factor 2 was dominated by Cr, Cu, and Ni, which contributed 75.56%, 65.10%, and 66.91%, respectively. In Table 1, the coefficient of variation of Cr was the lowest of the eight potentially toxic elements (53.59%), indicating that the soil Cr content in the study area was stable and less affected by human activities. Cr and Ni are mainly derived from soil parent materials [25,26,27], and human activities in the environment usually do not affect their distribution [25,26]. However, the high contribution region of Factor 2 was widely distributed in the northeast and central regions, like industrial factories’ distribution. Electroplating factories were found in these areas during the site survey. It was tentatively inferred that Factor 2, therefore, may represent a mixture of natural and industrial sources.
Pb and Zn dominated Factor 3, with contribution rates of 81.19% and 67.80%, respectively. Motor vehicle emissions are a major source of Pb in agricultural soils. Vehicle exhaust accounts for about two-thirds of global Pb emissions [28,29,30], while high Zn concentrations are associated with metal manufacturing, mineral aggregates, and mining activities [28,29,31]. However, the high contribution score area of Factor 3 was mainly located in the central and northern parts of the study area (Figure 7c). The contribution decreased in the surrounding areas and coincided with the medium-acid magmatic rocks. The coefficients of variation of Pb and Zn were both low in these areas, indicating that human pollution was relatively limited. Therefore, Factor 3 probably represents a mixture of natural sources and exhaust emissions, while the potential mineralization in the granite area further promotes the further enrichment of Pb and Zn.
Factor 4 was dominated by Cd and Hg, with contribution rates of 71.97% and 72.83%, respectively. The coefficients of variation of both Cd and Hg varied widely, indicating that they were influenced by human factors or other natural sources. Agricultural activities, such as the use of phosphate fertilizers, pesticides, organic manure, and sewage irrigation, often introduce large amounts of Cd into the environment [32,33]. Crop production in China is often accompanied by the large-scale use of fertilizers and pesticides, causing the accumulation of potentially toxic elements in the soil and related environmental problems [32,33,34]. Since Cd had the highest coefficient of variation in this farmland area, agricultural activities are probably the primary source of Cd. Factor 4 might, therefore, be closely related to agricultural activities.

3.4. Source Identification of Geographical Detectors

The spatial distribution of potentially toxic elements in soil in nature results from the joint action of multiple factors [34]. The interaction detector model in the geo-detector was used to conduct interaction analysis for the main drivers of potentially toxic elements in soil (α < 0.05) to further explore how these influencing factors work together (Figure 2 and Figure 8).
The interaction between the parent material and land use type contributed significantly to Factor 1, and its effect was more than the sum of the two effects alone. Specifically, the interaction between the parent material and the land use type explains 7.4% of the spatial distribution of potentially toxic element concentration in the study (i.e., q(SP∩LU) = 0.074). This is higher than the sum of the two factors alone (q(SP + LU) = 0.061). The combined effect of each pair of driving factors was greater than that of a single factor, which is a two-factor enhancement effect. Also, the interaction between other driving factors showed nonlinear enhancement. The distance from the riverbank was the strongest driver explaining the spatial distribution of Factor 1, and its contribution to the interaction with other major drivers was significantly enhanced, far exceeding the interaction between any two drivers.
Observations for Factor 2 were similar to those of Factor 1, in which the contribution of the pair interaction was also greater than the sum of the individual contributions of the two influencing factors. The combined effect of each pair of drivers was greater than any single factor. The distance from the factory was the strongest driver explaining the spatial distribution, and its interaction contribution with other major drivers was also significantly enhanced.
The interaction between the parent material and other driving factors for Factor 3 also showed a two-factor enhancement effect. The parent material was the driver, which best explained the spatial distribution of Factor 3.
For Factor 4, the influence of the interaction between land use and the distance from the factory was less than the sum of the two contributions alone. Land use became the most powerful driver influencing the spatial distribution of Factor 4 and showed nonlinear enhancement.
Combined with the results of the geo-detector model, the main factors affecting the spatial distribution of potentially toxic elements in the study area are as follows: parent material (q = 0.327) > land use type (q = 0.010) > distance from the factory (q = 0.008) > distance from the riverbank (q = 0.007). Of these, the parent material explained the spatial distribution of the potentially toxic elements the best and reflected the significant influence of human activities on the accumulation of the potentially toxic elements. These results not only reveal the complex mechanisms affecting the spatial distribution of the potentially toxic elements but also emphasize the key role of multi-factor synthesis in the distribution of potentially toxic elements.

3.5. Summary of Source Apportionment

The spatial analysis data of the auxiliary dataset (categorical and numerical variables) and the comprehensive model proposed in this study (which combines the PMF with the geo-detector model) provide a deep understanding of the multi-source allocation [12,13].
F1, as obtained using the PMF model, showed a wide range of high values in the central and eastern regions (Figure 5). The spatial pattern of F1 showed high spatial consistency with the distribution of the riverbank when combined with the F1 distribution obtained using inverse distance weights. The distance from the riverbank had the highest and most significant spatial correlation with F1 in the geo-detector model, and the interaction between the distance from the riverbank and other influencing factors showed nonlinear enhancement. Therefore, in summary, F1 can be defined as a source of irrigation water. Urban sewage and agricultural activities are the main sources of As pollution in irrigation systems. As-containing substances enter the irrigation system through runoff and infiltration and then enter the farmland via the irrigation water.
The PMF model showed a high value in the northeast suburbs with strong industrial development and urban expansion for F2. From the spatial distribution, the high value of F2 mainly occurs in the suburbs. The spatial correlation between industrial production and F2 was the highest and most significant in the geo-detector model. In terms of interaction, the q value of the combination of the distance from the riverbank and the distance from the factory increased the most. This further confirms that F2 mainly resulted from industrial activities, and the discharge of wastewater and gas caused the increase in the soil’s potentially toxic elements.
The PMF-derived F3 was high in the south-central region. The parent material had the highest and most significant spatial correlation with F3 in the geo-detector model. Regarding interaction, the PD value of the combined effect of the soil parent material and the distance from the factory was the highest. Therefore, the F3 is related to the parent material. In particular, the granitic area controls the occurrence of Pb.
The F4 obtained using the PMF model showed high values in the midwest and the northeast. The spatial distribution shows that the high F4 value was mainly limited to the farming area. The spatial correlation between cultivated land distribution and F4 was the highest and most significant in the geo-detector model. Regarding interaction, the q value of the combined effect of land use type superimposed by distance from the riverbank increased the most. This further confirms that the main source of F4 is agricultural activities. Agricultural inputs, such as phosphate fertilizer and livestock manure, increase the potentially toxic element content of the soil.
In terms of the contribution of various pollution sources to potentially toxic elements, 59.4% of the potentially toxic elements in the soil can be attributed to human sources, and 40.6% can be attributed to natural sources. Of the human sources, agricultural pollution accounted for 22.9% of the total pollution, and industrial pollution accounted for 36.5%. The main source of Pb and Zn is the soil parent material, and the source of Cd and Hg is agricultural activities such as fertilization. Arsenic mainly comes from irrigation water, while Cr, Cu, and Ni are related to electroplating factories.

3.6. Comparison and Verification of the Results of the Two Models

The PMF model and the geo-detector model agreed in several respects. Both models identified agricultural activities and industrial emissions as the main sources of potentially toxic element pollution in the study area. This is consistent with the characteristics of regional industrial structure. The key factors affecting metal pollution, as analyzed using the geo-detector model, corresponded to the types of pollution sources analyzed using the PMF model. This verifies the reliability of the results. The PMF model can also quantify the contribution of each pollution source, while the geo-detector model reveals the spatial heterogeneity and interaction of the influencing factors. The combined use of the two models allows for the comprehensive analysis of the distribution of potentially toxic element pollution from two-dimensional source apportionment and spatial differentiation, further confirming the effectiveness of soil potentially toxic element traceability research methods [14].

4. Conclusions

This paper combined the positive matrix factorization (PMF) model and the geo-detector model to analyze the regional soil potentially toxic element pollution sources in the Nanliujiang River Basin. The main sources of the potentially toxic element pollution of the agricultural soil in the Nanliujiang River Basin mainly included four categories: geological background, agricultural activities, industrial discharge, and river irrigation. The geological background contributed the most, while agricultural activities, industrial discharge, and river irrigation mainly affected the spatial distribution of the potentially toxic elements. The various pollution sources and their contribution rates were successfully identified through the PMF model, and the key factors affecting the spatial distribution of the potentially toxic elements were further analyzed using the geo-detector model.
Regarding the pollution source apportionment, the geological background was the main source of potentially toxic element pollution, which reflects the abundant exposure in the region’s interior and the critical influence of soil parent material on the potentially toxic element content. Human factors, such as agricultural activities, industrial discharge, and river irrigation, have also contributed significantly to potentially toxic element pollution. The parent material, land use type, distance from the factory, and distance from the riverbank jointly affected the spatial distribution of potentially toxic elements, of which the parent material had the largest influence.
By integrating the PMF and geo-detector models, this study provides a comprehensive analysis of soil PTE pollution sources in the Nanliujiang River Basin. The results highlight the significant contribution of geological background and human activities to PTE pollution, offering valuable insights for targeted soil pollution control measures.

Author Contributions

X.L. contributed to conceptualization, methodology, writing—original draft, writing—review and editing, and visualization. Z.Y. contributed to conceptualization, writing—review and editing, and supervision. B.L. contributed to visualization. Z.W. contributed to the methodology. L.W. contributed to writing—review and editing. T.Y. contributed to methodology. C.L. contributed to visualization. Z.H. contributed to writing—review and editing. M.X. contributed to data curation. C.D. contributed to visualization. H.S. contributed to conceptualization, writing—review and editing, and supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the National Key R&D Program of China (Granted No. 2022YFC3704805), the National Natural Science Foundation of China (42330703, and 42377259), and the Assessment of the Effectiveness of Source Control of Heavy Metal Pollution in Farmland of Typical Historical Coal Mine Areas in Pingxiang.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; the collection, analyses, or interpretation of data; the writing of the manuscript; or the decision to publish the results.

References

  1. Haque, E.; Thorne, P.S.; Nghiem, A.A.; Yip, C.S.; Bostick, B.C. Lead (Pb) concentrations and speciation in residential soils from an urban community impacted by multiple legacy sources. J. Hazard. Mater. 2021, 416, 125886. [Google Scholar] [CrossRef] [PubMed]
  2. Aoshima, K. Itai-itai disease: Renal tubular osteomalacia induced by environmental exposure to cadmium-historical review and perspectives. Soil. Sci. Plant Nutr. 2016, 62, 319–326. [Google Scholar] [CrossRef]
  3. Qin, S.; Liu, H.; Nie, Z.; Rengel, Z.; Gao, W.; Li, C.; Zhao, P. Toxicity of cadmium and its competition with mineral nutrients for uptake by plants: A review. Pedosphere 2020, 30, 168–180. [Google Scholar] [CrossRef]
  4. Åkesson, A.; Barregard, L.; Bergdahl, I.A.; Nordberg, G.F.; Nordberg, M.; Skerfving, S. Non-Renal Effects and the Risk Assessment of Environmental Cadmium Exposure. Environ. Health Perspect. 2014, 122, 431–438. [Google Scholar] [CrossRef]
  5. Angon, P.B.; Islam, M.S.; KC, S.; Das, A.; Anjum, N.; Poudel, A.; Suchi, S.A. Sources, effects and present perspectives of heavy metals contamination: Soil, plants and human food chain. Heliyon 2024, 10, e28357. [Google Scholar] [CrossRef] [PubMed]
  6. Wang, X.; Han, Z.; Wang, W.; Zhang, B.; Wu, H.; Nie, L.; Zhou, J.; Chi, Q.; Xu, S.; Liu, H.; et al. Continental-scale geochemical survey of lead (Pb) in mainland China’s pedosphere: Concentration, spatial distribution and influences. Appl. Geochem. 2019, 100, 55–63. [Google Scholar] [CrossRef]
  7. Shi, P.; Zhang, Y.; Li, Z.; Li, P.; Xu, G. Influence of land use and land cover patterns on seasonal water quality at multi-spatial scales. Catena 2017, 151, 182–190. [Google Scholar] [CrossRef]
  8. Liu, Z.; Wang, L.; Yan, M.; Ma, B.; Cao, R. Source apportionment of soil heavy metals based on multivariate statistical analysis and the PMF model: A case study of the Nanyang Basin, China. Environ. Technol. Innov. 2024, 33, 103537. [Google Scholar] [CrossRef]
  9. Yao, C.; Yang, Y.; Li, C.; Shen, Z.; Li, J.; Mei, N.; Luo, C.; Wang, Y.; Zhang, C.; Wang, D. Heavy metal pollution in agricultural soils from surrounding industries with low emissions: Assessing contamination levels and sources. Sci. Total Environ. 2024, 917, 170610. [Google Scholar] [CrossRef]
  10. Kumar Chakraborty, T.; Zaima Mobaswara, M.; Simoon Nice, M.; Rashedul Islam, K.; Nahar Netema, B.; Sozibur Rahman, M.; Habib, A.; Zaman, S.; Chandra Ghosh, G.; Tul-Coubra, K.; et al. Application of machine learning and multivariate approaches for source apportionment and risks of hazardous elements in the cropland soils near industrial areas in Bangladesh. Ecol. Indic. 2023, 154, 110856. [Google Scholar] [CrossRef]
  11. Xu, H.; Wang, H.; Singh, B.P.; Croot, P.; Zhang, C. Identification of possible sources for potentially toxic elements and polycyclic aromatic hydrocarbons and their spatially varying relationships in urban soils of Dublin, Ireland. Environ. Pollut. 2023, 333, 122034. [Google Scholar] [CrossRef] [PubMed]
  12. Fei, X.; Lou, Z.; Xiao, R.; Ren, Z.; Lv, X. Contamination assessment and source apportionment of heavy metals in agricultural soil through the synthesis of PMF and GeoDetector models. Sci. Total Environ. 2020, 747, 141293. [Google Scholar] [CrossRef] [PubMed]
  13. Shi, H.; Wang, P.; Zheng, J.; Deng, Y.; Zhuang, C.; Huang, F.; Xiao, R. A comprehensive framework for identifying contributing factors of soil trace metal pollution using Geodetector and spatial bivariate analysis. Sci. Total Environ. 2023, 857, 159636. [Google Scholar] [CrossRef] [PubMed]
  14. Qiao, Y.; Wang, X.; Han, Z.; Tian, M.; Wang, Q.; Wu, H.; Liu, F. Geodetector based identification of influencing factors on spatial distribution patterns of heavy metals in soil: A case in the upper reaches of the Yangtze River, China. Appl. Geochem. 2022, 146, 105459. [Google Scholar] [CrossRef]
  15. Huang, S.; Xiao, L.; Zhang, Y.; Wang, L.; Tang, L. Interactive effects of natural and anthropogenic factors on heterogenetic accumulations of heavy metals in surface soils through geodetector analysis. Sci. Total Environ. 2021, 789, 147937. [Google Scholar] [CrossRef] [PubMed]
  16. Chen, G.; Zhao, Y.; Hu, B.; Yang, J.; Huang, Y. Spatial and temporal distribution of heavy metals in riparian zone and soil environmental quality analysis. Environ. Pollut. Control 2017, 39, 981–986, (In Chinese with English abstract). [Google Scholar]
  17. Dal, J.; Yi, Y.; Luyu, W.; Wang, Y. Segmentation of the Point Source and Non-point Source Pollution Load of Several Pollutants in the River. Ind. Saf. Environ. Prot. 2016, 42, 72–75, (In Chinese with English abstract). [Google Scholar]
  18. Yang, Q.; Yang, Z.; Filippelli, G.M.; Ji, J.; Ji, W.; Liu, X.; Wang, L.; Yu, T.; Wu, T.; Zhuo, X.; et al. Distribution and secondary enrichment of heavy metal elements in karstic soils with high geochemical background in Guangxi, China. Chem. Geol. 2021, 567, 120081. [Google Scholar] [CrossRef]
  19. Yang, Q.; Yang, Z.; Zhang, Q.; Liu, X.; Zhuo, X.; Wu, T.; Wang, L.; Wei, X.; Ji, J. Ecological risk assessment of Cd and other heavy metals in soil-rice system in the karst areas with high geochemical background of Guangxi, China. Sci. China Earth Sci. 2021, 64, 1126–1139. [Google Scholar] [CrossRef]
  20. DZ/T 0258-2014; MLR Specification of Multi-Purpose Regional Geochemical Survey (1: 250000). Standards Press of China: Beijing, China, 2014.
  21. Liang, J.; Liu, Z.; Tian, Y.; Shi, H.; Fei, Y.; Qi, J.; Mo, L. Research on health risk assessment of heavy metals in soil based on multi-factor source apportionment: A case study in Guangdong Province, China. Sci. Total Environ. 2023, 858, 159991. [Google Scholar] [CrossRef]
  22. Wang, J.; Hu, Y. Environmental health risk detection with GeogDetector. Environ. Modell. Softw. 2012, 33, 114–115. [Google Scholar] [CrossRef]
  23. Jiang, X.; Xiong, Z.; Liu, H.; Liu, G.; Liu, W. Distribution, source identification, and ecological risk assessment of heavy metals in wetland soils of a river–reservoir system. Environ. Sci. Pollut. Res. 2017, 24, 436–444. [Google Scholar] [CrossRef]
  24. Qiao, M.; Cai, C.; Huang, Y.; Liu, Y.; Lin, A.; Zheng, Y. Characterization of soil heavy metal contamination and potential health risk in metropolitan region of northern China. Environ. Monit. Assess. 2011, 172, 353–365. [Google Scholar] [CrossRef] [PubMed]
  25. Hu, B.; Zhou, Y.; Jiang, Y.; Ji, W.; Fu, Z.; Shao, S.; Li, S.; Huang, M.; Zhou, L.; Shi, Z. Spatio-temporal variation and source changes of potentially toxic elements in soil on a typical plain of the Yangtze River Delta, China (2002–2012). J. Environ. Manag. 2020, 271, 110943. [Google Scholar] [CrossRef] [PubMed]
  26. Wang, S.; Cai, L.; Wen, H.; Luo, J.; Wang, Q.; Liu, X. Spatial distribution and source apportionment of heavy metals in soil from a typical county-level city of Guangdong Province, China. Sci. Total Environ. 2019, 655, 92–101. [Google Scholar] [CrossRef]
  27. He, S.; Lu, Q.; Li, W.; Ren, Z.; Zhou, Z.; Feng, X.; Zhang, Y.; Li, Y. Factors controlling cadmium and lead activities in different parent material-derived soils from the Pearl River Basin. Chemosphere 2017, 182, 509–516. [Google Scholar] [CrossRef] [PubMed]
  28. Guo, W.; Zhang, H.; Cui, S.; Xu, Q.; Tang, Z.; Gao, F. Assessment of the distribution and risks of organochlorine pesticides in core sediments from areas of different human activity on Lake Baiyangdian, China. Stoch. Environ. Res. Risk A 2014, 28, 1035–1044. [Google Scholar] [CrossRef]
  29. Cui, Z.; Wang, Y.; Zhao, N.; Yu, R.; Xu, G.; Yu, Y. Spatial Distribution and Risk Assessment of Heavy Metals in Paddy Soils of Yongshuyu Irrigation Area from Songhua River Basin, Northeast China. Chinese Geogr. Sci. 2018, 28, 797–809. [Google Scholar] [CrossRef]
  30. Dong, S.; Ochoa Gonzalez, R.; Harrison, R.M.; Green, D.; North, R.; Fowler, G.; Weiss, D. Isotopic signatures suggest important contributions from recycled gasoline, road dust and non-exhaust traffic sources for copper, zinc and lead in PM10 in London, United Kingdom. Atmos. Environ. 2017, 165, 88–98. [Google Scholar] [CrossRef]
  31. Jones, L.; Sullivan, T.; Kinsella, B.; Furey, A.; Regan, F. Occurrence of Selected Metals in Wastewater Effluent and Surface Water in Ireland. Anal. Lett. 2017, 50, 724–737. [Google Scholar] [CrossRef]
  32. Shao, D.; Zhan, Y.; Zhou, W.; Zhu, L. Current status and temporal trend of heavy metals in farmland soil of the Yangtze River Delta Region: Field survey and meta-analysis. Environ. Pollut. 2016, 219, 329–336. [Google Scholar] [CrossRef] [PubMed]
  33. Zhao, Y.; Yan, Z.; Qin, J.; Xiao, Z. Effects of long-term cattle manure application on soil properties and soil heavy metals in corn seed production in Northwest China. Environ. Sci. Pollut. Res. 2014, 21, 7586–7595. [Google Scholar] [CrossRef] [PubMed]
  34. Sun, L.; Guo, D.; Liu, K.; Meng, H.; Zheng, Y.; Yuan, F.; Zhu, G. Levels, sources, and spatial distribution of heavy metals in soils from a typical coal industrial city of Tangshan, China. Catena 2019, 175, 101–109. [Google Scholar] [CrossRef]
Figure 1. Sampling points of the study area.
Figure 1. Sampling points of the study area.
Land 14 00146 g001
Figure 2. Distribution map of various variables: (a) lithological map, (b) land use map, (c) distance map from enterprise, (d) distance map from river.
Figure 2. Distribution map of various variables: (a) lithological map, (b) land use map, (c) distance map from enterprise, (d) distance map from river.
Land 14 00146 g002
Figure 3. Boxplots showing a comparison of 8 PTEs in the topsoil.
Figure 3. Boxplots showing a comparison of 8 PTEs in the topsoil.
Land 14 00146 g003
Figure 4. Correlation plots showing Spearman correlation coefficients for the 8 PTEs.
Figure 4. Correlation plots showing Spearman correlation coefficients for the 8 PTEs.
Land 14 00146 g004
Figure 5. Spatial distribution maps for PTEs in the topsoil of the Nanliujiang River Basin: (a) As, (b) Cd, (c) Cr, (d) Cu, (e) Hg, (f) Ni, (g) Pb, (h) Zn.
Figure 5. Spatial distribution maps for PTEs in the topsoil of the Nanliujiang River Basin: (a) As, (b) Cd, (c) Cr, (d) Cu, (e) Hg, (f) Ni, (g) Pb, (h) Zn.
Land 14 00146 g005
Figure 6. Stacked histogram graphs showing factor contributions associated with each source.
Figure 6. Stacked histogram graphs showing factor contributions associated with each source.
Land 14 00146 g006
Figure 7. Spatial distribution of normalized contributions of four factors: (a) Factor 1, (b) Factor 2, (c) Factor 3, (d) Factor 4.
Figure 7. Spatial distribution of normalized contributions of four factors: (a) Factor 1, (b) Factor 2, (c) Factor 3, (d) Factor 4.
Land 14 00146 g007
Figure 8. Interactions among the main driving factors of PTEs in high q value: (a) Factor 1, (b) Factor 2, (c) Factor 3, (d) Factor 4. (SP—soil parent material; LU—land use type; RD: distance from the riverbank; FD: distance from the factory).
Figure 8. Interactions among the main driving factors of PTEs in high q value: (a) Factor 1, (b) Factor 2, (c) Factor 3, (d) Factor 4. (SP—soil parent material; LU—land use type; RD: distance from the riverbank; FD: distance from the factory).
Land 14 00146 g008
Table 1. Summary of basic statistics of eight selected PTEs in the topsoil of Nanliujiang.
Table 1. Summary of basic statistics of eight selected PTEs in the topsoil of Nanliujiang.
Min.Q25MeanMedianQ75Max.Std.CV (%)
As1.176.2811.769.1713.30246.0011.96101.70
Cd0.010.090.160.130.196.180.19119.84
Cr4.5031.0045.4843.0053.00319.0024.3853.59
Cu1.8710.4516.5614.4020.8599.709.3056.13
Hg0.020.070.130.100.128.110.30225.58
Ni2.048.0912.6711.2015.9281.207.0455.53
Pb7.4021.3032.2531.2041.65171.0014.3344.42
Zn5.5030.2047.8945.5059.40275.0025.9154.11
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, X.; Yang, Z.; Li, B.; Wu, Z.; Wang, L.; Yu, T.; Li, C.; He, Z.; Xie, M.; Deng, C.; et al. Source Apportionment and Analysis of Potentially Toxic Element Sources in Agricultural Soils Based on the Positive Matrix Factorization and Geo-Detector Models. Land 2025, 14, 146. https://doi.org/10.3390/land14010146

AMA Style

Liu X, Yang Z, Li B, Wu Z, Wang L, Yu T, Li C, He Z, Xie M, Deng C, et al. Source Apportionment and Analysis of Potentially Toxic Element Sources in Agricultural Soils Based on the Positive Matrix Factorization and Geo-Detector Models. Land. 2025; 14(1):146. https://doi.org/10.3390/land14010146

Chicago/Turabian Style

Liu, Xu, Zhongfang Yang, Bo Li, Zhiliang Wu, Lei Wang, Tao Yu, Cheng Li, Zexin He, Minghui Xie, Chenning Deng, and et al. 2025. "Source Apportionment and Analysis of Potentially Toxic Element Sources in Agricultural Soils Based on the Positive Matrix Factorization and Geo-Detector Models" Land 14, no. 1: 146. https://doi.org/10.3390/land14010146

APA Style

Liu, X., Yang, Z., Li, B., Wu, Z., Wang, L., Yu, T., Li, C., He, Z., Xie, M., Deng, C., & Shi, H. (2025). Source Apportionment and Analysis of Potentially Toxic Element Sources in Agricultural Soils Based on the Positive Matrix Factorization and Geo-Detector Models. Land, 14(1), 146. https://doi.org/10.3390/land14010146

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop