1. Introduction
Accurate simulations help ecologists and conservationists effectively address ecosystem challenges and provide scientific foundations for developing reasonable strategies [
1]. Species richness serves as a fundamental element of ecosystems and plays a crucial role in maintaining ecological balance, enhancing ecosystem services, and promoting human well-being [
2]. By simulating species interactions, population dynamics, and ecological processes, simulations provide important insights that inform conservation strategies, guide policy decisions, and enhance ecosystem management [
3,
4]. Additionally, simulations offer a controlled and cost-effective framework for testing hypotheses and exploring potential outcomes of different interventions, providing support for both scientific research and practical applications [
5]. As the pressures on ecosystems continue to increase, the demand for precise, user-friendly, and scalable simulation tools becomes more urgent. These tools assist researchers and decision-makers in formulating effective biodiversity conservation and ecosystem sustainability strategies. Furthermore, existing population dynamics simulation tools often rely on a single perspective for analysis, which may lead to partial observability issues during the simulation process. This limitation hinders a comprehensive reflection of the complex processes of species succession. Therefore, the simulator must integrate various biological models to provide multi-perspective analyses of population dynamics.
Based on the research, the current tools for multidisciplinary expert wildlife research still face three major challenges. (I) Insufficient multi-view assessment: specialized simulators [
6,
7,
8], for example, specialized in free-roaming dogs, Ursus arctos, and Maned wolfs, despite their focused and tailored nature, may sometimes provide a narrower perspective on ecological systems, potentially leading to less comprehensive assessments [
9]. (II) A high learning curve: General-purpose tools [
10,
11,
12,
13,
14,
15,
16,
17], including those designed for educational purposes [
18,
19,
20], are applicable to a wide range of biological species and ecological scenarios. But they often require a certain level of programming expertise, which can present challenges for users without a computational background [
21]. Programming is necessary for customized development, such as adding new dimensions or influencing factors for population dynamics simulations. Users without programming backgrounds may find it challenging to quickly implement their desired customizations. This technical barrier may unintentionally limit the participation of diverse stakeholders, particularly those from non-computational fields such as ecology or policy-making, potentially impeding the widespread dissemination and educational potential of these valuable tools. (III) The lack of seamless secondary development with Python (3.10.12): With the development of AI and data science, there are higher requirements for simulators, which need to integrate with AI and data science. Since Python is the dominant language in AI and data science, there is a need for simulator tools that integrate seamlessly with Python environments. However, many general-purpose tools tend to be commercial, closed-source, and developed in non-Python languages (e.g., Java), which limits users to specific platforms (e.g., only being able to use a certain company’s products or download specific tools). When users need to integrate with Python development environments, they still face significant difficulties and challenges (e.g., package management and multi-file import issues [
22]). Even with open-source options available, they may still be developed in languages other than Python, presenting similar integration challenges. The Python-based simulator tool bridges the gap between comprehensive modeling capabilities and a user-friendly interface. It also facilitates smooth integration with Python to enhance rapid decision-making processes and broaden educational opportunities in the fields of population dynamics and biodiversity conservation [
23,
24].
To address these issues, we developed a novel software tool named the
Wildlife
Analysis and
Population
Ecology
Tool (WAPET) (Python 3.10.12, code url:
https://github.com/JoshuaYuYide/WAPET.git (accessed on 1 December 2024)) to address these challenges through a three-pronged approach. Firstly, to overcome
challenge (I), WAPET incorporates Monte Carlo simulation [
25] and integrates ecological models, including Logistic Growth [
26], Random Walk [
27,
28], and Cellular Automata [
29]. These models are widely used in ecological research to simulate population dynamics and spatial distribution from particular perspectives.
This approach enables WAPET to simulate population dynamics from multiple perspectives, including competition, reproductive networks, habitat analysis, and population trends, thus providing a more comprehensive assessment of ecological systems. Secondly, to address
challenge (II), WAPET employs a fully parameterized input paradigm, allowing users without coding to easily explore simulations. Users can operate WAPET solely by inputting parameters on GUI. The paradigm facilitates the participation of diverse stakeholders, particularly those from non-computational fields such as ecology or policy-making, thereby enhancing the tool’s educational potential and accessibility. Lastly, to tackle
challenge (III), WAPET’s development is entirely Python-based, utilizing PySide6 [
30] and Mesa [
31] libraries. This framework enables seamless development in Python environments, providing robust extensibility while avoiding common issues like package management or multi-file imports. Therefore, WAPET can function as a module for Python projects, imported into AI or data science projects, allowing seamless interaction with packages like Pytorch and scikit-learn, and so on. Additionally, WAPET employs agent-based modeling (ABM) [
32] to simulate individual behaviors, offering a more realistic representation of natural systems by accounting for the randomness and variability inherent in individual organisms and their impact on population dynamics. This comprehensive approach positions WAPET as a versatile tool for both experienced researchers and non-computational stakeholders in biodiversity decision-making processes, bridging the gap between comprehensive modeling capabilities and user-friendly interfaces.
Our contributions are as follows: (I) We integrated Logistic Growth, Random Walk, and Cellular Automata via Monte Carlo simulation, offering WAPET multiple perspectives to gain a better understanding of ecological processes. (II) We developed a no-code mode of human-computer interaction in WAPET. The no-code simulation tool for population dynamics offers a low-barrier entry for biodiversity stakeholders and researchers. (III) We implemented a framework combining PySide6 for the GUI and Mesa for agent-based modeling, both Python-based libraries. This integration facilitates easy extension and customization, allowing users to adapt or rewrite components to suit their research needs.
3. Methodology
The WAPET framework consists of two components: the software framework and ecological models. The software framework provides users with an intuitive interface, allowing them to set parameters for the ecological models and execute simulations through code, subsequently delivering the simulation results back to users. The ecological models conduct more comprehensive simulations by considering the multidimensional influencing factors during the species succession process, thereby reducing the partial observability issues arising from single-factor analysis. Together, these components complement each other, enabling WAPET to effectively support ecological research and the formulation of conservation strategies.
3.1. Architecture
As shown in
Figure 1, the architecture is structured into three layers: the Presentation Layer (user interface and visualized results), the Agent Layer (agents for environmental and species objects), and the Interaction Layer (interactions between agents generating simulation results). This structure enhances modularity and maintainability.
The information flow begins with the GUI receiving user input data or parameters, which triggers the initialization of the environment and species agents. These agents then enter the Interaction Engine, where they engage in dynamic interactions, resulting in updates to their states. The Simulation Controller subsequently generates results and directs further cycles according to a predetermined schedule. This process is repeated iteratively until the simulations are complete. Upon completion, the Simulation Controller sends the results to the visualization module, which presents the findings through the GUI.
We can abstract the interaction flow into a Markov decision process (MDP) [
35]. An MDP is 5-tuple
, where the
is the state space, the
is the action space,
is the reward function,
is the reward set,
is the state transition function, and
is the discount factor.
An agent at the current state, , chooses an action, , and give this action to the environment. Then, the environment responds to a reward, r, through the reward function and the next state via the state transition function . This process runs constantly until the end of the episode, .
A state,
, consists of multiple observational dimensions, with each dimension representing an ecological model. However, when the number of dimensions is limited, the state,
s, may fail to fully capture the dynamics of the environment. In this case, the agent receives an observation,
o, where the dimensionality of
o is smaller than that of the true state,
s. This leads the agent to face a partially observable Markov decision process (POMDP) [
36] problem. Incorporating more ecological models increases the number of dimensions, thereby mitigating the issue of partial observability.
3.2. GUI
Figure 2 and
Figure 3 show the human–computer interaction interface. The main interface is divided into three regions: left, center, and right. The central region in
Figure 2 includes the Species Data Management Panel for population parameters and the Environment Data Management Panel for environmental parameters. The rightmost section is the Operation Panel for simulation controls.
Figure 3 shows the landscape configuration. The landscape is a grid with adjustable dimensions. Users can generate random maps or manually modify grid types. Population distribution heat maps and environmental carrying capacity maps can be rendered. The left region displays the Species Data Table and Simulation Result Table. The right region features the Plotting Panel for visualizing simulation results.
3.3. Environment Agent
The Environment Agent represents a realistic natural environment, with the primary function of interacting with the Species Agent and providing feedback based on these interactions. In this paper, the Environment Agent is constructed by the Soil Agent and the Climate Agent, which simulate soil conditions and climate factors, respectively. This design allows the Environment Agent to offer more authentic and multidimensional environmental feedback to the Species Agent.
3.3.1. Soil Agent
The Soil Agent simulates the landscape, representing habitat changes and their impacts on population dynamics. It is an array with grid cells containing information on carrying capacity and species occupancy.
The Soil Agent consists of many grids. Mathematically, let a Soil Agent be a function of , where and are the map’s size. For any grid , is the carrying ability function, is the function of the Species Agent set K on the grid, and is the soil type function.
Users initialize the soil type function
with the carrying ability mean
and standard deviation
, sampling data from the normal distribution [
37]:
The Soil Agent updates the carrying ability and Species Agent number via at each time-step t.
3.3.2. Climate Agent
Climate impacts carrying capacity, with extreme weather affecting land’s ability to support populations [
38,
39]. We model extreme weather as events deviating from the normal state [
40]. We offer three climate models: tropical rainforest, tropical desert, and the Mediterranean. Each has a discrete probability distribution, sampled each time-step. Prolonged single weather types reduce the carrying capacity, simulating extreme weather impacts.
3.4. Species Agent
Species Agents have survival, locomotion, and reproduction abilities. Survival includes attack and escape abilities, hunger rates, growth rates, and survival rates. Users initialize locomotion speed with the mean and the standard deviation . Species Agents sample speed from this distribution at each time-step. Reproductive capacity allows mating within the same grid, with options for monogamy.
New Species Agents arise from initial setup and reproduction. In multi-species scenarios, there are predator, target, and prey agents. This food chain impacts each species’ survival and reproduction, modeled by their interactions.
3.5. Interaction Engine
3.5.1. Logistic Growth Model
The Interaction Engine integrates ecological models through Monte Carlo simulation. We use the Logistic Growth Model for feeding and reproduction success rates. The continuous-time differential equation is as follows:
where
N is the population size,
r is the growth rate, and
is the carrying capacity. The discrete-time form is as follows:
We sample environmental regulation and individual success rate , where D is attenuation.
3.5.2. Random Walk Model and Cellular Automata
The random walk model represents the Species Agent movement; the formula of a Species Agent position changing along with the time horizon is as follows:
where
is movement distance per time-step, and
is the start location.
Although species move randomly across the landscape, their movement decisions are also influenced by other factors. For instance, they tend to move toward areas where conspecifics are concentrated. The tendency follows the movement of the cellular Automata model, which is as follows:
where
is the sum of
neighbors of cell
. Cells can be as follows:
Birth: dead cell (0) becomes living (1) if it has three living neighbors.
Survival: living cell (1) remains alive with two or three living neighbors.
Death: living cell (1) dies (0) if neighbors are not two or three.
Species Agents tend to move towards survival cells. The tendency probability is , with the maximum distance .
3.5.3. Other Models
Attack and Escape
In multi-species scenarios, food chain dynamics emerge. Each species has attack,
, and escape capabilities,
, as actions, modeled probabilistically. Ås the age
grows, the Species Agent’s
k attack and escape capabilities fluctuate. For a Species Agent
k, the formula of the attack capability changing is
where
is the coefficient of variation of predatory capacity, and
is the marriage age of the Species Agent
k. the formula of the escape capability changing is
Predation success depends on relative strengths, leading to capture or survival. The stronger the predation ability of a predator, the higher the probability of capturing prey. However, if the prey has a stronger escape ability, the probability of evading predation increases. Predation serves as another primary source of food, in addition to the food provided via the environment.
Mating and Giving Birth
In the simulated ecosystem, the reproductive behavior of individual species is a crucial factor for species continuation and population dynamics. To simulate the mating and reproductive processes among individuals, this paper proposes a set of multi-condition mating rules. Specifically, individuals of the same species, , can mate if they meet the following conditions: they must be of opposite sexes, be located in the same grid, reach the appropriate mating age, and be unmarried and in a living state.
In the model, the mating behavior of individuals is controlled through the following methods:
Mating condition assessment: First, check whether the individual’s age exceeds the specified minimum mating age, , and the individual must be alive. Additionally, the individual must be of the opposite sex to the potential partner. For monogamous individuals, it further checks whether the individual already has a partner. If all these conditions are met, they indicate that the individual can mate; otherwise, they indicate that mating is not possible.
Mathematically, this rule can be expressed as follows:
where
is the gender of the agent
k,
is the agent
k’s survival, and
is the agent
k being married.
Mating execution: When the mating conditions are satisfied, the Species Agent attempts to find potential mating partners within the grid where the current individual resides. By traversing the same grid for conspecifics, if both individuals meet the mating conditions, they will pair up as partners, updating each other’s partner and status to reflect their new married state.
Reproductive process: Following successful mating, individuals can reproduce. The reproductive behavior of the population is regulated by population density. Specifically, the population size
N and the environmental carrying capacity
affect the reproductive probability, which is calculated using the logistic growth model:
where
r represents the growth rate,
N is the current population size in the grid,
is the carrying capacity of that grid, and
is the fertility factor for individual fertility as age changes. If the reproductive conditions are met, a new individual is generated, and its attributes (such as location, parent information, etc.) are initialized.
This mating and reproductive rule not only considers biological characteristics such as gender and age but also incorporates mating patterns (e.g., monogamy) and environmental factors (e.g., population density), ensuring the rationality and complexity of the simulation process. Through these rules, the model can dynamically simulate the mating and reproductive behaviors of individual species, further facilitating the analysis of population dynamic changes and their impacts on the ecosystem.
3.6. Simulation Controller
3.6.1. Scheduling
Overview
The Simulation Controller manages time, agent, and execution order scheduling. We use the mesa.time.SimultaneousActivation scheduler for simultaneous agent updates. Soil Agents refresh species count and carrying capacity. The Climate Agent selects weather based on its probability distribution at each time-step.
3.6.2. Agent Scheduling
The scheduling order for any Species Agent follows a systematic sequence. Initially, the agent’s vitality status is checked. Deceased agents are removed from the simulation, and new agents are introduced. Next, the agent performs a random movement within the environment. The agent then consumes available food resources. If the agent is not attached, it may participate in mating, forming a pair bond with a compatible agent. Successful mating can lead to reproduction, generating offspring. Finally, the agent undergoes growth, advancing in age and potentially transitioning through different life stages.
3.6.3. Component Interaction
The Simulation Controller coordinates interactions among various components. It initializes the Environment Agent and randomly distributes Species Agents. Based on the user-defined maximum time-step, the Simulation Controller invokes the internal scheduling of each agent class in every iteration. The internal scheduling of agent classes partially derives results from the Interaction Engine. The Simulation Controller then consolidates these results and transmits them to the Visualization Tool. This coordinated workflow enables a comprehensive and synchronized simulation of the ecosystem.
3.7. Visualization Tool
The visualization toolkit offers various graphical display methods designed to help researchers intuitively understand and analyze complex data within simulated ecosystems. First, the gender ratio pie chart clearly illustrates the proportion of male and female individuals in the ecosystem, aiding researchers in understanding the gender distribution. Second, the age structure pie chart categorizes individuals into different age groups, facilitating the observation of the quantity distribution and changing trends of individuals at various age stages.
Additionally, the alive status pie chart displays the ratio of surviving to deceased individuals, reflecting the survival status of individuals in the ecosystem and enabling researchers to assess the health of the ecosystem. The population change curve graph serves as a dynamic tool that illustrates how population numbers fluctuate over time, helping researchers grasp the growth, decline, or volatility of populations from a temporal perspective. Finally, the reproductive network tree provides an intuitive view by showcasing the familial relationships between individuals, allowing researchers to easily trace the breeding history and lineage structure of individuals.
Furthermore, the population distribution map records the density of populations on the landscape after the simulation is complete. This map reflects the geographical distribution of populations following random movement to some extent. The land carrying capacity map displays the environmental carrying capacity of each soil agent (grid) on the landscape, which influences the population density. The real population line chart represents the actual trends in population number changes, allowing for the validation of the results and trends observed in the simulated population line chart.
Figure 4 showcases the diverse visualization capabilities of WAPET, demonstrating its ability to present complex ecological data in an accessible and informative manner. The figure comprises seven simulation-generated visualizations (
Figure 4a–g) and one real-world data plot (
Figure 4h), illustrating the tool’s comprehensive approach to population dynamics analysis. Specifically, the visualizations include the following: (
Figure 4a) the gender ratio, (
Figure 4b) the age distribution, (
Figure 4c) birth and death rates, (
Figure 4d) the simulated population trend, (
Figure 4e) the reproductive network, (
Figure 4f) the spatial distribution of the population within the habitat, and (
Figure 4g) the land carrying capacity distribution. The simulated population trend (
Figure 4d) is juxtaposed with the actual species data curve (
Figure 4h) from the UK butterfly population study [
41] to demonstrate the tool’s predictive capabilities. While the exact replication of real-world data is not the primary goal of this demonstration, the simulated trend closely approximates the general trajectory of population decline observed in the actual data. It is worth noting that the simulated curve (
Figure 4d) exhibits non-smooth characteristics and inherent randomness, which are natural consequences of employing ABM and Monte Carlo simulation techniques. Despite these differences in curve smoothness, both the simulated (
Figure 4d) and actual (
Figure 4h) data converge on a declining population trend, underscoring WAPET’s potential as a valuable tool for predicting broad ecological patterns. These visualizations collectively demonstrate WAPET’s capacity to provide a multi-dimensional understanding of population dynamics, offering valuable insights for biodiversity conservation efforts and policy-making. It is important to note that the visualizations presented in
Figure 4 are primarily intended to showcase the functionality of the WAPET tool, rather than to provide definitive ecological analyses. By simulating possible real-world conditions such as butterfly gender, age, survival, population changes, and reproductive networks, WAPET aids in understanding butterfly population dynamics and their influencing factors, even when the direct observation of these factors is challenging.