In this section, details about the adopted scenario for the case study of the comparison between the proposed intelligent algorithm for solar trackers and the conventional algorithm largely used in the market are presented. In addition, the equipment involved in the process is detailed.
Moreover, this section will outline the procedure used to develop the algorithm based on artificial intelligence (AI), presenting the methodology adopted from data collection through simulation to the validation process for algorithm development. Additionally, it will discuss the performance improvement process of the proposed solution, considering the various techniques used, the criteria for choosing the variables, and finally, the considerations made to quantify the energy gain of the developed algorithm according to the weather classification.
3.1. Materials
In this subsection, a brief overview of solar tracker technologies is presented. In addition, the equipment used in the proof of concept scenario is detailed, listing its functions and the communication between them.
To summarize, the developed AI-based solution operates as a plug-in at the control and field levels of the automation pyramid, positively impacting the generation results, as shown in the following sections. As seen in
Figure 1, for this case study, the developed algorithm operates at the same level as the control layer, interacting with the field process (weather station and solar trackers) through the Modbus TCP protocol, sending the calculated angle to the solar trackers based on the variables read from the sensors.
3.1.1. Solar Tracker
The tracker system utilized was the STI-250 from STI Norland, which is predominantly used in commercial solar plants in Brazil. This system is categorized as a one-axis solar tracker. The panels were installed facing north (for countries below the equator line), and the solar tracker system tracks the sun from east to west.
To follow the sun, an astronomical algorithm named SPA is used to calculate the sun’s position. NREL developed this algorithm, which STI Norland has implemented in their solar tracker systems. In addition to tracking the sun, the start of the morning and the end of the afternoon mark the backtracking period, when the tracker rows may shadow each other. STI Norland uses a proprietary algorithm to avoid shadowing the panels during this period.
This article labels the junction of these two algorithms as a “commercial solution”. As such, it is used as a reference to compare the developed solution.
Moreover, the trackers’ architecture is based on a tracker control unit (TCU) per tracker row. The unit can individually control the motor that regulates the panel’s position and guarantees the system’s operation.
Furthermore, a network controller unit (NCU) is used for larger solar plants to operate multiple TCUs. The communication between NCUs and TCUs is made through the ZigBee protocol, and the NCU can be accessed by the Modbus TCP communication protocol.
The tracker’s standard operation is the “automatic mode”, which uses the STI Norland backtracking algorithm with the SPA to calculate the sun’s position. To adopt the developed solution in the field, the trackers are set to “manual mode” through the NCU using the Modbus TCP communication protocol. Then, the optimized angles calculated by AI are written in the STI tracker system registers.
3.1.2. Test Environment Architecture
The tracker’s standard operation is an “automatic mode”, which uses the STI Norland backtracking algorithm with the SPA to calculate the sun’s position. The trackers are set to “manual mode” through the NCU using the Modbus TCP communication protocol to use the developed solution. Then, the optimized angles calculated by AI are written in the STI tracker system registers.
To minimize the influence of external variables, these two inverters are located near the weather station and have similar physical characteristics. The plant parameters are listed in
Table 1.
As shown in
Table 1, each inverter is connected to six solar tracker rows, with 360 solar panels per set. This distribution is shown in
Figure 2.
In addition, it is necessary to receive and send data to the equipment to operate the AI solution. The communication protocols used in the implementation are listed in
Table 2.
The data are read from and written directly to the solar tracker NCU by the respective TCUs. System-generated energy and instantaneous power are obtained through the Huawei SmartLogger, which is connected to the inverters; moreover, weather data are collected from the weather station.
Lastly, some solar plant parameters are listed as follows:
3.2. Methods: Developing the Intelligent Algorithm
The procedure presented in
Figure 3 is followed for the algorithm development. One of the prerequisites for developing the intelligent algorithm is the simulation of the optimal behavior according to the current weather situation. Thus, the first step of the methodology is to download the weather data to use the NREL (National Renewable Energy Laboratory) satellite database; concerning the simulation environment, the pvlib package [
22] available in Python and Matlab was chosen; finally, the algorithm training follows with the optimal angles generated in step two.
So, as the first step, the NREL database was chosen after analytical research on the other available climate databases, taking into account some criteria:
Free access: Not considered a mandatory prerequisite but is seen as a positive point;
Data availability for the study region: The case study was conducted in a relevant environment in the northeast of Brazil. Therefore, the availability of data for the region is a mandatory prerequisite;
Availability of climate data, mainly the DNI, DHI, and GHI variables: For the simulation of optimal behavior, it is necessary to have these three variables available as inputs to the adopted irradiance model;
The smallest sampling interval: It is desired that the algorithm operates in accordance with weather changes and the positioning of the sun. So, it is essential that the intervals of the training data are as short as possible; data were obtained at intervals spaced every thirty minutes.
The NREL climate data coverage can be seen in
Figure 4, which shows that, depending on the region, data have been available since 1998 with ten-, thirty-, and sixty-minute sampling intervals. Among the available data, to mention just a few, are GHI (global horizontal irradiance), DNI (direct normal irradiance), DHI (diffuse horizontal irradiance), wind speed, wind direction, temperature, precipitation, and atmospheric pressure, meeting the minimum requirements for available variables.
The next step involved simulating the optimal behavior of a solar tracker, given the weather conditions and the characteristics of the solar plant. Python was the programming language used, and pvlib was chosen after a comprehensive review of Python packages for modeling photovoltaic systems. Its selection was based on the extensive availability of documentation and a wide variety of mathematical models for simulating a solar plant. These models include controls for trackers, the absorption of irradiance by mono/bifacial panels, and electrical modeling, among others.
In this sense, the main function used was the “infinite sheds model” [
23,
24], which is an irradiance transposition model that takes into account the main variables that influence a solar plant, such as the following:
Height: height of the panel relative to the surface;
Pitch: distance between rows of panels;
Sun’s zenith and azimuth;
Tracker orientation angle;
Width solar panel;
Bifaciality;
GHI, DNI e DHI;
Albedo.
Thus, keeping all parameters constant and varying only the orientation angle of the trackers, it was possible to determine which angle absorbs more irradiance for each specific weather situation. In
Figure 5, it is possible to observe a comparison between the angles that collect more irradiance on a sunny day and another on a cloudy day. The black line in the plot represents the optimal angle to absorb more irradiance. It can be seen that following the sun’s position is the best alternative on sunny days, as the DNI is higher, making it better to align with the sun’s normal angle. However, this is not the most effective strategy on cloudy days, when the best angles are closer to the horizontal position (0°), due to the GHI being higher than the DNI on such days. This observation corroborates literature studies indicating that this position is more advantageous as it takes advantage of the absorption of diffuse irradiance.
As mentioned earlier, it is possible to simulate the conventional positioning of commercial solar trackers using pvlib. This library utilizes the SPA (solar position algorithm) [
25] to calculate the sun’s position and subsequently determine the position that minimizes the difference between the PV panel surface normal and the sun’s position. When generating a frequency histogram of each angle’s occurrence, a uniform distribution can be observed, as shown in
Figure 6.
On the other hand, when comparing the frequency of occurrence for each angle by calculating the “infinite sheds model”, there is a higher frequency for the positioning in the horizontal position, as seen in
Figure 7. The frequency of positioning of the photovoltaic panels near the horizontal position is higher, indicating that the algorithm’s opportunities for improvement are concentrated during cloudier days.
Finally, in the last step, the focus was on developing an AI-based solar tracker algorithm. This task demanded more time because many machine learning topologies are known in the literature, resulting in numerous possibilities for creating a solar tracker algorithm with the proposed purpose. Therefore, the authors chose only four powerful techniques used in regression problems to analyze their performance as a solar tracker algorithm and save time.
To decide which AI-based technique would be the most suitable to act as a solar tracker algorithm, a comparison between four machine learning/deep learning techniques was made, analyzing the performance of each one about the optimized angles calculated. These are as follows:
Before evaluating the performance of each one, the dependence level between the input and output variables was studied using mutual information estimation [
26], i.e., the more significant the dependence, the higher the value of this index. In this study, the dependency between all input variables concerning the optimal angle of reference was evaluated, and these variables were chosen as input variables to be applied to machine learning techniques. The variables that scored higher than 0.3, as presented in
Table 3, were selected to filter out those that could have little influence.
The current timestamp was used in trigonometric calculations to create the variables
Hour_sin and
Hour_cos. This operation was adopted because azimuth and zenith are also trigonometric variables that could synergize more effectively during AI model training. The mathematical calculations applied are detailed in Equations (
1) and (
2), where
t is the index representing the current time instant,
h is the current hour, and
m is the current minute.
With the definition of the variable selection criterion established, the performance evaluation of each regression technique was conducted using MSE (mean squared error), MAE (mean absolute error), and ME (max error) as metrics to determine the best technique. Furthermore, parametrizations were carried out for each regression technique to optimize performance, such as altering the number of neurons and layers in neural networks and testing different numbers of trees in the random forest. After all parametrization tests for each artificial intelligence topology were completed, the random forest emerged as the most accurate technique across all metrics adopted, as shown in
Table 4.
Once the random forest was identified as the AI technique that delivered the best results according to the criteria used, further efforts were made to improve the algorithm’s performance. The first step involved evaluating its performance based on the combination of available input variables. Out of the 11 available variables, 1024 combinations were tested. The performance of the top 7 combinations was very similar, with differences only in the third decimal place, as seen in
Table 5.
Thus, following the results obtained in the created ranking, we decided to use the best-ranked combination of input variables in the RF model, as shown in the model schematic presented in
Figure 8. Up until this point, the solar tracker algorithm was composed of an RF model that infers the optimal angle according to the seven variables indicated.
Finally, the last action for improving the algorithm involved studying the moments when it faced the most significant challenges in accurately inferring the correct output angle from its application with a set of validation data in a power generation simulation. The simulations revealed that the most challenging times for the algorithm’s inference were during sunrise and sunset, or in other words, during backtracking.
In this context, the idea was to divide the algorithm into three parts, training three RF models to operate during each period of the day: one for morning backtracking (set from 05:30–07:30), another for “common time” (from 07:30–15:30), and a last one for afternoon backtracking (set from 15:30–17:30). For the remaining times that could still experience sunlight but were not covered in the pre-determined time windows, the SPA algorithm was applied to avoid interrupting the operation. In this way, the algorithm’s operation scheme can be summarized as follows in
Figure 9, where the night position denotes the application of the SPA, with the additional step of positioning the trackers at 10°.
The intelligent algorithm, meticulously developed, was poised for field application to validate its efficiency in a real scenario. However, certain practical aspects, such as determining the optimal time for the algorithm to operate on the trackers, are yet to be defined.
As a final step before applying the algorithm in a commercial solar plant, a thorough evaluation was conducted. This included determining the ideal time to act on the trackers, how the weather data (ghi and wind speed) would be used as input for the AI model, and whether the instantaneous value would be used or treated in a specific manner.
For the first evaluation, a large set of variables could be considered to determine the optimal time of action, such as the time the tracker spends to move from one angle to another, the energy cost to move the trackers, or fast variations in weather, to name a few examples. For simplicity, the ideal time to apply the angle inferred by the AI model would be 4 min, as this is the average time it takes for a conventional algorithm to update the tracker’s position by one degree.
For the input data, we chose to use a moving average of 4 min—the same interval used for the algorithm to act—to mitigate the effects of instantaneous weather changes, since the rapid passage of a cloud can lead to incorrect algorithm inferences. Because it cannot predict the exact timing of cloud movements, directly applying the measured instantaneous values could cause the algorithm to position for diffuse irradiance when conditions are more favorable for tracking the sun, and vice versa.
With these two final definitions, the algorithm was ready for its field implementation; the results can be seen in
Section 4.