1. Introduction
As transportation hubs connecting various cities, high-speed railway stations play an increasingly vital role in the social and economic life of China [
1]. The waiting hall is the core space of the high-speed railway station [
2], where passengers stay for a long time; therefore, passengers have increasingly high requirements for the quality of the light environment in the waiting hall [
3], and it is urgently needed to build a healthy and comfortable area [
1,
4]. People’s needs are multi-level from low to high, and with reference to Maslow’s hierarchy of needs model, people will naturally pursue and aspire to higher comfort and health needs after satisfying their survival needs and safety needs [
5]. It has been highlighted that the use of sunlight in buildings not only reduces lighting energy consumption, but also helps to improve the comfort of building occupants [
6]. In addition, many studies have shown that bringing sunlight into indoor spaces can improve visual quality and reduce a range of symptoms, such as depression and seasonal affective disorder (SAD) [
7,
8]. Daylight is considered an indispensable factor in the design of waiting halls. Natural light shows the importance of keeping the physiological and psychological balance of the building users [
9]. Muñoz-González et al., found sunshine affects people’s circadian rhythm, and the lack of natural lighting in the home office can easily lead to headaches, fatigue and inattention [
10]. In recent years, many experts have studied and discussed the light environment of architectural spaces under different conditions and environments, but scholars have paid more attention to the comfort of the indoor light environment of residential buildings [
11] and office buildings [
12], while less attention has been paid to the built environment of the waiting hall, which are more open and have a higher crowd density. In addition, most of the relevant research results focus on the indoor thermal environment of waiting halls [
13] and ignore the research on the light environment. With the popularization of the concept of green health, the lighting of the waiting hall is valued by passengers. As mentioned above, an appropriate indoor daylighting design can greatly reduce the energy consumption of artificial lighting, and facilitate early achievement of the dual goals of carbon peaking and carbon neutrality.
In order to guide the lighting design more effectively, the accuracy of the lighting simulation technology has raised higher requirements. The natural illuminance and color rendering provided by sunlight is hard to replace with artificial light sources, and the sunlight quality and energy-saving effect are better than artificial lighting [
14]. However, the condition of the natural light environment is constantly changing. In order to accurately evaluate the real condition of the daylight environment at the initial stage of scheme design, designers need to use lighting simulation technology to simulate and predict the condition of natural light [
15]. Natural light is dynamic and it is difficult to control its dynamic changes; it changes with geography, time and weather conditions, which increases the difficulty and accuracy of computer simulation [
16]. With the high-speed advancement of science and technology, daylighting simulation technology has been significantly developed, and natural daylighting simulation technology has changed from static daylighting simulation to dynamic daylighting simulation. The traditional static daylighting simulation method can only simulate a single sky condition, while the dynamic daylighting simulation technology makes up for the shortcomings of the static daylighting simulation method. By optimizing and upgrading the more advanced core algorithm, dynamic sky conditions can be predicted. In addition, the accuracy of daylighting simulation analysis technology is improving. By integrating the information on sky meteorology, geographical location and environmental conditions, the research on the accuracy of the skylight climate model has made great progress, which can greatly improve the accuracy of waiting hall lighting simulation. Ruiz proposed a new daylight dynamic indicator, partial daylight autonomy (DAp), which was verified under real sky conditions. This method can accurately quantify the accurate switching time and energy consumption of seasonal space electric lighting [
17]. Susa-Páez made use of useful daylight illuminance to simulate the dynamic lighting of the atrium and adjacent space to assess the lighting potential of the vertical central atrium, which contributes to improving the light comfort of the building’s users [
18].
Although the above research can improve the precision of lighting design, the current environmental simulation tools have professional barriers, and there is a lack of reliable interfaces among various environmental simulation and evaluation tools. A large amount of environmental simulation analysis software is time-consuming and inefficient, and cannot provide effective support for the performance-driven design process. Therefore, it is vital to improve the efficiency of the corresponding lighting simulation tools and the friendliness of tool operation, and increase the research on the universal plug-in interface and platform. e Silva introduced a parametric design method to measure the energy performance of buildings; all simulations and optimizations were accomplished on a single platform, using Grasshopper and its plug-ins Ladybug, Honeybee and Galapagos, compared with the traditional method, and it not only improves the accuracy of simulation but also saves time [
19]. Kamel set up a parameterized workflow for evaluating outdoor thermal comfort. Compared with other current methods, this workflow has considerable universality through Ladybug and Honeybee, and Butterfly plug-ins on the grasshopper interface, and can be used for parameterized research on indoor and outdoor environments [
20]. The above study shows that the natural light performance simulation tool developed based on the parametric simulation platform can significantly simplify the modeling process and improve the efficiency of natural light performance simulation.
In addition, in order to assess the natural lighting performance of the waiting hall in a timely manner, an efficient and accurate prediction method needs to be found. Traditional forecasting methods use software such as EnergyPlus V9.0.1, DesignBuilder 7.0, Ecotect 2021, etc., which require input parameters involving a large number of detailed descriptions of space and skin, building structure, equipment, operation mode, etc., and lots of information is difficult to obtain in the early design stages. The limitations of traditional forecasting models make it difficult to predict the physical environment of buildings efficiently [
21,
22,
23], while machine learning makes up for the shortcomings of traditional tools in terms of efficiency and accuracy. In order to quickly obtain energy-related feedback in the early design stage, Hygh developed a multivariate linear regression model based on a dataset of 27 design parameters. It was found that it is very consistent with the simulation results and becomes an effective decision support tool for designers [
24]. Turhan used artificial neural networks to forecast the thermal load of existing design parameters. The results show that under the given conditions, the prediction results of the artificial neural network were highly consistent with those of the building energy simulation tool, with a prediction success rate of 0.977 [
25]. However, there are limitations when traditional machine learning algorithms try to learn and predict the light environment of the waiting hall. Traditional machine learning is not good at dealing with nonlinear problems. Aiming at tackling this problem, this paper proposes a gradient boosting regression tree (GBRT) prediction model [
26], which has high prediction accuracy and efficiency in dealing with high-dimensional nonlinear problems, as an interpretable machine learning theory can make the model transparent, allowing designers to understand the decision-making process and build trust between people and the model. On this basis, by introducing explainability into the dynamic light environment prediction in waiting halls, the interpretation of inductive preferences of the model is added to the harvest regression model, and the different contributions of design factors to light environment metrics are summarized [
27].
Reasonable natural lighting design is of great significance to the waiting hall. Based on the regional characteristics of cold regions, this paper puts forward a digital design process for the light environment of waiting halls in cold regions under the background of flourishing “performance-driven” design thinking [
28], combined with the simulation technology of green building performance [
29] and the parametric design method [
30]. This paper aims to achieve the following objectives: (1) To construct a parametric design process to remedy the problems of the insufficient consideration of spatial parameters and long optimization time of existing simulation methods. (2) To conduct a parametric simulation of design parameters affecting the waiting hall, so as to construct a sample dataset affecting the lighting performance of the waiting hall. (3) The GBRT prediction model constructed by the sample dataset helps designers to make rapid predictions of lighting performance in early design stages, and then obtains the ranking of each design parameter on the lighting performance of the waiting hall, and derives the key parameters contributing to the indoor light environment. (4) To provide design suggestions for the light environment in the waiting halls to provide decision support for sustainable architectural lighting design.
3. Methodology
This paper takes the waiting hall of Harbin West Station as a research case, and firstly elaborates the digital design process of the waiting hall light environment, as shown in
Figure 2, and then starts the case study of the design strategy of the waiting hall light environment based on the natural lighting performance.
The flow of this research includes three steps:
Step 1: Determine the parameter range of building forms and window openings, build a parametric model of the waiting hall, integrate architectural and environmental information.
Step 2: The relationship between space and performance is constructed, and the waiting hall light environment is simulated based on a parametric model, and a sample database of a large number of cases is established.
Step 3: The GBRT model is established and validated based on a large number of sample data, and the degree of influence of each design parameter on the light environment is explored through interpretability analysis and the light environment design strategy is obtained.
These steps are described in detail in the following subsections.
3.1. Building and Environmental Information Integration
Building and environment information integration is the first subprocess of the digital design of the waiting hall light environment. Based on the design objectives, design conditions and design parameters, this subprocess applies to build information modeling technology, parametric programming technology and building performance simulation technology to integrate building information, environmental information and building performance indexes based on the characteristics of the cold regional environment and design conditions. Firstly, the study starts with the integration of architectural and environmental information, applies the digital design strategy under the consideration of natural lighting performance, selects architectural geometric information such as building orientation, building height, section aspect ratio, plan aspect ratio and skylight ratio as design parameters, and then combines the results of sampling survey of design parameters; the numerical constraints of design parameters are shown in
Table 1.
Meanwhile, sDA, UDI and DGP
exceed are taken as the optimization design objectives. The building environment information model is used to orderly overlay the building information of material and structure, geometry of the waiting hall. The initial model of the waiting hall established has a rectangular plan format. The length of east–west and north–south directions are 318 m and 68 m, respectively, the section form is curved, and the height of the waiting hall is 18 m. Regarding the variation in the building height, the sidewalls are adjusted, while the roof shape remains the same. According to the research and literature, the widely used strip skylight is adopted, 28 strip skylights are evenly distributed, the width of each skylight is 1.0 m and the ratio of skylight is 0.15. For the change in skylight area, the width of skylight is mainly adjusted while maintaining the layout form of skylight. In order to create a more realistic light environment in the waiting hall, the strip skylight is divided by 26 grids across the skylight opening, and the width of each grid is 50 mm, which divides the strip skylight into 27 parts, with the arc length of each part being 1.6 m, and the layout form of the skylight is described in detail by axonometric drawings. In addition, the thickness of the roof was set to 300 mm after reviewing the literature. Meanwhile, two side windows are set on the east and west sides of the waiting hall; because the depth of the waiting hall is large, it mainly relies on the skylight. Since the waiting hall is deeper and mainly relies on the skylight for lighting, the east and west side windows are set to constant values. In order to obtain more accurate natural lighting prediction accuracy, the optical properties of various materials in the waiting hall are set, including the reflection ratios of walls, ceilings, floors and the transmittance of window glass. The reflectance of white wall and ceiling in the waiting hall is 0.805, the reflectance of beige ground is 0.500 and the transmittance of skylight and side windows are 0.69. The constructed parametric model is shown in
Figure 3.
To construct a multi-level building information parameter network, it is necessary to compound the information of corresponding module orientation, shape, window opening design elements and their subdecision variables. The first type of decision variable consists of building orientation design parameters. In this paper, taking the cold city of Harbin as an example, according to field research, the experimentally selected building orientation value range is set as −30~30°, and the modulus of the parameters is set as 5°. The second type of decision variable consists of three parameters that affect the design of the building shape: height, section aspect ratio, plan aspect ratio of the waiting hall. After reviewing the literature and actual research, the range of the plan aspect ratio is 1.0~6.0, and the modulus of the parameter is 0.5 based on the time of optimization calculation. The value range and modulus of building height and section aspect ratio are shown in
Table 2. The third category of decision variables is to optimize the design parameter consisting of skylight ratio, which is set from 0.1 to 0.3, and the modulus of the parameter is 0.05, so as to ensure the optimal ratio of skylight openings. The three types of decision variables are integrated. The parametric model facilitates the architects to adjust the morphology of the waiting hall at a later stage and avoid the time wastage caused by repetitive modeling.
3.2. Construction of Relationship between Space and Performance
The subprocess of space–performance mapping relationship construction is the second subprocess of the digital design of the waiting hall, in which the designer starts from the two aspects of building performance design objectives and building space design parameters, and applies building performance simulation technology to construct the mapping relationship between building space and building performance objectives.
The design objectives of the natural lighting performance simulation are sDA, UDI and DGPexceed. The Honeybee and Ladybug digital technology platform was used to simulate the natural lighting performance of the waiting hall, and the simulation data were imported into the storage module to form a callable database. The process was divided into three parts: the pre-collection of simulation data, the verification of software simulation accuracy and the simulation of three natural lighting performance indicators.
3.2.1. Pre-Collection of Simulation Data
After establishing the parametric morphological model, relevant parameters need to be set to overlay the building environment information such as meteorological data, geographic location, sky model, material optical properties and operation time in an orderly manner to prepare for the performance simulation of natural lighting in the waiting hall.
First of all, the simulation data are collected in advance, which can be divided into building sky model, giving material structure properties and setting environmental parameters. The building case is located in Harbin, a typical cold city, with an average annual sunshine time of 4.4 h and a short winter sunshine time. As can be seen from
Figure 4, the monthly average sunshine radiation in Harbin reaches the maximum value in June, then gradually decreases, and reaches the minimum value in winter. In view of the distinct regional climate characteristics in cold regions, this study started from the actual observation, corrected the Perez sky model by using the measured data and generated a sky model reflecting the real sky state in Harbin, namely the modified Perez sky model, which improved the accuracy of reflecting the local light climate environment. The sky brightness distribution is obtained by calculating the sky clearness index and sky brightness index from the measured sky radiation values. The Perez sky model based on the photoclimatic characteristics of the cold region is obtained by replacing the radiation values in the database with the sky brightness distribution calculated based on the measured data, and thus correcting the Perez sky used in the dynamic lighting simulation process. The replacement of radiation values in the database with measured values is intended for comparison with specific summer and winter measurements. The six days of measured solar radiation data were imported into the Harbin weather data package, changing only the solar radiation values for these six simultaneous days, and not for the rest of the year’s weather data. Then, based on the measured optical properties such as reflectivity and transmittance of building materials in the waiting hall, the designer created the optical property dataset of commonly used materials in the waiting hall. The designer can call the optical property data of different materials according to the needs, and build an adaptive correlation relationship with the parameters of the waiting hall, so as to avoid the repeated modeling caused by the adjustment scheme and reduce the time consumption of simulation experiments. In this paper, according to the optical properties of the waiting hall and indoor materials needed for natural lighting performance simulation and the data requirements of the sky model, the data of the waiting hall and environment in cold regions are imported into simulation software such as Ladybug 1.6 and Radiance 5.4a, and the simulation software are used to calculate the natural lighting performance index.
3.2.2. Test Verification of Software Simulation Accuracy
In the simulation of natural lighting performance, it is necessary to verify the accuracy and reliability of software simulation. The light environment of the waiting hall in Harbin West Railway Station was measured on the spot. The height of the test was 0.75 m, and 32 measuring points were selected. The testing time was from 1 August to 3 August 2019 and from 11 December to 13 December 2019, and the measuring time was from 8:00 to 17:00. During the actual measurement, all the artificial lighting devices in the waiting hall were turned off and only had natural lighting. The distribution of measurement points is shown in
Figure 5.
In order to obtain the Perez sky model based on local photoclimatic characteristics, the outdoor sky radiation value was measured at the same time as the indoor illuminance during the actual measurement, and the observation site was located in the East Square of Harbin West Station. Sky conditions during the measurement period included cloudy skies and sunny and cloudy weather conditions. The weather conditions during the measurement included cloudy, clear and cloudy. The measurement meets the relevant requirements of “Light Measurement Methods” (GB/T 5699-2013). The experiments were conducted in two seasons, summer and winter, and the test time points were from 8:00 to 17:00. The solar radiation at the measurement points was recorded once every 1 h, and the arithmetic mean of all measured values at each measurement point was used as the final result for analysis.
Figure 6 shows the measured outdoor radiation values.
The simulated and measured values were imported into SPSS 25 software, and the Pearson correlation between 576 groups of simulated and measured data was analyzed. The analysis results show that the correlation coefficient between simulated and measured data of natural lighting illuminance is 0.987, which is highly correlated. The significant p-value is 0.000, less than 0.01, which is highly statistically significant. It can be seen that there is a very significant correlation between the measured data and the simulated data. In addition, root mean square error was introduced to verify the accuracy of the data. According to the calculation results, root mean square error is 68.3, which belongs to the allowable error range. Therefore, software such as Ladybug and Honeybee are more accurate in the parametric simulation of natural lighting performance in the waiting hall.
3.2.3. Simulation of Natural Lighting Performance Index
After completing the parametric geometric model construction, material parameter input, environment parameter setting and weather data and sky model input, the information of building and environment is input into the calculation engines such as Radiance and Daysim to construct the mapping relationship between design parameters and performance of the waiting hall, and the calculated performance index data are fed back to the designer through parametric simulation leveling; thus, this guides the design of natural lighting in the waiting hall. The reliability and accuracy of Radiance and Daysim have been verified in existing domestic and international studies to ensure the authenticity and reliability of the data. However, they have the deficiency of not being able to interact directly with the user. The performance simulation data interaction module of the software can be invoked in Grasshopper to link them and provide feedback to the designer through visualization. According to the above-mentioned link to build the sky model and other links for the preliminary preparation of simulation data, the Honeybee year-round sunlight simulation method was applied to simulate the natural lighting performance of the waiting hall. In the sDA and UDI simulations, the floor slab of the waiting hall was selected as the test grid surface for the natural lighting performance study, with a measurement point height of 0.75 m and a simulated grid distance of 1 m. In the DGP
exceed simulations, the test point of DGP
exceed was selected on the central axis of the waiting hall, and the measurement point was 3000 mm from the nearest wall. The viewpoint was adjusted to face the distant side windows to obtain a wider field of view. The difference between light and dark in this area is large, and it is easy to generate glare. The simulation procedure flow is shown in
Figure 7.
3.3. Gradient Boosted Regression Trees (GBRT) Principle and Model Construction
There are many design parameters affecting the light environment of the waiting hall, and designers predict the natural lighting level of the waiting hall in the pre-program, mostly using traditional prediction models; but these have shortcomings of a low accuracy and being time-consuming, and so, in order to make up for these shortcomings, GBRT prediction model was proposed in combination with parametric design. Firstly, we used the advantages of high accuracy and easy adjustment of parametric simulation to batch simulate the design parameters and construct the dataset of design parameters and lighting indexes; then we used the dataset to train the GBRT prediction model and verify the accuracy of the constructed GBRT prediction model; and finally, we used the GBRT prediction model to predict the lighting level of the waiting hall. The interpretability analysis of the GBRT prediction model can link the contribution of each design parameter to the indoor light environment. When a lighting index does not meet the indoor lighting demand, the designer can focus on adjusting the design parameters with a higher degree of influence while eliminating the design parameters with lower relevance, which greatly improves the design efficiency while ensuring a higher design accuracy.
3.3.1. Principle of Model Computation
The gradient boosting regression tree is derived from the integrated learning boosting algorithm and has upgraded it [
26]. The basic principle is to construct M different base learners through several iterations to generate strong learners to achieve the final combination. The base learner of the GBRT model is a regression tree, which divides the feature space into different regions and gives each region its corresponding value, and then obtains the predicted value by dividing the data into different regions. Unlike the traditional boosting algorithm, each iteration of GBRT is designed to reduce the residuals of the previous model and to build the base learner in the direction of the gradient of residual reduction. The workflow of the GBRT model is shown in
Figure 8.
The main process of GBRT modeling is as follows:
Training samples for the input model, T = {(x1, y1), (x2, y2), …, (xn, yn)}, loss function L(y, f(x)), used to calculate the difference between the target value and the calculated value.
Step 1, initialize the weak learner:
where
f0(
x) is a tree with only one root node,
L(
yi,
c) is the loss function,
c is the constant that minimizes the loss function.
Step 2, for the number of iterations, m = 1,2…, M
- (a)
For i = 1, 2, …, N, calculate the negative gradient value of the loss function and use it as an estimate of the residuals:
- (b)
rmi fits a regression tree to obtain the leaf node region Rmj. Predictions are made for the leaf node region of the decision tree to fit an approximation of the residuals.
- (c)
For j = 1, 2, …, J, linear search is used to estimate the values in the range of leaf nodes and minimize the loss function. The optimal fitted values are obtained as follows:
Step 3: After the iteration, it forms the GBRT strong learner, which can be expressed as:
3.3.2. Model Creation and Validation
In this study, based on Rhino and Grasshopper’s parametric modeling, five influencing factors, such as building orientation, building height, section aspect ratio, plan aspect ratio and skylight ratio, which affect the natural lighting performance of waiting halls in cold regions, are selected as the input variables of GBRT model, namely
Xi = {
x1,
x2, …,
x5}. Taking the lighting evaluation index as the output variable of the model, namely
yi, in this paper, the light environment was simulated using Ladybug and Honeybee components in the parametric platform, and 100 sets of simulated data were obtained to form a complete dataset. The data feature set was divided into training and test sets in the ratio of 7:3. Among them, 70 sets of data constitute the training dataset and the remaining 30 sets of data constitute the test set. The part of the constructed dataset is shown in
Table 2. Parameter selection and construction of prediction models were performed on the training set, and prediction and evaluation were performed on the test set. Before calculation GS, grid search (GS) was used to adjust the design parameters, and the Z-Score algorithm was used to standardize all the data.
The coefficient of determination (R
2), root mean square error (RMSE) were chosen as the evaluation indices of the model to assess its accuracy with the following expressions:
5. Discussion
Based on the regional environmental characteristics of short sunshine time and a harsh climate in cold regions, the digital design strategy of a waiting hall in cold regions is summarized based on the design parameters of orientation, shape and window opening.
5.1. Design Strategy of Orientation Parameters
The building orientation and the light environment of the waiting hall influence each other, so as to explore the inherent law of building orientation and lighting performance, and thus realize a sustainable building design. With the change in building orientation, the amount of sunshine radiation received by the waiting hall gradually changes. sDA and UDI are at a minimum when the building orientation is −30°, and reach the maximum when the building orientation is 0°. The probability of uncomfortable glare is lowest when the building is oriented in the range of 0~10°.
To sum up, the change in building orientation has a great influence on the lighting evaluation indexes sDA, UDI and DGPexceed. Better indoor natural lighting performance can be achieved when the building orientation is 0°, which is conducive to improving the utilization rate of natural lighting and reducing energy consumption. In addition, according to the importance ranking of the GBRT model, the building orientation has little influence on the lighting performance of the waiting hall compared to other design parameters.
5.2. Design Strategy of Shape Parameters
In the design of the vertical dimension, the building height is an important factor affecting the natural lighting of the waiting hall. The waiting hall is a large space; if the building height is too low, it will cause a sense of depression. However, too high a building height will produce uneven illumination, so the building height needs to be chosen within a reasonable range under the premise of meeting the indoor light comfort. The building height has a great influence on sDA; the increase in building height helps to make full use of natural lighting. UDI100–2000 shows the opposite trend to that of sDA. The value of UDI100–2000 shows a slow decline with the increase in the building height and then tends to bottleneck. It was found that the building height in the range of 17~21 m can significantly improve the ability to use natural light and the lighting energy-saving potential while ensuring a suitable proportion of natural light in the range of 100 to 2000 lx effective illuminance data on the working surface throughout the year. However, attention should be paid to the occurrence of uncomfortable glare when designing the lighting of waiting halls.
The effect on sDA, UDI and DGPexceed is large. The sDA was negatively correlated with SAR. The results showed that sDA was maintained at higher light levels when the SAR was in the range of 0.1~0.4, which was conducive to improving the average natural light performance of the overall waiting hall. UDI shows a trend of increasing at first and then decreasing. When the value of UDI100–2000 is significantly increased between 0.1 and 0.3, the value of SAR in the range of 0.3~0.4 greatly increases the proportion of data with effective illumination on the working surface throughout the year, while the probability of uncomfortable glare is smaller. In addition, SAR was found to have the most important effect on sDA compared to other architectural design parameters.
Based on the constraints of the cold climate and geographical environment, the main plan is mostly in the form of a regular rectangle; therefore, in the plan design of the waiting hall, attention should be paid to the control of the plan aspect ratio, and an inappropriate ratio will easily lead to the increase in energy consumption. PAR has a significant impact on sDA as sDA shows a slow decreasing trend with the growth of PAR, while UDI100–2000 shows the opposite trend: with the increase in PAR, the UDI100–2000 value gradually increases. When the value of PAR is 4.5, it can greatly increase the proportion of effective lighting data on the working surface throughout the year while reducing the occurrence of glare in the waiting hall. According to the importance ranking of the GBRT model, PAR has a significant impact on DGPexceed.
5.3. Design Strategy of Window Parameters
The increase in the skylight ratio triggers the increase in sDA. With the increase in the skylight ratio, the probability of meeting the minimum illumination of the visual requirements on the working surface throughout the year can be significantly increased, and the data of the effective illumination on the working surface throughout the year can also be improved to some extent. At the same time, the probability of uncomfortable glare increases with the increase in the skylight ratio. Therefore, appropriate shading facilities should be used to reduce the occurrence of glare. At the same time, compared with other design parameters, the skylight design parameters have the greatest influence on UDI. Designers can choose the appropriate skylight ratio to meet the requirements of modeling and creative ideas.