1. Introduction
2. Data description
3. Decomposition model
3.1 Erbs
3.2 Reindl
3.3 Maxwell
3.4 Perez
4. Deep learning model
4.1 LSTM
4.2 GRU
5. Results and Discussion
6. Conclusions
Abbreviations
DNI : Direct normal irradiance (W/m2)
DHI : Diffuse horizontal irradiance (W/m2)
GHI : Global horizontal irradiance (W/m2)
GRU : Gated recurrent unit
LSTM : Long short-term memory
QC : Quality control
rMBE : Relative mean bias error (%)
rRMSE : Relative mean square error (%)
Nomenclature
: Extraterrestrial irradiance (W/m2)
: Clearness index
: Diffuse fraction
: Direct beam transmission
: Clear sky index
: Air mass
1. Introduction
Solar irradiance components such as global horizontal irradiance (GHI), direct normal irradiance (DNI), and diffuse horizontal irradiance (DHI) are important for solar energy applications. The GHI is important as a resource of solar energy information to provide greater photovoltaic (PV) power production under all sky conditions. DNI is an important component of global irradiance, particularly under cloudless situations, and it represents a solar resource that may be exploited by many types of concentrating solar technology (CST)1). DHI is also important because it is required for energy and building applications2). However, most solar irradiance measurements solely focus on GHI because the devices to measure DHI and DNI are costly. Furthermore, the devices such as a pyrheliometer to measure DNI require periodic maintenance and calibration because they are mounted on a sun tracker that moves according to the sun's position.
Over the past half-century, decomposition solar radiation models have been developed to estimate DNI and DHI in order to obtain a more accurate estimate of solar irradiance components. Most of the decomposition models use correlations between clearness index and diffuse fraction. The following formula is used to calculate the clearness index and diffuse fraction.
where denotes the clearness index, indicates the diffuse fraction, denotes the extraterrestrial irradiance, and represents the solar zenith angle. However, a major limitation of most of these models is that they have primarily been developed for hourly global solar radiation data. This presents a challenge for solar energy applications, as sub-hourly data are required to accurately estimate the rapid changes in solar irradiance that occur over short time intervals. Despite the importance of sub-hourly data for solar energy applications, there has been relatively little research on the development of decomposition models that can handle this level of temporal resolution. This has led to a reliance on hourly data, which may not capture the true variability of solar irradiance over shorter time intervals. The lack of accurate sub-hourly data has also led to limitations in the design and optimization of solar energy systems. Without accurate information on the rapid changes in solar irradiance, it is difficult to optimize the performance of solar energy systems and ensure maximum power generation.
Therefore, in this study, we evaluated the accuracy of well-known decomposition models using sub-hour solar irradiance data. The decomposition models are evaluated at several temporal resolutions (1 hr, 30 min, 15 min, 5 min, 1 min) to estimate DNI using solar irradiance data in Korea. In addition, the estimated DNI using the decomposition model is compared with deep learning models to determine which model is best suited for sub-hourly data resolution.
2. Data description
The instruments to measure GHI, DHI, and DNI were set up on the rooftop at Kookmin University in Seoul, South Korea (latitude: 37.61200° N, longitude: 126.99770° E). Solar irradiance data in 2021 are used in this study. Furthermore, to acquire the best and most accurate solar irradiance ground measurement data, a set of quality control (QC) and explicit standards are required. Data QC is the use of techniques or processes to determine if data meets overall quality goals and particular quality requirements. A set of QC procedures that adhere to the baseline surface radiation network (BSRN) recommended3) QC tests are provided below.
1. < 85°
2. 0 < GHI < 1.50 cos1.2 + 100 Wm-2
3. 0 < DHI < 0.95 cos1.2 + 50 Wm-2
4. 0 < DNI < Wm-2
5. Abs[(DNIcos + DHI ‒ GHI) / GHI] < 0.08; for < 75° and GHI > 50 Wm-2
6. Abs[(DNIcos + DHI ‒ GHI) / GHI] < 0.15; for ≥ 75° and GHI > 50 Wm-2
7. DHI / GHI < 1.05; for < 75° and GHI > 50 Wm-2
8. DHI / GHI < 1.10; for ≥ 75° and GHI > 50 Wm-2
Condition (1) when ( < 85°) is included to eliminate the low irradiance. This is due to the fact that the errors for solar radiation models are large at low solar elevations4). As a result, to achieve a good quality of data the data points that do not satisfy the QC procedure will be eliminated.
3. Decomposition model
3.1 Erbs
Erbs et al.5) introduced a correlation method that uses solar irradiance data from five different locations in the United States. They calculated hourly, daily, and monthly averages of this data to create a fourth-order polynomial equation. The equations of the Erbs model are expressed as:
3.2 Reindl
The Reindl6) model was created using data from five European and North American regions, and the solar zenith angle was introduced as a second input parameter to estimate diffuse fraction in addition to the clearness index.
3.3 Maxwell
The Maxwell model7), commonly known as the direct insolation simulation code (DISC) model, uses an exponential connection between direct beam transmittance and air mass and contains correlation functions with a parameter of clearness index. The primary variable influencing the connection between and is air mass, which may be computed using clear-sky direct beam transmittance and deviations from clear transmittance values.
where is direct beam transmittance, and is air mass, is clear-sky direct beam transmittance, is deviations from clear transmittance values, and , , are polynomial functions dependent on the clearness index.
3.4 Perez
To enhance the Maxwell model, the Perez model8) was created. The outcome of this model was given a correction coefficient function of normalized clearness index, solar zenith angle, and precipitable water to the result of Maxwell model.
Here, is the normalized clearness index, is direct normal irradiance, and is precipitable water.
4. Deep learning model
4.1 LSTM
Long-short term memory (LSTM) was first introduced by Hochreiter et al.9) as a temporal recurrent neural network (RNN) for learning long-term information dependence. When training a simple RNN, issues with vanishing and exploding gradients arise when the data sequence is too long to carry the information. Thus, the LSTM model was established to address these problems. The operation inside the cells of LSTM consists of three gates: input, output, and forget as shown in Fig. 1(a). It aids in either remembering or forgetting information via the forget gate, solving the vanishing and exploding gradient problems associated with long-distance information transmission.
where is the forget gate, is the input gate, is the value of candidate state memory, is the state of the memory cell, is the output gate, and are the activation functions, and is the final memory at the current time step.
4.2 GRU
Cho et al.10) proposed gated recurrent unit (GRU) as a more straightforward RNN architecture than LSTM. To remember important details and capture long-term dependencies, GRU is comparable to LSTM. GRU's strength lies in the fact that it is less complicated and takes less time to compute than LSTM, because it has fewer parameters. Therefore, GRU can be trained much more quickly than LSTM. GRU has two gates: update gate and reset gate. The amount of data that must be lost from the past is set by the reset gate, while the type of data that should be stored for the future is set by the update gate.
Fig. 1(b) depicts the architecture of the GRU’s cell, and the following equation describes the connection between GRU's inputs and outputs.
where is the reset gate, is the update gate, is the memory content, and are the activation functions, and is the final memory at the current time step.
5. Results and Discussion
The evaluation metrics used to evaluate the accuracy of the decomposition and deep learning models are relative root mean square error (rRMSE) and relative mean bias error (rMBE). Evaluation metrics that measure relative performance are preferred because they show percentage error and are thus more useful when data vary with time scale.
where , , , represents the actual output, estimated output, mean value of, mean value of actual output, and the number of samples respectively.
Fig. 2 shows the results of decomposition solar radiation models to estimate DNI at various temporal resolutions. All decomposition models tend to have greater error as the time scale decreases. Using every minute and hourly solar irradiance data, the rRMSE difference error for all models is greater than 4%. Table 1 provides a comprehensive summary of the results of decomposition models for different sky conditions. The sky conditions are categorized based on the clearness index value11). The sky conditions are divided into the overcast sky (), partly cloudy (), and overcast sky (). It indicates that the rRMSE values are the highest when using every minute data, while the hourly data produces the most accurate results in all-sky conditions. For this reason, deep learning models are used to estimate DNI from minute-level irradiance data in order to improve the performance of decomposition models.
Table 1
rRMSE of decomposition models at different temporal resolutions for various sky conditions
In decomposition and deep learning models, the correlation between the clearness index and the diffuse fraction is crucial for determining whether the model functions well. The difference between the decomposition and deep learning models can be seen in Fig. 3. It shows that decomposition models cannot cover the gray distribution, which represents the actual measured data. The deep learning models, on the other hand, demonstrated better results and showed high coverage. The scatter plot of the estimated and measured DNI of the decomposition and deep learning models is depicted in Fig. 4. Compared to decomposition models, the accuracy of deep learning models was observably enhanced. It demonstrates that the GRU and LSTM models outperform all decomposition models, with the coefficient of determination (R2) values of 0.947 and 0.948, respectively.
For greater comprehension, the comprehensive outcomes of each model are presented in Table 2, offering a detailed review. Additionally, Fig. 5 illustrates the variations in patterns observed in each model as the time scale is changed. In terms of rRMSE, the deep learning models outperformed all decomposition models, where the error is approximately 25% in every minute data resolution. The deep learning models also have the smallest value of rMBE, with values of -0.7% and -1.9% for GRU and LSTM, respectively. However, as the time scale increases, the accuracy of deep learning models decreases. For example, the rRMSE for LSTM rises to 27.0% when the data resolution is every 15 minutes and 27.5% using hourly data resolution. In contrast to decomposition models, the accuracy of all models improves as the time scale increases. Compared to minute data resolution, the rRMSE of decomposition models decreases by approximately 3% when using 15-minute data resolution and by approximately 5% when using hourly data resolution.
Table 2
rRMSE and rMBE of decomposition and deep learning models at different temporal resolutions
6. Conclusions
To assess the accuracy of the decomposition solar radiation models in estimating DNI at various temporal resolutions, four such models were employed. The findings indicated an increase in error as the temporal scale was reduced to sub-hour intervals, with the rRMSE increasing by more than 4% when the temporal scale was reduced from hourly to minutely. The decomposition models exhibited rRMSE results exceeding 33% at every minute time scale. To improve the accuracy of DNI estimation, deep learning models such as GRU and LSTM were utilized as a solution. The results demonstrate that the deep learning models outperform the decomposition models across all temporal resolutions, with the rRMSE value being less than 26% and 29% for 1-minute and 1-hour temporal resolutions, respectively. However, due to a reduction in training data, deep learning models' accuracy decreased as the temporal scale increased. The results indicate that deep learning models are more suitable for estimating DNI at sub-hourly intervals. Moreover, these outcomes offer valuable insights into the effectiveness of decomposition and deep learning models in estimating solar radiation. This information can facilitate the creation of more precise models for assessing solar resources, which can have implications for the development and optimization of solar energy systems.







