Abstract
Solar irrigation systems should become more practical and efficient as technology advances. Automation and AIbased technologies can optimize solar energy use for irrigation while reducing environmental impacts and costs. These innovations have the potential to make agriculture more environmentally friendly and sustainable. Solar irrigation system implementation can be hampered by a lack of technical expertise in installation, operation, and maintenance. It must be technically and economically feasible to be practical and continuous. Due to weather and solar irradiation, photovoltaic power generation is difficult for highefficiency irrigation systems. As a result, more precise photovoltaic output calculations could improve solar power systems. Customers should benefit from increased power plant versatility and highquality electricity. As a result, an artificial intelligencepowered automated irrigation powergeneration system may improve the existing efficiency. To predict highefficiency irrigation system power outputs, this study proposed a spatial and temporal attention blockbased longshortterm memory (LSTM) model. Using MSE, RMSE, and MAE, the results have been compared to preexisting ML and a simple LSTM network. Moreover, it has been found that our model outperformed cuttingedge methods. MAPE was improved by 6–7% by increasing Look Back (LB) and Look Forward (LF). Future goals include adapting the technology for wind power production and improving the proposed model to harness customer behavior to improve forecasting accuracy.
Introduction
The current exorbitant market prices of photon capture devices necessitate the accurate determination of dimensions for photovoltaic (PV) solar power installations prior to conducting any subsequent analysis of their performance in a specific application^{1,2,3,4}. In order to maximize the profitability of an installation within a limited timeframe, it is imperative to accurately ascertain the power requirements for a given application. Numerous agricultural regions, particularly Char lands, coastal areas, and hilly terrains, lack access to grid electricity, posing a significant challenge for irrigation purposes^{5,6}. The utilization of photovoltaic (PV) solar power exhibits considerable potential in various domains, particularly in nations characterized by abundant solar radiation. Notably, the application of PV solar power for water pumping purposes, specifically for irrigating specific crops, emerges as a highly promising avenue^{7,8}. The water transportation pumps are equipped with solar cells. The solar energy that is absorbed by the cells is subsequently transformed into electrical energy through the utilization of a generator. This electrical energy is then supplied to an electric motor, which in turn powers the pump^{9}. The majority of conventional pump systems primarily operate using either a diesel engine or connecting to the local power grid. Nevertheless, when comparing solar pumps to these two modes of cooperation, certain drawbacks become apparent^{10}.
Nonetheless, the inherent instability of the technology remains a significant concern as it hinders the widespread integration of solar power into global power grids^{11}. Hence, accurate prediction of solar electricity generation plays a crucial role in enhancing the utilization of solar energy and bolstering the resilience of the power system. The expansion of the solar home system sector has led to the collection of significant timeseries data through the use of smart meters. This methodology guarantees the systematic production of precise data, the automated gathering of measurements, and the prompt retrieval of data. The generation of photovoltaic (PV) energy offers numerous advantages to various global markets due to its ability to align peak production with periods of high peak load. Morjaria et al. assert that the cost of solar power production has significantly declined, rendering it highly competitive in various international markets^{4,12}. The cost of renewable energy is comparable to or lower than that of power generated from nonrenewable sources. Consequently, there has been a notable increase in both the scale and quantity of solar power installations, reaching capacities of several hundred megawatts^{13}.
In order to ensure the reliability and effectiveness of power systems, it is imperative to employ robust and reliable models for estimating the power outputs of Photovoltaic systems. This is particularly crucial due to the urgent requirement of integrating environmentally friendly energy sources into grid systems^{14}. A limited number of factors exert influence on the overall transitivity of solar irradiance as it traverses the earth’s atmosphere. These factors include cloud cover, moisture content, and air pressure^{15,16}. The presence of cloud cover has a substantial effect on transitivity, as indicated by previous studies^{17,18}. Because of their broad availability, wind and solar photovoltaic energy are increasingly being integrated into electrical components. It is projected that solar energy will account for approximately 11% of power generation, and wind energy will contribute approximately 12% by the year 2050^{19,20}. There is a growing focus among governments and individuals in the solar energy sector on the development and utilization of smallscale distributed facilities and selfconsumption plants. The user’s text does not contain any information to rewrite in an academic manner. The development of accurate and costeffective forecasting models is crucial to the successful integration of largescale solar installations and small dispersed systems into electrical infrastructures. However, the unpredictable nature of solar energy poses a challenge to its integration into the grid^{22}. Furthermore, there has been a notable rise in the global installed photovoltaic capacity, which has now surpassed nearly 100 GW^{23}.
Physical models have demonstrated a high degree of accuracy in predicting weather patterns under consistent conditions. However, their ability to provide enhanced performance in the face of major changes in conditions remains uncertain. The authors of^{24} and^{25} examined a number of commonly used physical models. On the other hand, statistical models are developed based on the mathematical associations between one or more independent variables and particular dependent variables. These findings provide a solid basis for drawing inferences and making projections. The effectiveness of statistical models is greatly influenced by the quality of input data and variations in temporal frames. Among the most wellknown and widelyused statistical models are the single exponential model, the dynamically moving average model (ARMA), and its more complex versions (ARIMA, ARMAX, and SARIMA)^{26,27,28,29,30,31}. Machine learning models built from preexisting statistical models have also been used in various areas of engineering and in scientific study.
The four steps that all these models adhere to are data preprocessing, algorithm training, machine learning model creation, forecast generation, and forecast refinement^{32}. ML architectures such as Artificial Neural Networks (ANN)^{33}, Extreme Learning Machines (ELM)^{34}, and Support Vector Machines (SVM)^{35} are commonly employed for the purpose of estimating power output. Hybrid models mix many architectures for deep learning. Recurrent neural networks (RNNs) and deep learning techniques, which were originally designed for image processing tasks, are just a couple of the many deep learning models that find application across a wide range of domains^{36,37,38}. The projections of photovoltaic electricity using the aforementioned models have shown positive results. Neelesh et al.^{39} proposed a model for optimal onsite solar power generation, and improved the capacity of storage to improve the solar irrigation system. The mechanism was based on several steps such as as data acquisition, soil moisture forecasting, smart irrigation scheduling, and energy management scheme. The suggested system delivered approximately 9.24% water and energy savings for potato cultivation under full irrigation, thereby bolstering the WaterEnergyFood Nexus at the field level. Abhishek et al.^{40} introduced a novel statistical method for forecasting environmental parameters like soil moisture, temperature, and more. Utilizing an AutoRegressive Integrated Moving Average (ARIMA) model, various hyperparameter configurations were applied to the local unit’s recorded data. The forecasts obtained from the trained ARIMA model undergo validation using four distinct evaluation metrics: Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R2. Results indicated that the model accurately predicts observations with a 99% R2 score. The objective of the research in^{41} was to augment the solar energy collection capacity of Unmanned Aerial Vehicles (UAVs) by integrating solar power to enhance overall energy harvesting systems. The proposed approach merged two distinct renewable systems to harness electricity from the surroundings. Simulation outcomes leveraged an ensemble machine learning algorithm that integrated environmental factors and UAV data to forecast solar power output.
According to reference^{42}, this development enabled the extraction of photovoltaic (PV) energy characteristics with enhanced precision. The validation of the PVNet model’s reliability was conducted by employing four performance measures^{43}. The authors, in reference to^{44}, evaluate the efficacy of photovoltaic (PV) systems by considering the impacts of several parameters. Furthermore, the researchers took into account three separate types of Silicon Photovoltaic (PV) technologies, namely Polycrystalline, Monocrystalline, and Amorphous. Compared to other methods, polycrystalline panels have been shown to be more effective and have a favorable economic impact on power prices. A Convolutional Neural NetworkLong ShortTerm Memory (CNNLSTM) design for a photovoltaic (PV) plant with a power capacity of 15 kilowatts (kW) was created in reference^{45}. The power generation forecasting model consisted of multiple Long ShortTerm Memory (LSTM) layers. The aforementioned observations have served as motivation for previous studies that explore the utilization of hybrid deeplearning architectures in order to forecast the shortterm photovoltaic energy output for the subsequent day. The efficacy of the suggested approach was evaluated by contrasting machine learning and deep learning algorithms. The current research suggests a hybrid model for selfconsumption photovoltaic installations with the aim of ensuring grid stability, in contrast to previous studies that primarily focused on the development of extensive knowledge and machine learning models for largescale photovoltaic (PV) plants.
Deep learning models ensure that the right parameters are chosen when used. Nevertheless, the new architecture shows acceptable performance in predicting energy output in terms of accuracy improvements over the original LSTM model. The experimental results show that the suggested model outperforms both deep learning (DL) and machine learning (ML) models in terms of performance.
This paper introduces an attentionbased Long ShortTerm Memory (LSTM) model that is specifically developed for the purpose of forecasting the power output of a solar plant over various time intervals in the past and future. The dataset used in this study is derived from a photovoltaic facility that is connected to a field irrigation system. It consists of a wide range of variables, such as power generation, climatic factors, and facility power consumption. This study provides a thorough examination of the spatial–temporal attention mechanism, elucidating its visualization and interpretation. This study examines the relationship between the historical data range and the accuracy of forecasting, with the aim of determining the ideal parameters for the predictive model. The effectiveness of the spatialattentionbased LSTM model is assessed through a sequence of tests, demonstrating its favorable influence on the accuracy of forecasting and the robustness of the model. Moreover, a comparison analysis is undertaken to contrast the performance of the proposed attentionbased LSTM model with that of different machine learning models, including Decision Trees, K Nearest Neighbors, Support Vector Regression, and Naive Bayes. The evaluation of model performance is conducted with a rigorous approach, employing error measurements such as Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE).
LSTM layers are designed to capture sequential dependencies over time. By incorporating spatiotemporal attention in our proposed system, the model focuses on specific spatial regions at different time steps. This allows the network to dynamically adjust its attention to relevant spatial features as the sequence evolves, capturing intricate temporal patterns. Moreover, this mechanism enhances the network’s capability to learn discriminative features, focusing on regions that contribute more significantly to the task at hand.
Methodology and materials
This section provides a concise overview of the architecture of the proposed system. The entire process of predicting energy generation relies on multiple stages. Initially, the input undergoes preprocessing by being passed through a pipeline. Subsequently, the preprocessed variables are inputted into the model to undergo feature extraction and classification. In order to extract the most important components from the dataset, an attention mechanism was used in conjunction with the suggested Long ShortTerm Memory (LSTM) model. Convolutional neural networks (CNN) and long shortterm memory (LSTM) are two deep learning approaches that require organized input throughout the training process^{46}. The dataset utilized for the construction of the model is presented in a sequential format and demonstrates a grid topology with one dimension. Figure 1 depicts the diagram of the proposed system.
Dataset acquisition
This study involved the utilization of a 15 kW photovoltaic (PV) system integrated with a highefficiency irrigation system. A dataset was collected and analyzed to assess the system’s performance. Multiple data aspects were utilized to explore the complexities of a plant’s power dynamics. The initial characteristic, denoted as Temperature, represents the midday temperature measured in degrees Celsius. The impact of temperature changes on the operating efficiency of the power production systems within the plant is of significant importance. The second feature, windspeed, measures the mean wind velocity, denoted in kilometers per hour. The significance of this parameter is in its ability to affect the effectiveness of winddriven power generation methods. Humidity, the third characteristic, is quantified as a percentage and represents the amount of moisture contained in the atmosphere. This particular data point plays a crucial role in assessing the performance of plant operations and the efficiency of electricity generation. The subject of inquiry is the daily allocation of electrical power within a certain grid system. The fourth characteristic is consumption, which is quantified in kilowatts (kW). This statistic offers significant insights into the energy consumption patterns and requirements of the plant. The fifth attribute, Sun Hours, measures the duration of sunny hours that occur on alternate days, represented in hours (h). This characteristic clearly demonstrates the presence of solar irradiation, which has a direct impact on the potential for solar power generation. The sixth characteristic, known as Cloud Cover, is expressed as a numerical value representing the proportion of the sky that is obscured by clouds. This particular characteristic has a substantial impact on the quantity of sunshine that is able to reach the solar panels, hence directly affecting the production of solar power.
The ultimate characteristic, production, measures the daily power generation of the facility, denoted in kilowatts (kW). This characteristic is impacted by a convergence of variables such as Temperature, wind velocity, and solar radiation and provides an empirical assessment of the plant’s electricity production. By conducting a thorough examination of many data characteristics, this study aims to uncover the complex relationship between weather patterns, energy usage, and electricity generation. The objective is to cultivate a more profound understanding of the plant’s overall performance and operational efficiency. There are three distinct kinds of input features to choose from. In the first part of this research project, we investigate the power output records that cover a period of one year and three months. Secondly, weather characteristics such as Temperature, wind speed, and other relevant factors have been obtained by referring to online weather reports. Thirdly, there is a collective comprising the power consumption facility. Additionally, the existing algorithms are given the input variables in order to forecast the power value for the following day (Fig. 2).
LSTM
The normalization layer, RELU layer, LSTM layer, fully connected layer, dropout layer, and classification layer are just a few of the hidden layers that make up the LSTM network. In activities involving sequential data, the long shortterm memory (LSTM) model performs better. In this paper, a novel 20layer Long ShortTerm Memory (LSTM) architecture is presented. The design has 21 hidden levels, an output layer, and a feature input layer as its first three layers. The suggested LSTM also includes a mechanism for spatial–temporal attention. A recurrent neural network design called Long ShortTerm Memory (LSTM) is capable of accurately capturing the temporal dependencies included in sequential input. The Long ShortTerm Memory (LSTM) model uses the prior output of the hidden state, indicated as h(t − 1), and the current input, denoted as c_t, to operate on sequential data^{47}. The conventional recurrent neural network (RNN) has problems accurately capturing dependencies, which causes problems like exploding and vanishing gradients to appear, especially when dealing with longer time periods. Long ShortTerm Memory (LSTM) models’ inclusion of state units, input gates, output gates, and forget gates is a remedy intended to address the vanishing gradient problem, as mentioned in reference^{48}. The schematic in Fig. 3 shows how the Long ShortTerm Memory (LSTM) model uses input gates, output gates, and forget gates. The input gate, output gate, and forget gate are represented by the sigmoid function, which returns values between 0 and 1, and the hyperbolic tangent function, tanh. Additionally, the state cell memory is represented by Ct, and the candidate for the state of the cell is written as. The forget gate, which is in charge of eliminating unnecessary information from the previous state, as well as the output obtained from the top hidden layer, are among the assigned functions of these gates. The cell’s job is to make sure that the output gate chooses the most crucial information, whereas an update gate is in charge of changing the states by adding a new state.
Longdistance time sequence data can be stored in the LSTM network’s shortterm and longterm memory. Long shortterm memory (LSTM) is used to solve the problem of vanishing gradient, which happens when recurrent neural networks (RNNs) are unable to learn weight parameters successfully. In order to mitigate the problem caused by vanishing gradient, the Long ShortTerm Memory (LSTM) architecture makes use of input gates, output gates, and forget gates. Activation functions are provided by the aforementioned gates. The input gate is represented by Eq. (1), the forget gate by Eq. (2), and the output gate by Eq. (3). Equation (4) shows the potential cell state, Eq. (5) shows the memory state of the cell, and Eq. (6) shows the current output. Figure 4 shows the basic architectural layout of the Long ShortTerm Memory (LSTM) model.
The feature input layer receives the input features for the network and transmits these characteristics to the hidden layers of the LSTM for data normalization. The inclusion of a batch normalization layer has been shown to enhance the training efficiency of the LSTM network^{49}. Batch normalization layers are utilized to execute supplementary scaling and shifting operations. These layers are inserted in order to normalize the input of the activation function and address the issue of vanishing gradient that may occur prior to the hidden layer, such as tanh, RELU, or Sigmoid^{50}. The batch normalization layer incorporates two sequential operations, namely normalization and affine transformation, which are applied channelwise. The normalization operation of data, including n characteristics, includes the calculation of the variance and mean of batch B.
Equation (1) shows the mean, and finally, Eq. (2) is used to calculate the normalization operation by using the mean and variance function.
In this case, B is the minibatch with input of dimension d, which is the mean and displays variance. The input batch is normalized to have a unit S.D. and a zero mean.
Here ({prime}varepsilon {prime}) is an arbitrary tiny constant utilized to ensure numerical stability before an affine transformation (widehat{X}) is used. Equation (4) shows the affine transformation operation.
The learnable scale is ϒ, and the shift parameter is β.
The rectified linear unit (RELU) nonlinear function is applied by the RELU layer, a part of neural networks. The rectified linear unit (RELU) is utilized to modulate the output by constraining its range. The utilization of tanh and sigmoid activation functions in the context of backpropagation can give rise to certain challenges. Consequently, we have opted to employ the rectified linear unit (RELU) as an alternative activation function. Equations (5) and (6) provide the mathematical representation of the rectified linear unit (ReLU) in terms of its gradient and function, respectively.
If RELU receives a negative input, it returns 0, and if it receives a positive input, it returns the value of t. As a result, the RELU’s output ranges from 0 to infinite.
Addition Layer: An addition layer adds inputs elementwise from multiple neural network layers. The addition layer takes multiple inputs of the same shape and returns a single output. We only have to define the number of inputs to the layer when we want to create an addition.
Data is converted into a onedimensional vector and then given to the fully connected layer, which subsequently processes it. The weights of the connections are multiplied by historical data, and the bias value is applied. Equation (7) illustrates the function carried out by a completely connected layer.
Here, W stands for weight, b for bias, o for the rth neuron’s input vector, and f for the activation function.
Softmax layer: For multiclass classification issues, the softmax layer is employed. Softmax is normally used as an output layer for most multiclassification problems. The function of selection performed by the softmax layer is presented in Eq. (8).
The function s represents the softmax function applied to the input vector v. The variable n denotes the total number of classes. The function e^{vj} refers to the standard exponential function applied to the input vector, while e^{vj} represents the standard exponential function applied to the output vector.
Table 1 provides a comprehensive overview of the layers utilized in architecture. Each LSTM layer in the network captures different levels of abstraction in the input sequence. The lower layers learn simple temporal patterns, while the higher layers focus on more complex and abstract representations. This hierarchical feature learning allows the model to understand sequential dependencies at different levels. The use of five concatenated LSTM layers suggests a desire to handle tasks with a high level of complexity. Moreover, the LSTM layers tend to mitigate overfitting by offering a more expressive model capable of capturing intricate patterns in the training data without relying excessively on noise.
Proposed attention mechanism
The attention mechanism employed in deep learning approaches operates in a manner analogous to the human visual system, employing attention constraints to select the most salient aspects from a diverse set of input features. Attention blocks have been extensively employed in several applications, such as the detection of brain tumors. Song and colleagues (YEAR) proposed a novel approach for identifying and estimating human actions in films. Their method involves the development of an endtoend attention block that utilizes spatiotemporal information. Furthermore, Chen et al. (year) proposed a novel approach that incorporates channel and spatial attention mechanisms, as well as an image labeling method, in conjunction with convolutional neural networks (CNNs). This integrated framework demonstrated notable improvements in performance when applied to the specific dataset utilized in their study. Zhai et al. employed a concatenation technique to integrate a dilated convolutional neural network (CNN) with a channel attention model in order to tackle challenges associated with optical flow prediction effectively. The attentionbased model utilizing Long ShortTerm Memory (LSTM) was created by Ran et al. for the purpose of estimating journey duration. The findings indicate that the utilization of attentionbased approaches yields higher levels of accuracy compared to the basic models.
The previously described models have been a source of inspiration, prompting us to suggest a spatial–temporal attention mechanism for LSTM. The subsequent part offers a comprehensive examination of the spatial and temporal operation. Figure 5 illustrates the comprehensive architecture of the LSTM model incorporating an attention mechanism.
Operation for the spatial attention
Let us consider a spatial–temporal feature vector X, which is a 2dimensional matrix represented as XꞓR^{nxk}. Here, n represents the total number of features at a given time step, while k is the total number of time steps. The input matrix for the feature can be partitioned into k ndimensional vectors, as denoted by Eq. (9).
After the computation of monolayer neurons, an input vector gets activated using sigmoid(x) = 1/ 1 + e^{x}. ({alpha }_{t}) is the spatial weight that is generated by the normalization of Softmax(x_{i}). The softmax function is employed to ensure the restricted addition of weights. Moreover, (.) operation presents the Hadamard product, such as elementwise operation.
Operation for temporal attention
The data obtained from the spatial attention block is sequentially inputted into the LSTM cell. The data of the hidden layer output is acquired in accordance with Eq. (12).
Moreover, the temporal attention bias is established subsequent to the use of the rectified linear unit (ReLU) activation function, denoted as max (0,x). Additionally, the process involves the utilization of softmax normalization, as depicted in Fig. 6 and Eq. (19). The variable (x) in Eq. (20) denotes the operation of matrix multiplication. Equation (21) presents the final estimation e in the absence of activation.
Experimental evaluation
In this section, metrics used in the study, experimental setup, and results are presented with details.
Metrics
In the performance assessment, we included four metrics, namely mean square error (MSE), root mean square error (RMSE), and mean absolute error (MAE). The equations of RMSE, MAE, and MAPE are given below.
Here ({E}_{k}{prime}) refers to the estimated energy production, whereas E_{k} refers to the output for target energy, and n exhibits total points of data that are used for the forecasted error estimation. The mean absolute error (MAE) is computed by taking the average of the absolute differences between the target and anticipated values of energy production. In the end, RMSE uses the square to enhance the broad errors.
Environmental setup
For the execution of the proposed system, we used a single GPUbased system on the Windows 10 operating system. We performed all the experiments using the Python language. The details of the environmental setup are shown in Table 2.
Machine learningbased models
In addition, we conducted training on various machine learning methods in conjunction with our proposed LSTM network incorporating an attention mechanism. After training the model with suitable values for the number of epochs, batch size, and validation split, a testing phase is carried out to assess the effectiveness of our improved LSTM model. Furthermore, in order to conduct a comparative analysis, we have utilized supervised learning methodologies for the training phase. The classifiers employed for training our machine learning model predominantly consist of decision tree (DT), K nearest neighbor (KNN), support vector regression (SVR), and naïve Bayes (NB).
The performance of these four algorithms is compared with our attentionbased LSTM model. The results are reported in Table 3. The comparative analysis plot is shown in Fig. 7. It is clearly exhibited that our proposed model achieves better performance than traditional MLbased techniques. The comparative plot for loss estimation among proposed model and some ML algorithms is shown in Fig. 8.
Comparison with base LSTM model
In this part, we undertake a performance evaluation between our proposed attentionbased Long ShortTerm Memory (LSTM) model and a baseline LSTM model, taking into account different time strides for both looking back and looking forward. The results indicate that the average mean absolute percentage error (MAPE) of our suggested system is lower than that of the base long shortterm memory (LSTM) model, as presented in Table 4. When the number of lookback time steps is raised to 14, the mean absolute percentage error (MAPE) exhibits a decreasing trend in its values. As the duration of observation rises, there is an observable increase in the disparity between the MAPE values of the base LSTM model and our proposed LSTM model. In particular, when examining look back 12, the MAPE of our suggested LSTM model is lower for all lookforward values, including 1, 3, and 7. The observed increase was substantial in magnitude, coinciding with the rise in the Look Forward value. Consequently, we achieved an average improvement of 6–7% in the MAPE. Furthermore, when considering the Lookback values of 14 and 16, we observe the improved performance of the LSTM model that we have provided. The improvements in MAPE are observed to increase when the number of lookback periods is raised up to 16 and when the number of lookforward periods is increased up to 7. Therefore, in all instances, our suggested LSTM model demonstrates superior performance compared to the base LSTM model. The visualization of the training and testing curves is depicted in Fig. 9. The visualization of findings is also depicted in Fig. 10.
To analyse the performance of the proposed model’s prediction according to the training data, the Fig. 11 is shown. It is clearly notable from the plot that there is tiny difference among the ground truth and predicted values.
Robustness
Due to the unavailability of public datasets, we used sparse datasets to assess the robustness of the proposed model. Sparse data denotes a dataset characterized by a substantial portion of its elements or features containing zero values.
The division of data is illustrated in Fig. 12, considering four distinct scenarios: 20% data, 40% data, 60% data, and 80% data. Figure 13 showcases the predicted outcomes and errors corresponding to varying data proportions. Figure 13 highlights an increase in predicted error as the data amount decreases, underscoring the substantial role of data in model performance. Despite this, the proposed system, incorporating longshort term dependencies and adherence to physical laws, maintains higher accuracy compared to alternative models. This suggests that the proposed model exhibits superior robustness and stability. The findings affirm that the proposed system excels in providing more precise and satisfactory predictions, even in scenarios with sparse data, surpassing the performance of other models.
Conclusion
In this paper, we propose a robust framework that combines LSTM with a spatiotemporal attention mechanism. Our framework aims to accurately predict the output of a solar plant, specifically in the context of highefficiency irrigation systems. We accomplish this by carefully selecting and incorporating the most significant meteorological variables that have a direct impact on the power output of the plant. In order to ascertain the effectiveness of the proposed system, a range of machine learning techniques have been utilized and compared against our proposed system. The proposed model has shown significant performance improvements due to the spatiotemporal attention mechanism, which is considered the most valuable information among the hidden layers of LSTM during the training phase. Furthermore, we also took into account the retrospective periods of 12, 14, and 16 days when calculating the MAPE for forecasting periods of 1, 3, and 7 days. We found that the MAPE greatly improved after 14 days. Furthermore, the automated system we propose exhibits flexibility that allows system operators and energy service providers to make optimal decisions about power system control. These decisions encompass various aspects, including demand response, economic dispatch, reservation configuration, and unit effect. However, there exist some challenges that still need attention to overcome, like the unavailability of large data for better network training.
In the future, we aim to utilize this system for wind power production and finetune our model to exploit consumer behavioral features to further improve the performance of the forecasting system. Moreover, we will try to crossvalidate the performance of the model gathering versatile datasets.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
References

Mellit, & Kalogirou, S. A. Artificial intelligence techniques for photovoltaic applications: A review. Progress Energy Combus. Sci. 34(5), 574–632 (2008).

Australian Energy Resource Assessment – Chapter 10 – Solar Energy, http://arena.gov.au/files/2013/08/Chapter10SolarEnergy.pdf.

Simões, M. G. & Chakraborty, S. (eds) Power electronics for renewable and distributed energy systems: A sourcebook of topologies, Control and Integration (Springer, London, 2013).

Barbieri, F., Rajakaruna, S. & Ghosh, A. Very shortterm photovoltaic power forecasting with cloud modeling: A review. Renew. Sustain. Energy Rev. 75, 242–263 (2017).

Guo, Z. F. et al. Residential electricity consumption behavior: influencing factors, related theories, and intervention strategies. Renew Sustain Energy Rev 81, 399–412 (2018).

Kong, W., Dong, Z. Y., Hill, D. J., Luo, F. & Xu, Y. ShortTerm residential load forecasting based on resident behavior learning. IEEE Trans. Power Syst. 33(1), 1087–1088 (2018).

Marino, D.L., Amarasinghe, K. & Manic, M. Building energy load forecasting using deep neural networks. In IECON 2016–42nd Annual Conference of the IEEE Industrial Electronics Society, 7046–7051 (2016).

Ahmed, R., Sreeram, V., Mishra, Y. & Arif, M. D. A review and evaluation of the stateoftheart in PV solar power forecasting: Techniques and optimization. In REthinking 200a 100% renewable energy vision for the European Union (eds Arthouros Zervos, C. L. & Josche, M.) (Elsevier, Amsterdam, 2010).

Ssen, Z. Solar energy in progress and future research trends. Prog Energy Combust Sci 30(4), 367–416 (2004).

Rana, M., Koprinska, I. & Agelidis, V. G. Univariate and multivariate methods for very shortterm solar photovoltaic power forecasting. Energy Conv. Manag. 121, 380–390 (2016).

Nielsen, L., Prahm, L., Berkowicz, R. & Conradsen, K. Net incoming radiation estimated from hourly global radiation and/or cloud observations. Int J Climatol 1, 25572 (1981).

Morjaria, M. A gridfriendly plant: The role of utilityscale photovoltaic plants in grid stability and reliability. Power Energy Magazine IEEE 12(3), 87–95 (2014).

Manz, D. et al. The grid of the future: Ten trends that will shape the grid over the next decade. Power Energy Magazine IEEE 12, 26–36 (2014).

Gandoman, F. H., Raeisi, F. & Ahmadi, A. A literature review on estimating of PVarray hourly power under cloudy weather conditions. Renew. Sustain. Energy Rev. 63, 579–592 (2016).

Keller, A. Costa, A Matlab GUI for calculating the solar radiation and shading of surfaces on the earth. Comp. Appl. Eng. Edu. 19(1), 16170 (2011).

Graham, V. & Hollands, K. A method to generate synthetic hourly solar radiation globally. Sol. Energy 44, 33341 (1990).

Albizzati, E., Rossetti, G. & Alfano, O. Measurements and predictions of solar radiation incident on horizontal surfaces at Santa Fe, Argentina (31–390S, 60–430W). Renew. Energy 4, 46978 (1997).

Jones, P. Cloudcover distribution and correlations. J. Clim. Appl. Meteorol. 31, 73241 (1992).

Einozahy, M. S. & Salama, A. M. M. Technical impact of a gridconnected photovoltaic system on electrical networks a review. J. Renew. Sustain. Energy 5, 70111 (2013).

International Energy Agency. Technology roadmap: solar photovoltaic energy (IEA Publications, France, 2010).

Wang, L. & Singh, C. Reliabilityconstrained optimum placement of reclosers and distributed generators in distribution networks using an ant colony system algorithm. IEEE Trans. Syst. 38, 75764 (2008).

Cai, T., Duan, S. & Chen, C. Forecasting power output for gridconnected photovoltaic power system without using solar radiation measurement, In Power Electronics for Distributed Generation Systems (PEDG), 2010 2nd IEEE International Symposium on (2010).

Zhang, Y. et al. RBF neural network and ANFISbased shortterm load forecasting approach in realtime price environment. Power Syst. IEEE Trans. 23(3), 853–858 (2008).

Antonanzas, J. et al. Review of photovoltaic power forecasting. Sol. Energy 136, 78–111 (2016).

Chen, S., Duan, T., Cai, B. & Liu,. Online 24h solar power forecasting based on weather type classification using artificial neural network. Sol. Energy 85(11), 285670 (2011).

Lan, H., Zm, L. & Zhao, Y. ARMA model of the solar power station based on output prediction. Elect. Measure Instrum. 48, 315 (2011).

Htc, P. & Cfm, C. Assessment of forecasting techniques for solar power production with no exogenous inputs. Sol. Energy 86, 201728 (2012).

Chu, Y. H. et al. Shortterm reforecasting of power output from a 48 MWe solar PV plant. Sol. Energy 112, 68–77. https://doi.org/10.1016/j.solener.2014.11.017 (2015).

Sheng, H., Xiao, J., Cheng, Y., Ni, Q. & Wang, S. Shortterm solar power forecasting based on weighted Gaussian process regression. IEEE Trans. Ind. Electron. 65, 3008 (2018).

Persson, P., Bacher, T. & Shiga, H. Madsen, Multisite solar power forecasting using gradient boosted regression trees. Sol. Energy 150, 42336 (2017).

Liu, L. et al. Prediction of shortterm PV power output and uncertainty analysis. Appl. Energy 228, 70011 (2018).

Mellit, A., Pavan, M. & Lughi, V. Shortterm forecasting of power production in a largescale photovoltaic plant. Sol. Energy 105, 401–412 (2014).

Regents of the University of California, “California Renewable Energy Forecasting, Resource Data, and Mapping,” CEC500–2014–026.

Nespoli, et al. Dayahead photovoltaic forecasting: a comparison of the most effective techniques. Energies 12(9), 1–15 (2019).

Dolara, Grimaccia, F., Leva, S., Mussetta, M. & Ogliari, E. A physical hybrid artificial neural network for shortterm forecasting of PV plant power output. Energies 8(2), 113853 (2015).

Zang, H. et al. Hybrid method for shortterm photovoltaic power forecasting based on deep convolutional neural network. IET Gener. Trans. Distrib. 12, 455767 (2018).

Yona, A., Senjyu, T., Funabashi, T. & Kim, C.H. Determination method of insolation prediction with fuzzy and applying neural network for longterm ahead PV power output correction. IEEE Trans. Sustain. Energy 4, 52733 (2013).

Shi, H., Xu, M. & Li, R. Deep learning for household load forecastinga novel pooling deep RNN. IEEE Trans. Smart. Grid 9, 527180 (2017).

Yadav, N. et al. Toward improving waterenergyfood nexus through dynamic energy management of solar powered automated irrigation system. Heliyon https://doi.org/10.1016/j.heliyon.2024.e25359 (2024).

Khanna, A., Dr Kaur, S., Dr Kumar, P. & Singh, M.S. Revamping the Doctrinal Irrigation System into Smart Irrigation Framework Using Real Time Forecasting and Internet of Things (Iot) Based Concepts. Available at SSRN 4237350, (2022).

Sehrawat, N. et al. A power prediction approach for a solarpowered aerial vehicle enhanced by stacked machine learning technique. Comp. Elect. Eng. 115, 109128 (2024).

Luo, X., Zhang, D. & Zhu, X. Deep learningbased forecasting of photovoltaic power generation by incorporating domain knowledge. Energy 225, 120240 (2021).

Tovar, M., Robles, M. & Rashid, F. PV power prediction, using CNNLSTM hybrid neural network model, Case of Study: TemixcoMorelos. Mexico. Energies 13(24), 6512 (2020).

Sharadga, H., Hajimirza, S. & Balog, R. S. Time series forecasting of solar power generation for largescale photovoltaic plants. Renew. Energy 150, 797–807 (2020).

Hochreiter, S. & Schmidhuber, J. Long shortterm memory. Neu. Comp. 9(8), 1735–1780 (1997).

Liang, S., et al. A Double Channel CNNLSTM Model for Text Classification. In 2020 IEEE 22nd International Conference on HighPerformance Computing and Communications; IEEE 18th International Conference on Smart City; IEEE 6th International Conference on Data Science and Systems (HPCC/SmartCity/DSS). (2020).

Huang, H. & Yaming, L.V. Shortterm Tieline Power Prediction Based on CNNLSTM. In 2020 IEEE 4th Conference on Energy Internet and Energy System Integration (EI2). (2020).

Ting, Y.S., Teng, Y.F. & Chiueh, T.D. Batch Normalization Processor Design for Convolution Neural Network Training and Inference. In 2021 IEEE International Symposium on Circuits and Systems (ISCAS). (2021).

Na, W., et al. Deep Neural Network with Batch Normalization for Automated Modeling of Microwave Components. In 2020 IEEE MTTS International Conference on Numerical Electromagnetic and Multiphysics Modeling and Optimization (NEMO). (2020).

Masood, M. et al. Brain MRI analysis using deep neural network for medical of internet things applications. Comp. Elect. Eng. 103, 108386 (2022).
Acknowledgements
This work was funded by the Researchers Supporting Project No. (RSP2023R363), King Saud University, Riyadh, Saudi Arabia. The authors are also thankful to Henan Agriculture University for providing the research facilities.
Funding
This work was supported by the National Natural Science Foundation of China, grant numbers 32071890 and 31671581, and the Henan Center for Outstanding Overseas Scientists, grant numbers GZS2021007. This work was supported by the Major Science and Technology Projects in Henan Province (No. 221100320200) and Henan Center for Outstanding Overseas Scientists (No. GZS2021007).
Author information
Authors and Affiliations
Contributions
M. A. and R. M.; writingreviewing, editing, and original draft preparation, H. Z.; Conceptualization, W. Z. and A. S. M. M.; data curation, I. A.: Revision of Manuscript and further extension of experiments, J. H.; supervision and funding. The paper is submitted with the mutual consent of the authors for publication.
Corresponding authors
Ethics declarations
Competing interests
The authors of this manuscript declare no known financial or competing conflict of interest. Any pledge taken by the corresponding author declares it from the whole group.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Awais, M., Mahum, R., Zhang, H. et al. Shortterm photovoltaic energy generation for solar powered high efficiency irrigation systems using LSTM with Spatiotemporal attention mechanism.
Sci Rep 14, 10042 (2024). https://doi.org/10.1038/s41598024606729

Received: 04 October 2023

Accepted: 25 April 2024

Published: 02 May 2024

DOI: https://doi.org/10.1038/s41598024606729
Keywords
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.
This post was originally published on 3rd party site mentioned in the title of this site