The increasing penetration of renewable energy sources (RES) and the development of the cyber-physical microgrids (CPMs) make greater demands for frequency control of microgrids. The common approach for frequency control is controlling the micro-turbine to compensate for frequency deviations, with energy storage systems serving as an auxiliary approach. This article proposes an online adaptive frequency control method to control the governor and energy storage to realize the frequency recovery of microgrid, subject to the external unknown disturbances caused by wind turbine, power load and false data injection (FDI) attacks. First, the non-zero sum (NZS) games of the considered system are modeled in this work, where the unknown disturbances are also taken into account. For the sake of estimating the unknown disturbances, a disturbance observer (DOB) for the microgrid system is introduced. Then, on the basis of the estimated results of the applied DOB, the disturbance compensation input is derived to offset the interference of the unknown disturbance. Meanwhile, the adaptive dynamic programming (ADP) approach is employed to derive the adaptive optimal control input for the NZS games of microgrid system. Besides, the dynamic event-triggered (DET) control is introduced, reducing the occupation of computing resources. By utilizing the Lyapunov's method, the stability of the closed-loop system, the convergence of the estimation weight, the estimation disturbances and the system state are guaranteed. The effectiveness of the proposed method is ultimately verified by the simulation results.
1 INTRODUCTION
In recent years, the increasing integration of renewable energy sources into power systems has given rise to the concept of microgrids [1]. Such systems consist of distributed energy resources such as wind turbines generators, energy storage devices, and photovoltaic systems, enabling them to operate independently or in conjunction with the main grid [2-4]. As the proportion of renewable energy in microgrids grows, the inertia of the system is reduced, making it increasingly crucial to enhance system resilience [5, 6].
Meanwhile, with the application of information and communication technologies being widespread [7-9], the interaction between energy flow and information flow has become increasingly frequent, leading to the development of cyber-physical microgrids (CPMs). However, CPMs are more vulnerable to cyber attacks, which can lead to frequency fluctuation and, consequently, undermine the stability of microgrids [10].
Controlling micro-turbine is a common approach to smooth the frequency fluctuation. However, micro-turbine may experience mechanical wear during the frequency control process, which can influence the robustness of frequency control [11]. Therefore, combining the micro-turbines and energy storage systems (ESSs) can better regulate frequency fluctuations in microgrids [12]. Over the past few years, various energy storage systems have been studied [13, 14]. They have great potential in enhancing the overall stability of microgrid frequency due to their rapid response, precise control, and high flexibility.
Many control methods have been applied to stabilize the frequency. Proportional-Integral (PI) control has been widely adopted due to its simplicity and effectiveness [15]. However, it can exhibit poor performance in dynamic conditions, and it is unable to effectively handle system nonlinearity and disturbances. Furthermore, many other robust frequency control methods like sliding mode control [16], fuzzy logic control [17], [18] control have been investigated. Different from the above control method, the adaptive dynamic programming (ADP) is originated from dynamic programming, which can can adapt to the changing environments and system dynamics [19, 20]. Due to its ability to handle complex, nonlinear, and time-varying systems, ADP has been increasingly applied in power systems [21, 22]. ADP approach have already shown good performance in frequency control [12], but current ADP approach for frequency control primarily focus on single player. By exploring the Nash equilibrium of multiple players, players can collaborate with each other to achieve global optimization [23].
Currently, CPMs face various threats of cyber attacks, such as denial-of-service attacks, delay attacks, and false data injection (FDI) attacks. Among them, FDI attacks that occur on the actuators or the sensors can disrupt the commands transmitted through cyber channels, potentially causing unexpected frequency fluctuation [24, 25]. At the same time, microgrid systems are subject to numerous external disturbances that are either unmeasurable or difficult to control. Attempting to measure or manage these disturbances would require additional sensors and controllers, which could increase system complexity, cost and reduce system reliability. To address the above challenges, disturbance observer (DOB) has been widely researched and applied in different areas [26, 27]. In recent years, DOB has also been combined with sliding mode control to smooth the frequency fluctuation in microgrids [28].
The method of static event-triggered control (SET) has been applied into the intelligent frequency control and other power system fields. Reference [29] proposed a novel event-triggered control architecture for load frequency control with supplementary adaptive dynamic programming. Similarly, the hybrid policy-based reinforcement learning strategy proposed in [30] integrates event-driven mechanisms, further improving the adaptability of energy management. Static event-triggered control keeps the control signals unchanged between the adjacent triggering instants using zero-order holders. This method effectively conserves computing and transmission resources by reducing the frequency of control updates, making it applicable for systems with limited bandwidth and processing capabilities. Based on the static event-triggered control, dynamic event-triggered control is derived in reference [31] by introducing a dynamic variable to adjust the triggering threshold. Similarly, the DET-based distributed cooperative energy management approach developed in [32] effectively addresses the computational and communication constraints in multi-energy systems. Consequently, the introduction of the DET mechanism can have a significant positive impact on frequency control.
Currently, no research has utilized DET-based adaptive optimal control to address frequency control issues while considering unknown external disturbances. Therefore, the NZS games of the considered system are modeled in this work, taking into account the unknown external disturbances. Based on this, we propose a dynamic event-triggered-based ADP control scheme with the help of DOB, which can solve the optimal frequency control problem with one of the player's input constrained, while offsetting the interference of the disturbances by introducing the compensation input. The main tasks outlined in this article are as follows:
1) The microgrid optimal frequency control problem with ESS taking into account the external unknown disturbance is transformed into an NZS games problem with the help of DOB. Then the adaptive critic design method is utilized to get the approximate Nash equilibrium solutions.
2) For the first time, a dynamic event-triggering-based optimal control method, combining ADP and disturbance observer, is used to solve the non-zero sum games. The provided dynamic triggered rule can guarantee the stability of the system.
3) Unlike traditional time-triggered methods, a novel DET method is utilized as an alternative to the event-triggered method for the optimal frequency control issue. By introducing a mathematically equivalent filter structure, fewer events are triggered, so that the communication and computational burden are reduced.
The remainder of this article is organized as follows. In Section 2, the mathematical model of microgrid system is reconstructed and a DOB is designed. Based on frequency dynamics, a DET-based adaptive dynamic programming control scheme is proposed in Section 3, where the stability proof is also given. Section 4 demonstrates the effectiveness of the proposed method through the simulation results. Lastly, the conclusion is presented in Section 5.
2 MATHEMATICAL MODEL OF MICROGRID SYSTEM
2.1 Microgrid system with unknown disturbance
In this section, the microgrid system consists of micro-turbine, governor, energy storage system, wind turbine and power load. And the system structure is presented in Figure 1. Similar to the description in [33], the transfer functions are formulated: , , , where , , represent the time constants of governor, micro-turbine and system inertia, respectively. represents the gain coefficient of the power system, represents speed regulation coefficient and represents integral gain. The FDI attacks occurred on the actuator channel are also modeled [34], which can disrupt the transmission of the control commands of the ESS. As energy storage systems are often connected through IoT (Internet of Things)-based architectures, they are more exposed to cyber threats compared to micro-turbine and governor, which have a limited dependency on cyber and communication infrastructure. Therefore, this article only discuss the FDI attacks that interfere with the control input of ESS.
In order to propose the adaptive dynamic programming-based controller with the help of disturbance observer, the frequency dynamics can be described by:
(1)
Define as the state vector; they represent frequency deviation, turbine output and governor position valve and incremental change in integral control, respectively. is the unknown disturbance caused by the wind turbine and the load change, where , represent the wind turbine disturbance and the load change disturbance, respectively. represents the FDI attacks launched on the actuator channel that can interfere with the control input of the ESS. The control input and represent the control of the ESS and the control of the governor, respectively. And the system state equation can be given by
(2)
where
(3)
(4)
(5)
2.2 Problem statement
In this section, we design an intelligent frequency control algorithm regulated by the ADP method for the NZS games of system (1) with unknown disturbances. The impact of these unknown disturbances is effectively eliminated by the disturbance compensation input, ensuring the stability and reliability of the frequency control at the same time. To provide a clearer understanding of the proposed intelligent frequency control method, the overall framework of the composite control scheme is shown in Figure 1. It can be observed that the composite control input contains two parts: one is the adaptive optimal input obtained by ADP for the NZS games of system (1), and the other is the disturbance compensation input based on estimation of the disturbance observer. The composite robust control effectively eliminates the impact of unknown disturbances in the microgrid system and simultaneously minimize the value function.
Inspired by the excellent achievements in [35], the following disturbance observer is designed to estimate disturbance of system (1). And then, based on the estimated disturbance, a disturbance compensation input in the following form is introduced to offset the interference of the applied unknown disturbance. The DOB is designed as
(6)
where is the estimation of the external disturbance, is a vector function to be designed and . is the intermediate variable which is given to avoid computing the derivative of the state.
The disturbance observer error is defined as
(7)
The following assumption is needed in the following analysis and similar assumption can also be found in [36, 37].
Assumption 1.The disturbance in system (1) is slowly time-varying and bounded, that is, . is also bounded, that is, with positive constant .
Next, the following theorem illustrates that the disturbance observer error is asymptotic stable.
Theorem 1.If is a positive definite matrix and Assumption 1 is satisfied, the disturbance observer error is asymptotically stable.
Proof.Select the Lyapunov candidate as . Then the corresponding derivative of is
According to the Lyapunov's theory, is asymptotically stable, which indicates that the DOB can asymptotically approximate the unknown disturbance. This conclusion provides a foundation for offsetting the interference caused by wind power fluctuations, load changes, and FDI attacks.
Therefore, based on the estimation of the disturbance observer and the system state equation (2), the composite control inputs of system (1) are formulated as:
(10)
(11)
where is the disturbance compensation input, and are the ADP-based optimal control inputs of energy storage and governor, respectively.
3 DYNAMIC EVENT-TRIGGERED BASED ADAPTIVE DYNAMIC PROGRAMMING APPROACH FOR THE NZS GAMES
In this section, a DET-based adaptive dynamic programming control scheme with the help of disturbance observer for the NZS games of system (1) is proposed, and the structure diagram of the closed-loop system is given in Figure 1. The NZS games of system (1) are addressed by utilizing the adaptive critic design approach under the DET mechanism. By introducing a non-quadratic function into the performance index, the robust stabilization problem is transformed into a constrained optimal control problem. Finally, the stability of the closed-loop system is verified.
3.1 Triggering adaptive controller design
The governor's control signal, that is, , should be constrained to prevent excessive wear on the governor, as this could reduce system reliability. Therefore, a non-quadratic function is introduced into the performance index, and define the utility function of the ith player as . The performance index of the ith player can be described as:
(12)
where , and are symmetric positive-definite matrices. Inspired by [38], and are chosen as:
(13)
(14)
where is adopted, and is the upper bound of the control input . Since the overall composite control input is equivalent to the ADP-based control input without the disturbance compensation term, the upper bound of is aligns with the upper bound of . Therefore, it is possible to constrain the control signal of the governor. The ith player's value function is defined as
(15)
Definition 1. ([[39, 40]])The strategy set is a Nash equilibrium strategy set, and the following inequalities:
(16)
(17)
are satisfied for any admissible control policies.
Next, the Hamiltonian of the ith player with associated admissible control inputs is defined as
(18)
where and is calculated by . Thus, the optimal value function
(19)
satisfies the HJ equation
(20)
Based on stationary condition, the associated individual optimal ADP-based control policies are attained as
(21)
(22)
Due to the difficulty in solving the HJ equation and critic NN's universal approximation property, we approximate the optimal solution by
(23)
where is the ideal weight vector, is the activation function, is the number of hidden layer neurons, and is the approximation error.
Since the ideal weight of the critic NNs is unknown, the optimal solution cannot be obtained directly. Therefore, we reformulate the function (23) by using the critic NNs
(24)
where is the estimation of the ideal weight vector . Then, taking the partial derivative of , that is, , we can have
(25)
Similarly, we have
(26)
Submitting (25) into (21) and (22), the ADP-based optimal control policies can be derived as
(27)
(28)
Define as a monotonically increasing sequence, which presents the set of the triggering moments. Here represents the k-th sampling time, where is a positive constant. The sampling state remains unchanged between two sampling moments.
Using a zero-order holder(ZOH), the piecewise continuous control signals can be got, and the optimal control policies are transformed into the following formulas
(29)
(30)
where , .
Then, the measurement error can be defined as: , where is the sample state and is the real-time state. Submitting (26) into (21) and (22), the event-triggered approximated control policies are transformed as
(31)
(32)
And the event-triggered optimal control policies can be derived as
(33)
(34)
Considering the above equations, the Hamiltonian residual error can be defined as
(35)
We introduced an auxiliary term as
(36)
To get the minimum residual equation error , it is desired to update to minimize the squared residual error . Consequently, by using the gradient descent approach [41], the weight tuning laws can be calculated by
(37)
where , and the learning rate is a positive constant to be designed. Since the ideal weight is a constant vector, we can conclude that . According to (37), we have
(38)
3.2 Static event triggering mechanism and stability analysis
According to the previous work [38, 42, 43], the following assumptions are needed for the stability analysis.
Assumption 2.The control law is Lipschitz continuous, that is, with , where .
Assumption 3.For any , the input coefficient matrix , the ideal critic weight , the gradient of the activation function and the gradient of the NN approximation error are bounded, that is
where , , , are all positive constants.
Assumption 4.According to the persistence of excitation condition, for any player, the signal is persistently excited, so the inequality is satisfied:
(39)
where , and , are positive constants. Furthermore, assume that .
Assumption 5.The function is Lipschitz continuous, which satisfies . Here is a positive constant and .
Theorem 2.Considering the NZS games of system (1) with unknown external disturbance, and assuming that Assumptions 1–5 hold, along with the disturbance observer (6), the event-triggered approximate optimal control law (31), (32) and weight update law (37) are used. If the event-triggered rule
(40)
holds, where parameter , then the weight estimation errors , the system state and the disturbance observer estimation error are uniform ultimate boundedness (UUB).
Proof.Select the Lyapunov function candidate as , with , , , .
Case 1.When , the time derivative of is
(41)
Due to Equation (20), the following equation holds:
(42)
Submitting (42) into (41), we can have (43), as shown at the top of next page. We define and it is assumed to be bounded as .
(43)
By applying Young's inequality, we can derive
(44)
(45)
In the process of deriving the above equations, the following equations are obtained
Then, we continue to analyze the Lyapunov function. The time derivative of can be deduced as
(49)
where . As the function keeps unchanged during case 1, the time derivation equals zero. The time derivation of is formulated as (9).
By integrating the derivations of each term in the Lyapunov equation above, we obtain the following expression:
(50)
Furthermore, we have
(51)
where
(52)
Therefore, under the condition (40), when the following inequality
(53)
or
(54)
or
(55)
or
(56)
is satisfied, the conclusion can be drawn, which indicates that the weight estimation errors , the system state and the disturbance observer estimation error are UUB.
Case 2.When , we can obtain that
(57)
According to the analysis of Case 1, the function is non-increasing during the time interval. Hence, when the conditions (40), (53), (54), (55), (56) holds, . As , and are all time-continuous functions, .
So far, the entire framework of the DET-based adaptive dynamic programming approach for the NZS games has been established. As shown in Figure 1, by comparing the measurement error with the triggered condition, it is determined whether an event is triggered. Then, if the event is triggered, the critic NNs approximate the solutions of the coupled HJ equations by substituting in the sampled state. Meanwhile, the optimal strategy is updated based on this approximation, and finally, the ZOH maintains the continuity of the control signal.
3.3 Dynamic event triggering mechanism and stability analysis
In order to adjust the triggering threshold based on the circumstances, this section introduces a dynamic variable to generate a dynamic event-trigger rule. Dynamic variable needs to satisfy the following formulas:
(58)
where the parameter represents the filtering coefficient.
The dynamic variable is generated by the mentioned filter structure, which can dynamically adjust the threshold. Since the DET strategy ensures that the value of remains non-negative, it relaxes the stability condition, thereby increasing the trigger interval. The DET rule can be defined as:
(59)
where . When , the dynamic event triggering rule (59) becomes the static event triggering rule (40).
Lemma 1.For the system (1), the internal dynamic variable always remains non-negative during the DET control process.
Proof.Note that based on the employed DET rule, the following inequality
(60)
is obviously satisfied. Then, if , by combining (58) and (60), we have
(61)
According to the comparative lemma, we can get the following relation:
(62)
Therefore, is proven.
Theorem 3.Considering the NZS games of system (1) with unknown external disturbance, and assuming that Assumptions 1–5 hold, along with the disturbance observer (6), the event-triggered approximate optimal control law (31), (32) and weight update law (37) are used. If the dynamic event triggering rule (59) is adopted, the system state , the weight estimation errors and the disturbance observer estimation error are UUB.
Proof.Select the following Lyapunov function
(63)
The time derivation of can be obtained as
(64)
Based on Lemma 1 and , if the conditions (53), (54), (55), (56) hold, we can have
(65)
The proof is, thus, completed.
4 SIMULATION
In this section, the frequency control was designed according to the proposed method.
Example 1.Inspired by the excellent work [11], the power constraints of ESS was set as . The parameters of microgrid [12] were chosen as , , , , , .
The optimal cost functions of system (1) were defined with parameters , , , , and . The activation functions of NNs of the ESS and governor were both given as . Then the initial value of the critic NN weights were set as , , respectively. The learning rates were chosen as .
For the dynamic event-triggering condition (59), the parameters were chosen as , , , and . In order to achieve the disturbance observation, the initial value of the intermediate variable were set as 0 and was designed as . Thus . Starting with the initial value , the simulation time step was set to 0.01 s. The unknown external disturbance was added to the MG system for the first 30 s, and the dynamic event-triggered method was implemented for 150 s. The estimation of disturbance is shown in Figure 4.
Run the algorithm to acquire the learning results which are displayed in Figures 2, 3 and 4. The system states, the proposed DET control policies and the approximate Hamiltonian functions are presented in Figure 2. It can be seen that the system state close to zero at s, indicates our method to be effective. It can be found from Figure 2c that the control input amplitude of the governor is less than 0.1, which meets the control constraints.
The critic NNs training curves of the weight are shown in Figure 3. It can be seen that the converged weights are , , respectively. The triggering process is provided in Figure 3. And in Figure 3c, the cumulative number of triggers for the dynamic event-triggering method and the number of samples for the time triggering method are compared, indicating that the dynamic event-triggering method can indeed save system computing and communication resources. Figure 3d shows the evolution of the triggering condition, which illustrates how and change according to the condition (40).
The simulation result in Figure 4 illustrates that the disturbance observer efficiently estimates the external disturbance, making it possible to offset the interference of the disturbance. Notably, during the first 30 s, when the disturbance is applied, the designed DOB successfully approximates the disturbance. Afterward, the system states gradually converge.
Example 2.Different from the damping oscillating disturbance applied in Example 1, the step disturbance was added for the first 50 s.
The parameters of the system were chosen as , , , , , . And the DET condition parameters were set as , , , and . To approximate the external unknown disturbance, were changed to , accordingly, . Other parameter were the same as Example 1. The designed method was implemented for 200 s.
The running results of the designed algorithm are shown in Figures 4-6. The system states, the DET control policies and the approximate Hamiltonian functions are presented in Figure 5. It can be seen that the system state is close to zero at 150 s, which indicates our method to be effective. The control input of the governor is constrained to 0.1, which satisfies the requirement of input constraint.
According to the simulation results, we can obtain the estimation of the ideal weight as , . The triggering process is shown in Figure 6. And in Figure 6c, the cumulative number of triggers for the DET method and the number of samples for the time triggering method are compared, indicating that the dynamic event-triggering method can indeed save system computing and communication resources. Figure 6d shows the evolution of the triggering condition, which illustrates how and change according to the condition (40).
The simulation results in Figure 4 illustrates that the disturbance observer can successfully approximate both the oscillating disturbance and the step disturbance, which demonstrates the good estimation performance of the disturbance observer. The convergence of the system state to zero also indicates that the proposed algorithm, the DET-based ADP approach combining disturbance observer, can successfully utilize the disturbance observer to offset the interference of disturbance, thereby addressing the optimal frequency control problem.
5 CONCLUSION
For the frequency optimal control problem, we studied a class of cyber-physical microgrids facing unknown disturbances from wind turbine, FDI attacks and the load change. The composite control input consists of two parts: one is the adaptive optimal input for the NZS game of the microgrid system, and the other is the disturbance compensation input derived from the estimation provided by the disturbance observer. The composite robust control effectively eliminates the impact of unknown disturbances in the microgrid system while simultaneously minimizing the value function. Meanwhile, the system stability can be guaranteed. Simulation examples were used to prove the effectiveness of the presented algorithm.
It is expected that future work could further enhance this framework by integrating more advanced FDI detection and mitigation strategies and addressing challenges such as communication delays and network uncertainties. Such advancements would improve the robustness and adaptability of frequency control in microgrid environments.
This work was supported by National Natural Science Foundation of China (Nos. 62373091, 62103087, 62203311 & U22A2055), China Postdoctoral Science Foundation (Nos. 2024T170112 & 2021M690567), National Key R&D Program of China under Grant 2018YFA0702200, the Fundamental Research Funds for the Central Universities (Nos. N2104016 & N2304009), Natural Science Foundation of Liaoning Province (No. 2023-MSBA-082), China Academy of Engineering Institute of Land Cooperation Consulting Project (2023-DFZD-60, 2023-DFZD-60-03) and Key Laboratory of Integrated Energy Optimization and Secure Operation of Liaoning Province.
2Hu, Z., Qiu, H., Alhelou, H.H., Su, R., Ma, R.: Resilient distributed frequency regulation for interconnected power systems with PEVs and wind turbines against temporary PMU faults. IEEE Internet Things J.11(23), 38719–38727 (2024)
4Yang, L., Li, X., Sun, M., Sun, C.: Hybrid policy-based reinforcement learning of adaptive energy management for the energy transmission-constrained island group. IEEE Trans. Ind. Inf.19(11), 10751–10762 (2023)
5Hu, Z., Ma, R., Wang, B., Huang, Y., Su, R.: A general resiliency enhancement framework for load frequency control of interconnected power systems considering Internet of Things faults. IEEE Trans. Ind. Inf.20(9), 10695–10705 (2024)
6Wang, J., Wang, Y., Qiu, D., Su, H., Strbac, G., Gao, Z.: Resilient energy management of a multi-energy building under low-temperature district heating: A deep reinforcement learning approach. Appl. Energy378, 124780 (2025)
10Chaudhary, A.K., Roy, S., Guha, D., Negi, R., Banerjee, S.: Adaptive cyber-tolerant finite-time frequency control framework for renewable-integrated power system under deception and periodic denial-of-service attacks. Energy302, 131809 (2024)
11Tong, X., Ma, D., Wang, R., Xie, X., Zhang, H.: Dynamic event-triggered-based integral reinforcement learning algorithm for frequency control of microgrid with stochastic uncertainty. IEEE Trans. Consum. Electron.69(3), 321–330 (2023)
12Mu, C., Zhang, Y., Jia, H., He, H.: Energy-storage-based intelligent frequency control of microgrid with stochastic model uncertainties. IEEE Trans. Smart Grid11(2), 1748–1758 (2020)
13Takigawa, K., Okada, N., Kuwabara, N., Kitamura, A., Yamamoto, F.: Development and performance test of smart power conditioner for value-added PV application. Sol. Energy Mater. Sol. Cells75(3), 547–555 (2003)
14Zhao, T., Ding, Z.: Cooperative optimal control of battery energy storage system under wind uncertainties in a microgrid. IEEE Trans. Power Syst.33(2), 2292–2300 (2018)
15Garmroodi, M., Verbič, G., Hill, D.J.: Frequency support from wind turbine generators with a time-variable droop characteristic. IEEE Trans. Sustainable Energy9(2), 676–684 (2018)
16Guo, J.: Application of full order sliding mode control based on different areas power system with load frequency control. ISA Trans.92, 23–34 (2019)
17Yang Wang, Y.W., Zhou, X.: Fuzzy logic based feedback control system for the frequency stabilization of external-cavity semiconductor lasers. Int. J. Optomechatronics14(1), 44–52 (2020)
21Liu, F., Su, H., Zhang, H., Song, R., Wang, J.: Dynamic self-triggered adaptive control for voltage regulation of DC microgrids. IEEE Trans. Circuits Syst. II Exp. Briefs (2024)
22Su, H., Zhi, G., Zhang, H., Wang, J., Strbac, G., Ren, H.: ADP-based self-triggered optimal control of active loads in DC microgrid. IEEE Trans. Circuits Syst. II Exp. Briefs (2024)
23Mu, C., Wang, K., Ni, Z.: Adaptive learning and sampled-control for nonlinear game systems using dynamic event-triggering strategy. IEEE Trans. Neural Networks Learn. Syst.33(9), 4437–4450 (2022)