Publication details

Bibliographic description

Fuzzy $H_\infty$ control of discrete-time nonlinear Markov jump systems via a novel hybrid reinforcement Q-learning method / Jing Wang, Jiacheng Wu, Hao Shen, Jinde Cao, Leszek RUTKOWSKI // IEEE Transactions on Cybernetics ; ISSN 2168-2267. — 2023 — vol. 53 no. 11, pp. 7380–7391. — Bibliogr. pp. 7390–7391, Abstr. — Publication available online since: 2022-11-23. — L. Rutkowski - additional affiliation: Systems Research Institute, Polish Academy of Sciences, Warsaw

Authors (5)

Keywords

hybrid reinforcement Q-learning; coupled algebraic Riccati equation (CARE); Markov jump nonlinear systems; fuzzy H∞ control

Bibliometric data

BaDAP ID: 150855
Date added to BaDAP: 2024-01-09
Source text: URL
DOI: 10.1109/TCYB.2022.3220537
Publication year: 2023
Publication type: journal article
Open access: yes
Journal/series: IEEE Transactions on Cybernetics

Abstract

In this article, a novel hybrid reinforcement Q-learning control method is proposed to solve the adaptive fuzzy H∞ control problem of discrete-time nonlinear Markov jump systems based on the Takagi-Sugeno fuzzy model. First, the core problem of adaptive fuzzy H∞ control is converted to solving a fuzzy game coupled algebraic Riccati equation, which can hardly be solved directly by mathematical methods. To solve this problem, an offline parallel hybrid learning algorithm is first designed, for which the system dynamics must be known a priori. Furthermore, an online parallel Q-learning hybrid learning algorithm is developed. The main characteristics of the proposed online hybrid learning algorithm are threefold: 1) knowledge of the system dynamics is not required during the learning process; 2) compared with the policy iteration method, the restriction of an initial stable control policy is removed; and 3) compared with the value iteration method, a faster convergence rate can be obtained. Finally, we provide a tunnel diode circuit system model to validate the effectiveness of the presented learning algorithm. © 2013 IEEE.

Publications that may interest you

article
#144554 Date added: 2023-01-12
Robust composite $H_\infty$ synchronization of Markov jump reaction–diffusion neural networks via a disturbance observer-based method / Hao Shen, Xuelian Wang, Jing Wang, Jinde Cao, Leszek RUTKOWSKI // IEEE Transactions on Cybernetics ; ISSN 2168-2267. — 2022 — vol. 52 no. 12, pp. 12712–12721. — Bibliogr. pp. 12720–12721, Abstr. — L. Rutkowski - additional affiliation: Information Technology Institute, Academy of Social Sciences, Łódź, Poland; System Research Institute of Polish Academy of Sciences, Warsaw, Poland
article
#165162 Date added: 2025-12-22
Reinforcement-learning-based fuzzy bipartite consensus for multiagent systems: a novel scaling off-policy learning scheme / Jing Wang, Qing Yang, Jinde Cao, Leszek RUTKOWSKI, Hao Shen // IEEE Transactions on Cybernetics ; ISSN 2168-2267. — 2025 — vol. 55 no. 9, pp. 4491–4501. — Bibliogr. pp. 4500–4501, Abstr. — Publication available online since: 2025-06-04. — L. Rutkowski - additional affiliation: Systems Research Institute of the Polish Academy of Sciences, Warsaw, Poland