Publication details

Bibliographic description

Fuzzy $H_\infty$ control of discrete-time nonlinear Markov jump systems via a novel hybrid reinforcement Q-learning method / Jing Wang, Jiacheng Wu, Hao Shen, Jinde Cao, Leszek RUTKOWSKI // IEEE Transactions on Cybernetics ; ISSN 2168-2267. — 2023 — vol. 53 no. 11, pp. 7380–7391. — Bibliography pp. 7390–7391, abstract. — Publication available online since: 2022-11-23. — L. Rutkowski - additional affiliation: Systems Research Institute, Polish Academy of Sciences, Warsaw

Authors (5)

Wang Jing
Wu Jiacheng
Shen Hao
Cao Jinde
Rutkowski Leszek (AGH)

Keywords

Markov jump nonlinear systems; hybrid reinforcement Q-learning; coupled algebraic Riccati equation (CARE); fuzzy H∞ control

Bibliometric data

BaDAP ID	150855
Date added to BaDAP	2024-01-09
Source text	URL
DOI	10.1109/TCYB.2022.3220537
Publication year	2023
Publication type	journal article
Open access
Journal/series	IEEE Transactions on Cybernetics

Abstract

In this article, a novel hybrid reinforcement Q-learning control method is proposed to solve the adaptive fuzzy H∞ control problem of discrete-time nonlinear Markov jump systems based on the Takagi-Sugeno fuzzy model. First, the core problem of adaptive fuzzy H∞ control is converted into solving a fuzzy game coupled algebraic Riccati equation, which can hardly be solved directly by mathematical methods. To solve this problem, an offline parallel hybrid learning algorithm is first designed, in which the system dynamics must be known a priori. Furthermore, an online parallel Q-learning hybrid learning algorithm is developed. The main characteristics of the proposed online hybrid learning algorithm are threefold: 1) knowledge of the system dynamics is not required during the learning process; 2) compared with the policy iteration method, the requirement of an initial stabilizing control policy is removed; and 3) compared with the value iteration method, a faster convergence rate can be obtained. Finally, a tunnel diode circuit system model is provided to validate the effectiveness of the proposed learning algorithm.
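
For orientation only, and not taken from the article: the sketch below runs plain value iteration on the coupled Riccati equations of a discrete-time Markov jump linear system. It is a deliberately simplified, model-based stand-in that omits the disturbance (H∞ game) term, the Takagi-Sugeno fuzzy rules, and the Q-learning step. The function names and all numerical values are hypothetical. It illustrates only why value iteration can start from P = 0 without an initial stabilizing policy.

```python
import numpy as np

def riccati_step(P, A, B, Q, R, Pi):
    """One sweep of the coupled Riccati recursion over all modes.

    Returns the updated value matrices and the gains (u = -K[i] x)
    computed from the input P. Simplified LQR form, no disturbance term.
    """
    modes = range(len(A))
    P_new, K = [], []
    for i in modes:
        # Expected next-step value matrix, given that the current mode is i
        E = sum(Pi[i, j] * P[j] for j in modes)
        BtE = B[i].T @ E
        K_i = np.linalg.solve(R + BtE @ B[i], BtE @ A[i])
        P_new.append(Q + A[i].T @ E @ A[i] - (BtE @ A[i]).T @ K_i)
        K.append(K_i)
    return P_new, K

def coupled_riccati_value_iteration(A, B, Q, R, Pi, tol=1e-10, max_iter=10_000):
    """Iterate riccati_step from P = 0 until the largest change is below tol."""
    n = A[0].shape[0]
    P = [np.zeros((n, n)) for _ in A]  # zero start: no stabilizing policy needed
    K = [np.zeros((B_i.shape[1], n)) for B_i in B]
    for _ in range(max_iter):
        P_new, K = riccati_step(P, A, B, Q, R, Pi)
        change = max(np.max(np.abs(Pn - Po)) for Pn, Po in zip(P_new, P))
        P = P_new
        if change < tol:
            break
    return P, K

# Hypothetical two-mode scalar example (mode 1 is open-loop unstable).
A = [np.array([[1.2]]), np.array([[0.8]])]
B = [np.array([[1.0]]), np.array([[1.0]])]
Q = np.array([[1.0]])
R = np.array([[1.0]])
Pi = np.array([[0.9, 0.1],
               [0.3, 0.7]])  # mode transition probabilities, rows sum to 1

P, K = coupled_riccati_value_iteration(A, B, Q, R, Pi)
for i, (P_i, K_i) in enumerate(zip(P, K), start=1):
    print(f"mode {i}: P = {P_i.ravel()}, K = {K_i.ravel()}")
```

With a single mode and `Pi = [[1.0]]`, the recursion reduces to the ordinary discrete-time Riccati iteration, which is a convenient sanity check.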

Publications that may interest you

article

#144554, date added: 2023-01-12

Robust composite $H_\infty$ synchronization of Markov jump reaction–diffusion neural networks via a disturbance observer-based method / Hao Shen, Xuelian Wang, Jing Wang, Jinde Cao, Leszek RUTKOWSKI // IEEE Transactions on Cybernetics ; ISSN 2168-2267. — 2022 — vol. 52 no. 12, pp. 12712–12721. — Bibliography pp. 12720–12721, abstract. — L. Rutkowski - additional affiliation: Information Technology Institute, Academy of Social Sciences, Łódź, Poland; System Research Institute of Polish Academy of Sciences, Warsaw, Poland


article

#165162, date added: 2025-12-22

Reinforcement-learning-based fuzzy bipartite consensus for multiagent systems: a novel scaling off-policy learning scheme / Jing Wang, Qing Yang, Jinde Cao, Leszek RUTKOWSKI, Hao Shen // IEEE Transactions on Cybernetics ; ISSN 2168-2267. — 2025 — vol. 55 no. 9, pp. 4491–4501. — Bibliography pp. 4500–4501, abstract. — Publication available online since: 2025-06-04. — L. Rutkowski - additional affiliation: Systems Research Institute of the Polish Academy of Sciences, Warsaw, Poland
