Szczegóły publikacji
Opis bibliograficzny
Fuzzy $H_\infty$ control of discrete-time nonlinear Markov jump systems via a novel hybrid reinforcement Q-learning method / Jing Wang, Jiacheng Wu, Hao Shen, Jinde Cao, Leszek RUTKOWSKI // IEEE Transactions on Cybernetics ; ISSN 2168-2267. — 2023 — vol. 53 no. 11, s. 7380–7391. — Bibliogr. s. 7390–7391, Abstr. — Publikacja dostępna online od: 2022-11-23. — L. Rutkowski - dod. afiliacja: Systems Research Institute, Polish Academy of Sciences, Warsaw
Autorzy (5)
- Wang Jing
- Wu Jiacheng
- Shen Hao
- Cao Jinde
- AGHRutkowski Leszek
Słowa kluczowe
Dane bibliometryczne
| ID BaDAP | 150855 |
|---|---|
| Data dodania do BaDAP | 2024-01-09 |
| Tekst źródłowy | URL |
| DOI | 10.1109/TCYB.2022.3220537 |
| Rok publikacji | 2023 |
| Typ publikacji | artykuł w czasopiśmie |
| Otwarty dostęp | |
| Czasopismo/seria | IEEE Transactions on Cybernetics |
Abstract
In this article, a novel hybrid reinforcement Q -learning control method is proposed to solve the adaptive fuzzy H∞ control problem of discrete-time nonlinear Markov jump systems based on the Takagi-Sugeno fuzzy model. First, the core problem of adaptive fuzzy H∞ control is converted to solving fuzzy game coupled algebraic Riccati equation, which can hardly be solved by mathematical methods directly. To solve this problem, an offline parallel hybrid learning algorithm is first designed, where system dynamics should be known as a prior. Furthermore, an online parallel Q -learning hybrid learning algorithm is developed. The main characteristics of the proposed online hybrid learning algorithms are threefold: 1) system dynamics are avoided during the learning process; 2) compared with the policy iteration method, the restriction of the initial stable control policy is removed; and 3) compared with the value iteration method, a faster convergence rate can be obtained. Finally, we provide a tunnel diode circuit system model to validate the effectiveness of the present learning algorithm. © 2013 IEEE.