Reinforcement-Learning-Based Disturbance Rejection Control for Uncertain Nonlinear Systems. 2022

Maopeng Ran, and Juncheng Li, and Lihua Xie

This article investigates the reinforcement-learning (RL)-based disturbance rejection control for uncertain nonlinear systems having nonsimple nominal models. An extended state observer (ESO) is first designed to estimate the system state and the total uncertainty, which represents the perturbation to the nominal system dynamics. Based on the output of the observer, the control compensates for the total uncertainty in real time, and simultaneously, online approximates the optimal policy for the compensated system using a simulation of experience-based RL technique. Rigorous theoretical analysis is given to show the practical convergence of the system state to the origin and the developed policy to the ideal optimal policy. It is worth mentioning that the widely used restrictive persistence of excitation (PE) condition is not required in the established framework. Simulation results are presented to illustrate the effectiveness of the proposed method.

UI MeSH Term Description Entries
D003198 Computer Simulation Computer-based representation of physical systems and phenomena such as chemical processes. Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D016571 Neural Networks, Computer A computer architecture, implementable in either hardware or software, modeled after biological neural networks. Like the biological system in which the processing capability is a result of the interconnection strengths between arrays of nonlinear processing nodes, computerized neural networks, often called perceptrons or multilayer connectionist models, consist of neuron-like units. A homogeneous group of units makes up a layer. These networks are good at pattern recognition. They are adaptive, performing tasks by example, and thus are better for decision-making than are linear learning machines or cluster analysis. They do not require explicit programming. Computational Neural Networks,Connectionist Models,Models, Neural Network,Neural Network Models,Neural Networks (Computer),Perceptrons,Computational Neural Network,Computer Neural Network,Computer Neural Networks,Connectionist Model,Model, Connectionist,Model, Neural Network,Models, Connectionist,Network Model, Neural,Network Models, Neural,Network, Computational Neural,Network, Computer Neural,Network, Neural (Computer),Networks, Computational Neural,Networks, Computer Neural,Networks, Neural (Computer),Neural Network (Computer),Neural Network Model,Neural Network, Computational,Neural Network, Computer,Neural Networks, Computational,Perceptron
D017711 Nonlinear Dynamics The study of systems which respond disproportionately (nonlinearly) to initial conditions or perturbing stimuli. Nonlinear systems may exhibit "chaos" which is classically characterized as sensitive dependence on initial conditions. Chaotic systems, while distinguished from more ordered periodic systems, are not random. When their behavior over time is appropriately displayed (in "phase space"), constraints are evident which are described by "strange attractors". Phase space representations of chaotic systems, or strange attractors, usually reveal fractal (FRACTALS) self-similarity across time scales. Natural, including biological, systems often display nonlinear dynamics and chaos. Chaos Theory,Models, Nonlinear,Non-linear Dynamics,Non-linear Models,Chaos Theories,Dynamics, Non-linear,Dynamics, Nonlinear,Model, Non-linear,Model, Nonlinear,Models, Non-linear,Non linear Dynamics,Non linear Models,Non-linear Dynamic,Non-linear Model,Nonlinear Dynamic,Nonlinear Model,Nonlinear Models,Theories, Chaos,Theory, Chaos
D035501 Uncertainty The condition in which reasonable knowledge regarding risks, benefits, or the future is not available.

Related Publications

Maopeng Ran, and Juncheng Li, and Lihua Xie
November 2022, ISA transactions,
Maopeng Ran, and Juncheng Li, and Lihua Xie
November 2021, Entropy (Basel, Switzerland),
Maopeng Ran, and Juncheng Li, and Lihua Xie
October 2023, IEEE transactions on neural networks and learning systems,
Maopeng Ran, and Juncheng Li, and Lihua Xie
November 2022, ISA transactions,
Maopeng Ran, and Juncheng Li, and Lihua Xie
October 2022, IEEE transactions on neural networks and learning systems,
Maopeng Ran, and Juncheng Li, and Lihua Xie
October 2020, IEEE transactions on neural networks and learning systems,
Maopeng Ran, and Juncheng Li, and Lihua Xie
June 2007, IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society,
Maopeng Ran, and Juncheng Li, and Lihua Xie
October 2017, IEEE transactions on cybernetics,
Maopeng Ran, and Juncheng Li, and Lihua Xie
November 2022, ISA transactions,
Maopeng Ran, and Juncheng Li, and Lihua Xie
July 2023, IEEE transactions on neural networks and learning systems,
Copied contents to your clipboard!