Inverse Reinforcement Learning in Tracking Control Based on Inverse Optimal Control. 2022

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

This article provides a novel inverse reinforcement learning (RL) algorithm that learns an unknown performance objective function for tracking control. The algorithm combines three steps: 1) an optimal control update; 2) a gradient descent correction step; and 3) an inverse optimal control (IOC) update. The new algorithm clarifies the relation between inverse RL and IOC. It is shown that the reward weight of an unknown performance objective that generates a target control policy may not be unique. We characterize the set of all weights that generate the same target control policy. We develop a model-based algorithm and, further, two model-free algorithms for systems with unknown model information. Finally, simulation experiments are presented to show the effectiveness of the proposed algorithms.

UI MeSH Term Description Entries
D007858 Learning Relatively permanent change in behavior that is the result of past experience or practice. The concept includes the acquisition of knowledge. Phenomenography
D012054 Reinforcement, Psychology The strengthening of a conditioned response. Negative Reinforcement,Positive Reinforcement,Psychological Reinforcement,Reinforcement (Psychology),Negative Reinforcements,Positive Reinforcements,Psychological Reinforcements,Psychology Reinforcement,Psychology Reinforcements,Reinforcement, Negative,Reinforcement, Positive,Reinforcement, Psychological,Reinforcements (Psychology),Reinforcements, Negative,Reinforcements, Positive,Reinforcements, Psychological,Reinforcements, Psychology
D003198 Computer Simulation Computer-based representation of physical systems and phenomena such as chemical processes. Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D000465 Algorithms A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. Algorithm
D012201 Reward An object or a situation that can serve to reinforce a response, to satisfy a motive, or to afford pleasure. Rewards

Related Publications

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
June 2023, ISA transactions,
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
July 2021, IEEE transactions on neural networks and learning systems,
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
February 2023, IEEE transactions on cybernetics,
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
December 2022, IEEE transactions on neural networks and learning systems,
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
April 2024, Sensors (Basel, Switzerland),
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
March 2017, IEEE transactions on neural networks and learning systems,
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
April 2024, Biomimetics (Basel, Switzerland),
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
November 2022, Sensors (Basel, Switzerland),
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
March 2020, IEEE transactions on neural networks and learning systems,
Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis
May 2024, IEEE transactions on neural networks and learning systems,
Copied contents to your clipboard!