Biomaterial Database

Inverse Reinforcement Learning in Tracking Control Based on Inverse Optimal Control. 2022

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

This article provides a novel inverse reinforcement learning (RL) algorithm that learns an unknown performance objective function for tracking control. The algorithm combines three steps: 1) an optimal control update; 2) a gradient descent correction step; and 3) an inverse optimal control (IOC) update. The new algorithm clarifies the relation between inverse RL and IOC. It is shown that the reward weight of an unknown performance objective that generates a target control policy may not be unique. We characterize the set of all weights that generate the same target control policy. We develop a model-based algorithm and, further, two model-free algorithms for systems with unknown model information. Finally, simulation experiments are presented to show the effectiveness of the proposed algorithms.

UI	MeSH Term	Description	Entries
D007858	Learning	Relatively permanent change in behavior that is the result of past experience or practice. The concept includes the acquisition of knowledge.	Phenomenography
D012054	Reinforcement, Psychology	The strengthening of a conditioned response.	Negative Reinforcement,Positive Reinforcement,Psychological Reinforcement,Reinforcement (Psychology),Negative Reinforcements,Positive Reinforcements,Psychological Reinforcements,Psychology Reinforcement,Psychology Reinforcements,Reinforcement, Negative,Reinforcement, Positive,Reinforcement, Psychological,Reinforcements (Psychology),Reinforcements, Negative,Reinforcements, Positive,Reinforcements, Psychological,Reinforcements, Psychology
D003198	Computer Simulation	Computer-based representation of physical systems and phenomena such as chemical processes.	Computational Modeling,Computational Modelling,Computer Models,In silico Modeling,In silico Models,In silico Simulation,Models, Computer,Computerized Models,Computer Model,Computer Simulations,Computerized Model,In silico Model,Model, Computer,Model, Computerized,Model, In silico,Modeling, Computational,Modeling, In silico,Modelling, Computational,Simulation, Computer,Simulation, In silico,Simulations, Computer
D000465	Algorithms	A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task.	Algorithm
D012201	Reward	An object or a situation that can serve to reinforce a response, to satisfy a motive, or to afford pleasure.	Rewards

Related Publications

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Adaptive optimal trajectory tracking control of AUVs based on reinforcement learning.

June 2023, ISA transactions,

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Reinforcement Learning-Based Optimal Tracking Control of an Unknown Unmanned Surface Vehicle.

July 2021, IEEE transactions on neural networks and learning systems,

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles.

February 2023, IEEE transactions on cybernetics,

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Reinforcement Learning Based Optimal Tracking Control Under Unmeasurable Disturbances With Application to HVAC Systems.

December 2022, IEEE transactions on neural networks and learning systems,

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

USV Trajectory Tracking Control Based on Receding Horizon Reinforcement Learning.

April 2024, Sensors (Basel, Switzerland),

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Model-Based Reinforcement Learning for Infinite-Horizon Approximate Optimal Tracking.

March 2017, IEEE transactions on neural networks and learning systems,

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Trajectory Tracking Control of Variable Sweep Aircraft Based on Reinforcement Learning.

April 2024, Biomimetics (Basel, Switzerland),

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Optimal Greedy Control in Reinforcement Learning.

November 2022, Sensors (Basel, Switzerland),

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Trajectory Tracking on Uncertain Complex Networks via NN-Based Inverse Optimal Pinning Control.

March 2020, IEEE transactions on neural networks and learning systems,

Wenqian Xue, and Patrik Kolaric, and Jialu Fan, and Bosen Lian, and Tianyou Chai, and Frank L Lewis

Two-Time Scale Tracking Control of Flexible Robots With Primal-Dual Inverse Reinforcement Learning.

May 2024, IEEE transactions on neural networks and learning systems,

Inverse Reinforcement Learning in Tracking Control Based on Inverse Optimal Control. 2022

Related Publications

SEARCH

RESOURCES

HELP

BIOMATERIAL MARKETPLACE

Selection Actions

Need Help?