TY - GEN
T1 - Reinforcement Learning for Nash Noncoperative and Pareto Cooperative Optimal Linear Systems
AU - Gajic, Zoran
AU - Memon, Zulfiqar
AU - Awan, Ahmed Bilal
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In this paper we use the reinforcement learning techniques for optimal control of linear-quadratic systems, and consider two situations: when controllers cooperate (Pareto strategies) and when they do not cooperate (Nash strategies). These situations are encountered in optimal control and reinforcement learning of several engineering systems (especially in energy systems), economics, and in general machine learning. We compare the corresponding optimal costs using normalized data assuming that the system initial conditions are uniformly distributed on the unit sphere, and provide an estimate how much cooperation helps in learning optimal controllers for these kind of problems. The theoretical results are demonstrated on an example of an electric power system.
AB - In this paper we use the reinforcement learning techniques for optimal control of linear-quadratic systems, and consider two situations: when controllers cooperate (Pareto strategies) and when they do not cooperate (Nash strategies). These situations are encountered in optimal control and reinforcement learning of several engineering systems (especially in energy systems), economics, and in general machine learning. We compare the corresponding optimal costs using normalized data assuming that the system initial conditions are uniformly distributed on the unit sphere, and provide an estimate how much cooperation helps in learning optimal controllers for these kind of problems. The theoretical results are demonstrated on an example of an electric power system.
KW - Nash policy iterations
KW - Parato policy iterations
KW - Reinforcement learning
KW - linear-quadratic dynamic optimization
KW - styling
UR - https://www.scopus.com/pages/publications/85189142508
U2 - 10.1109/ACIT58888.2023.10453794
DO - 10.1109/ACIT58888.2023.10453794
M3 - Conference contribution
AN - SCOPUS:85189142508
T3 - 2023 24th International Arab Conference on Information Technology, ACIT 2023
BT - 2023 24th International Arab Conference on Information Technology, ACIT 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th International Arab Conference on Information Technology, ACIT 2023
Y2 - 6 December 2023 through 8 December 2023
ER -