TY - JOUR
T1 - Nonintrusive parameter adaptation of chemical process models with reinforcement learning
AU - Alhazmi, Khalid
AU - Sarathy, Mani
N1 - KAUST Repository Item: Exported on 2023-03-07
Acknowledged KAUST grant number(s): OSR-2019-CRG7-4077
Acknowledgements: This work was supported by King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research under the award number OSR-2019-CRG7-4077.
PY - 2023/2/7
Y1 - 2023/2/7
N2 - Model-based control is one of the most prevalent techniques for designing and controlling engineering systems. However, many of these systems are complex and characterized by changing dynamics. Hence, online system identification is required to achieve optimum adaptive control performance for such complex systems. This work proposes an algorithm for nonintrusive, online, nonlinear parameter estimation of physical models using deep reinforcement learning (RL). The problem of training a neural network for parameter estimation is formulated as a reinforcement learning problem. The RL-based parameter estimation policy is tested on a simulation of the selective hydrogenation of acetylene, which is a highly nonlinear system. The learned model estimation policy is able to correctly predict the states of the system with a prediction error of less than 1% in various conditions, such as in the presence of measurement noise and structural differences in models.
AB - Model-based control is one of the most prevalent techniques for designing and controlling engineering systems. However, many of these systems are complex and characterized by changing dynamics. Hence, online system identification is required to achieve optimum adaptive control performance for such complex systems. This work proposes an algorithm for nonintrusive, online, nonlinear parameter estimation of physical models using deep reinforcement learning (RL). The problem of training a neural network for parameter estimation is formulated as a reinforcement learning problem. The RL-based parameter estimation policy is tested on a simulation of the selective hydrogenation of acetylene, which is a highly nonlinear system. The learned model estimation policy is able to correctly predict the states of the system with a prediction error of less than 1% in various conditions, such as in the presence of measurement noise and structural differences in models.
UR - http://hdl.handle.net/10754/690061
UR - https://linkinghub.elsevier.com/retrieve/pii/S0959152423000264
UR - http://www.scopus.com/inward/record.url?scp=85149071144&partnerID=8YFLogxK
U2 - 10.1016/j.jprocont.2023.02.001
DO - 10.1016/j.jprocont.2023.02.001
M3 - Article
SN - 0959-1524
VL - 123
SP - 87
EP - 95
JO - Journal of Process Control
JF - Journal of Process Control
ER -