Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-Minimum Phase Systems

Tavakkoli, Fatemeh; Sarhadi, Pouria; Clement, Benoit; Naeem, Wasif

dc.contributor.author	Tavakkoli, Fatemeh
dc.contributor.author	Sarhadi, Pouria
dc.contributor.author	Clement, Benoit
dc.contributor.author	Naeem, Wasif
dc.date.accessioned	2024-10-01T09:30:04Z
dc.date.available	2024-10-01T09:30:04Z
dc.date.issued	2024-05-22
dc.identifier.citation	Tavakkoli , F , Sarhadi , P , Clement , B & Naeem , W 2024 , Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-Minimum Phase Systems . in 2024 UKACC 14th International Conference on Control, CONTROL 2024 . 2024 UKACC 14th International Conference on Control, CONTROL 2024 , Institute of Electrical and Electronics Engineers (IEEE) , Winchester, UK , pp. 163-168 , CONTROL 2024: 14th United Kingdom Automatic Control Council (UKACC) International Conference on Control , Southampton , United Kingdom , 10/04/24 . https://doi.org/10.1109/CONTROL60310.2024.10531953
dc.identifier.citation	conference
dc.identifier.isbn	979-8-3503-7426-1
dc.identifier.other	Bibtex: 10531953
dc.identifier.other	ORCID: /0000-0002-6004-676X/work/168940964
dc.identifier.uri	http://hdl.handle.net/2299/28293
dc.description	© 2024 IEEE. This is the accepted manuscript version of an article which has been published in final form at https://doi.org/10.1109/CONTROL60310.2024.10531953
dc.description.abstract	Deep Reinforcement Learning (DRL) techniques have received significant attention in control and decision-making algorithms. Most applications involve complex decision-making systems, justified by the algorithms' computational power and cost. While model-based versions are emerging, model-free DRL approaches are intriguing for their independence from models, yet they remain relatively less explored in terms of performance, particularly in applied control. This study conducts a thorough performance analysis comparing the data-driven DRL paradigm with a classical state feedback controller, both designed based on the same cost (reward) function of the linear quadratic regulator (LQR) problem. Twelve additional performance criteria are introduced to assess the controllers' performance, independent of the LQR problem for which they are designed. Two Deep Deterministic Policy Gradient (DDPG)-based controllers are developed, leveraging DDPG's widespread reputation. These controllers are aimed at addressing a challenging setpoint tracking problem in a Non-Minimum Phase (NMP) system. The performance and robustness of the controllers are assessed in the presence of operational challenges, including disturbance, noise, initial conditions, and model uncertainties. The findings suggest that the DDPG controller demonstrates promising behavior under rigorous test conditions. Nevertheless, further improvements are necessary for the DDPG controller to outperform classical methods in all criteria. While DRL algorithms may excel in complex environments owing to the flexibility in the reward function definition, this paper offers practical insights and a comparison framework specifically designed to evaluate these algorithms within the context of control engineering.	en
dc.format.extent	6
dc.format.extent	855777
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation.ispartof	2024 UKACC 14th International Conference on Control, CONTROL 2024
dc.relation.ispartofseries	2024 UKACC 14th International Conference on Control, CONTROL 2024
dc.subject	Training
dc.subject	State feedback
dc.subject	Uncertainty
dc.subject	Costs
dc.subject	Computational modeling
dc.subject	Decision making
dc.subject	Noise
dc.subject	Control and Optimization
dc.subject	Energy Engineering and Power Technology
dc.subject	Control and Systems Engineering
dc.subject	Renewable Energy, Sustainability and the Environment
dc.title	Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-Minimum Phase Systems	en
dc.contributor.institution	Centre for Engineering Research
dc.contributor.institution	Communications and Intelligent Systems
dc.contributor.institution	Department of Engineering and Technology
dc.contributor.institution	School of Physics, Engineering & Computer Science
dc.identifier.url	http://www.scopus.com/inward/record.url?scp=85194879026&partnerID=8YFLogxK
rioxxterms.versionofrecord	10.1109/CONTROL60310.2024.10531953
rioxxterms.type	Other
herts.preservation.rarelyaccessed	true

Files in this item

Name:: UKACC24_0041_FI.pdf
Size:: 835.7Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Research publications

Show simple item record