Reinforcement Learning Based Efficiency Optimization Scheme for the DAB DC-DC Converter with Triple-Phase-Shift Modulation

Yuanhong Tang; Weihao Hu; Jian Xiao; Zhangyong Chen; Qi Huang; Zhe Chen; Frede Blaabjerg

doi:10.1109/TIE.2020.3007113

Reinforcement Learning Based Efficiency Optimization Scheme for the DAB DC-DC Converter with Triple-Phase-Shift Modulation

Yuanhong Tang, Weihao Hu, Jian Xiao, Zhangyong Chen, Qi Huang, Zhe Chen, Frede Blaabjerg

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

66 Citationer (Scopus)

220 Downloads (Pure)

Abstract

Aim to improve the power efficiency of the dual-active-bridge (DAB) dc–dc converter, an efficiency optimization scheme with triple-phase-shift (TPS) modulation using reinforcement learning (RL) is proposed in this article. More specifically, the Q-learning algorithm, as a typical algorithm of the RL, is applied to train an agent offline to obtain an optimized modulation strategy, and then the trained agent provides control decisions online in a real-time manner for the DAB dc–dc converter according to the current operating environment. The main objective is to obtain the optimal phase-shift angles for the DAB dc–dc converter, which can achieve the maximum power efficiency by reducing the power losses. Moreover, all possible operation modes of the TPS modulation are considered during the offline training process of the Q-learning algorithm. Thus, the cumbersome process for selecting the optimal operation mode in the conventional schemes can be circumvented successfully. Based on these merits, the proposed efficiency optimization scheme using the RL can realize the excellent performances for the whole load conditions and voltage conversion ratios. Finally, a 1.2-KW prototyped is built, and the simulation and the experimental results demonstrate that the power efficiency can be improved by using the optimization scheme based on the RL.

Originalsprog	Engelsk
Artikelnummer	9138774
Tidsskrift	I E E E Transactions on Industrial Electronics
Vol/bind	68
Udgave nummer	8
Sider (fra-til)	7350 - 7361
Antal sider	12
ISSN	0278-0046
DOI	https://doi.org/10.1109/TIE.2020.3007113
Status	Udgivet - aug. 2021

Adgang til dokumentet

10.1109/TIE.2020.3007113

Accepted author manuscriptAccepteret manuskript, 1,96 MB

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Citationsformater

@article{3776a47716264d808e9812510fda92d5,

title = "Reinforcement Learning Based Efficiency Optimization Scheme for the DAB DC-DC Converter with Triple-Phase-Shift Modulation",

abstract = "Aim to improve the power efficiency of the dual-active-bridge (DAB) dc–dc converter, an efficiency optimization scheme with triple-phase-shift (TPS) modulation using reinforcement learning (RL) is proposed in this article. More specifically, the Q-learning algorithm, as a typical algorithm of the RL, is applied to train an agent offline to obtain an optimized modulation strategy, and then the trained agent provides control decisions online in a real-time manner for the DAB dc–dc converter according to the current operating environment. The main objective is to obtain the optimal phase-shift angles for the DAB dc–dc converter, which can achieve the maximum power efficiency by reducing the power losses. Moreover, all possible operation modes of the TPS modulation are considered during the offline training process of the Q-learning algorithm. Thus, the cumbersome process for selecting the optimal operation mode in the conventional schemes can be circumvented successfully. Based on these merits, the proposed efficiency optimization scheme using the RL can realize the excellent performances for the whole load conditions and voltage conversion ratios. Finally, a 1.2-KW prototyped is built, and the simulation and the experimental results demonstrate that the power efficiency can be improved by using the optimization scheme based on the RL.",

keywords = "DAB DC-DC converter, power efficiency, optimization, Reinforcement Learning (RL), Q-learning",

author = "Yuanhong Tang and Weihao Hu and Jian Xiao and Zhangyong Chen and Qi Huang and Zhe Chen and Frede Blaabjerg",

year = "2021",

month = aug,

doi = "10.1109/TIE.2020.3007113",

language = "English",

volume = "68",

pages = "7350 -- 7361",

journal = "I E E E Transactions on Industrial Electronics",

issn = "0278-0046",

publisher = "IEEE",

number = "8",

}

TY - JOUR

T1 - Reinforcement Learning Based Efficiency Optimization Scheme for the DAB DC-DC Converter with Triple-Phase-Shift Modulation

AU - Tang, Yuanhong

AU - Hu, Weihao

AU - Xiao, Jian

AU - Chen, Zhangyong

AU - Huang, Qi

AU - Chen, Zhe

AU - Blaabjerg, Frede

PY - 2021/8

Y1 - 2021/8

N2 - Aim to improve the power efficiency of the dual-active-bridge (DAB) dc–dc converter, an efficiency optimization scheme with triple-phase-shift (TPS) modulation using reinforcement learning (RL) is proposed in this article. More specifically, the Q-learning algorithm, as a typical algorithm of the RL, is applied to train an agent offline to obtain an optimized modulation strategy, and then the trained agent provides control decisions online in a real-time manner for the DAB dc–dc converter according to the current operating environment. The main objective is to obtain the optimal phase-shift angles for the DAB dc–dc converter, which can achieve the maximum power efficiency by reducing the power losses. Moreover, all possible operation modes of the TPS modulation are considered during the offline training process of the Q-learning algorithm. Thus, the cumbersome process for selecting the optimal operation mode in the conventional schemes can be circumvented successfully. Based on these merits, the proposed efficiency optimization scheme using the RL can realize the excellent performances for the whole load conditions and voltage conversion ratios. Finally, a 1.2-KW prototyped is built, and the simulation and the experimental results demonstrate that the power efficiency can be improved by using the optimization scheme based on the RL.

AB - Aim to improve the power efficiency of the dual-active-bridge (DAB) dc–dc converter, an efficiency optimization scheme with triple-phase-shift (TPS) modulation using reinforcement learning (RL) is proposed in this article. More specifically, the Q-learning algorithm, as a typical algorithm of the RL, is applied to train an agent offline to obtain an optimized modulation strategy, and then the trained agent provides control decisions online in a real-time manner for the DAB dc–dc converter according to the current operating environment. The main objective is to obtain the optimal phase-shift angles for the DAB dc–dc converter, which can achieve the maximum power efficiency by reducing the power losses. Moreover, all possible operation modes of the TPS modulation are considered during the offline training process of the Q-learning algorithm. Thus, the cumbersome process for selecting the optimal operation mode in the conventional schemes can be circumvented successfully. Based on these merits, the proposed efficiency optimization scheme using the RL can realize the excellent performances for the whole load conditions and voltage conversion ratios. Finally, a 1.2-KW prototyped is built, and the simulation and the experimental results demonstrate that the power efficiency can be improved by using the optimization scheme based on the RL.

KW - DAB DC-DC converter

KW - power efficiency

KW - optimization

KW - Reinforcement Learning (RL)

KW - Q-learning

U2 - 10.1109/TIE.2020.3007113

DO - 10.1109/TIE.2020.3007113

M3 - Journal article

SN - 0278-0046

VL - 68

SP - 7350

EP - 7361

JO - I E E E Transactions on Industrial Electronics

JF - I E E E Transactions on Industrial Electronics

IS - 8

M1 - 9138774

ER -

Reinforcement Learning Based Efficiency Optimization Scheme for the DAB DC-DC Converter with Triple-Phase-Shift Modulation

Abstract

Adgang til dokumentet

AUB Link

Fingeraftryk

Citationsformater