TY - JOUR
T1 - Multi-agent reinforcement learning enabling dynamic pricing policy for charging station operators
AU - Han, Ye
AU - Zhang, Xuefei
AU - Zhang, Jian
AU - Cui, Qimei
AU - Wang, Shuo
AU - Han, Zhu
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019
Y1 - 2019
N2 - The development of plug-in electric vehicles (PEVs) brings lucrative opportunities for charging station operators (CSOs). To attract more CSOs to the PEV market, provision of reasonable pricing policy is of great importance. However, dynamic environments and uncertain behavior of competitors make the pricing problem of CSOs challenging. In this paper, we focus on the dynamic pricing policy for maximizing the long-term profits of CSOs. Firstly, we propose a hierarchical framework to describe the economic association of PEV market, which is composed of smart grid, CSOs and charging stations (CSs) serving PEVs from top to bottom. Next, we leverage the Markov game to model the layer of CSOs as a competitive market. Finally, we design a dynamic pricing policy algorithm (DPPA) based on multi-agent reinforcement learning to achieve higher long-term profits of CSOs. Based on the real data of PEVs in Beijing, the experiment results show that DPPA has a significant improvement in long-term profit of CSOs, and the improvement gains increase over time. Moreover, DPPA can reduce the profit loss of CSOs effectively while involving more competitors.
AB - The development of plug-in electric vehicles (PEVs) brings lucrative opportunities for charging station operators (CSOs). To attract more CSOs to the PEV market, provision of reasonable pricing policy is of great importance. However, dynamic environments and uncertain behavior of competitors make the pricing problem of CSOs challenging. In this paper, we focus on the dynamic pricing policy for maximizing the long-term profits of CSOs. Firstly, we propose a hierarchical framework to describe the economic association of PEV market, which is composed of smart grid, CSOs and charging stations (CSs) serving PEVs from top to bottom. Next, we leverage the Markov game to model the layer of CSOs as a competitive market. Finally, we design a dynamic pricing policy algorithm (DPPA) based on multi-agent reinforcement learning to achieve higher long-term profits of CSOs. Based on the real data of PEVs in Beijing, the experiment results show that DPPA has a significant improvement in long-term profit of CSOs, and the improvement gains increase over time. Moreover, DPPA can reduce the profit loss of CSOs effectively while involving more competitors.
KW - Charging station operators
KW - Competitive market
KW - Hierarchical framework
KW - Multi-agent reinforcement learning
KW - Pricing policy
UR - http://www.scopus.com/inward/record.url?scp=85081969325&partnerID=8YFLogxK
U2 - 10.1109/GLOBECOM38437.2019.9013999
DO - 10.1109/GLOBECOM38437.2019.9013999
M3 - Conference article
AN - SCOPUS:85081969325
SN - 2334-0983
JO - Proceedings - IEEE Global Communications Conference, GLOBECOM
JF - Proceedings - IEEE Global Communications Conference, GLOBECOM
M1 - 9013999
T2 - 2019 IEEE Global Communications Conference, GLOBECOM 2019
Y2 - 9 December 2019 through 13 December 2019
ER -