A deterministic policy gradient based load control policy in direct current distribution networks

Hong Duan; Xu Zhou; Xianhong Kang; Zhongjing Ma

A deterministic policy gradient based load control policy in direct current distribution networks

Hong Duan, Xu Zhou, Xianhong Kang^*, Zhongjing Ma

^*Corresponding author for this work

School of Automation

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Developing algorithms for global optimum seeking of non-convex optimization problems has special potential in the real world. Previous researches in this field suffer from resulting a local optimum or losing some accuracy by convex relaxation. In this paper, we consider a demand side management (DSM) problem in direct current (DC) distribution networks as an application to study the global optimum seeking of non-convex optimization. Due to the voltage and network constraints, non-convexity appears in the objective function taking into account the tradeoff between the operation costs and users' preferences. By the freedom to express learning problem as a non-convex optimization, we explore a deterministic policy gradient (DPG) based algorithm to calculate the global optimum. A policy network and a polynomial regression critic are built to learn the optimal policy under an exploration noise. Numerical results are provided to demonstrate the DPG algorithm increasing the probability of convergence to the global optimum.

Original language	English
Title of host publication	Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019
Publisher	International Workshop on Computer Science and Engineering (WCSE)
Pages	996-1001
Number of pages	6
ISBN (Electronic)	9789811416842
Publication status	Published - 2020
Event	2019 9th International Workshop on Computer Science and Engineering, WCSE 2019 - Hong Kong, Hong Kong Duration: 15 Jun 2019 → 17 Jun 2019

Publication series

Name	Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019

Conference

Conference	2019 9th International Workshop on Computer Science and Engineering, WCSE 2019
Country/Territory	Hong Kong
City	Hong Kong
Period	15/06/19 → 17/06/19

Keywords

Demand-side management (DSM)
Deterministic policy gradient
Distribution networks
Reinforcement learning

Cite this

Duan, H., Zhou, X., Kang, X., & Ma, Z. (2020). A deterministic policy gradient based load control policy in direct current distribution networks. In Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019 (pp. 996-1001). (Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019). International Workshop on Computer Science and Engineering (WCSE).

Duan, Hong ; Zhou, Xu ; Kang, Xianhong et al. / A deterministic policy gradient based load control policy in direct current distribution networks. Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019. International Workshop on Computer Science and Engineering (WCSE), 2020. pp. 996-1001 (Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019).

@inproceedings{db1ac7d97ca0489ba369e229f4ce829c,

title = "A deterministic policy gradient based load control policy in direct current distribution networks",

abstract = "Developing algorithms for global optimum seeking of non-convex optimization problems has special potential in the real world. Previous researches in this field suffer from resulting a local optimum or losing some accuracy by convex relaxation. In this paper, we consider a demand side management (DSM) problem in direct current (DC) distribution networks as an application to study the global optimum seeking of non-convex optimization. Due to the voltage and network constraints, non-convexity appears in the objective function taking into account the tradeoff between the operation costs and users' preferences. By the freedom to express learning problem as a non-convex optimization, we explore a deterministic policy gradient (DPG) based algorithm to calculate the global optimum. A policy network and a polynomial regression critic are built to learn the optimal policy under an exploration noise. Numerical results are provided to demonstrate the DPG algorithm increasing the probability of convergence to the global optimum.",

keywords = "Demand-side management (DSM), Deterministic policy gradient, Distribution networks, Reinforcement learning",

author = "Hong Duan and Xu Zhou and Xianhong Kang and Zhongjing Ma",

note = "Publisher Copyright: {\textcopyright} WCSE 2019. All rights reserved.; 2019 9th International Workshop on Computer Science and Engineering, WCSE 2019 ; Conference date: 15-06-2019 Through 17-06-2019",

year = "2020",

language = "English",

series = "Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019",

publisher = "International Workshop on Computer Science and Engineering (WCSE)",

pages = "996--1001",

booktitle = "Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019",

}

Duan, H, Zhou, X, Kang, X & Ma, Z 2020, A deterministic policy gradient based load control policy in direct current distribution networks. in Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019. Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019, International Workshop on Computer Science and Engineering (WCSE), pp. 996-1001, 2019 9th International Workshop on Computer Science and Engineering, WCSE 2019, Hong Kong, Hong Kong, 15/06/19.

A deterministic policy gradient based load control policy in direct current distribution networks. / Duan, Hong; Zhou, Xu; Kang, Xianhong et al.
Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019. International Workshop on Computer Science and Engineering (WCSE), 2020. p. 996-1001 (Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - A deterministic policy gradient based load control policy in direct current distribution networks

AU - Duan, Hong

AU - Zhou, Xu

AU - Kang, Xianhong

AU - Ma, Zhongjing

PY - 2020

Y1 - 2020

N2 - Developing algorithms for global optimum seeking of non-convex optimization problems has special potential in the real world. Previous researches in this field suffer from resulting a local optimum or losing some accuracy by convex relaxation. In this paper, we consider a demand side management (DSM) problem in direct current (DC) distribution networks as an application to study the global optimum seeking of non-convex optimization. Due to the voltage and network constraints, non-convexity appears in the objective function taking into account the tradeoff between the operation costs and users' preferences. By the freedom to express learning problem as a non-convex optimization, we explore a deterministic policy gradient (DPG) based algorithm to calculate the global optimum. A policy network and a polynomial regression critic are built to learn the optimal policy under an exploration noise. Numerical results are provided to demonstrate the DPG algorithm increasing the probability of convergence to the global optimum.

AB - Developing algorithms for global optimum seeking of non-convex optimization problems has special potential in the real world. Previous researches in this field suffer from resulting a local optimum or losing some accuracy by convex relaxation. In this paper, we consider a demand side management (DSM) problem in direct current (DC) distribution networks as an application to study the global optimum seeking of non-convex optimization. Due to the voltage and network constraints, non-convexity appears in the objective function taking into account the tradeoff between the operation costs and users' preferences. By the freedom to express learning problem as a non-convex optimization, we explore a deterministic policy gradient (DPG) based algorithm to calculate the global optimum. A policy network and a polynomial regression critic are built to learn the optimal policy under an exploration noise. Numerical results are provided to demonstrate the DPG algorithm increasing the probability of convergence to the global optimum.

KW - Demand-side management (DSM)

KW - Deterministic policy gradient

KW - Distribution networks

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85081097981&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85081097981

T3 - Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019

SP - 996

EP - 1001

BT - Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019

PB - International Workshop on Computer Science and Engineering (WCSE)

T2 - 2019 9th International Workshop on Computer Science and Engineering, WCSE 2019

Y2 - 15 June 2019 through 17 June 2019

ER -

Duan H, Zhou X, Kang X, Ma Z. A deterministic policy gradient based load control policy in direct current distribution networks. In Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019. International Workshop on Computer Science and Engineering (WCSE). 2020. p. 996-1001. (Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, WCSE 2019).

A deterministic policy gradient based load control policy in direct current distribution networks

Abstract

Publication series

Conference

Keywords

Other files and links

Fingerprint

Cite this