TY - GEN
T1 - Trained Model Reuse of Autonomous-Driving in Pygame with Deep Reinforcement Learning
AU - Guo, Youtian
AU - Gao, Qi
AU - Pan, Feng
N1 - Publisher Copyright:
© 2020 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2020/7
Y1 - 2020/7
N2 - Autonomous-Driving technology has begun to bring great convenience to daily trip, transportation, and surveying harsh environment. Considering that deep reinforcement learning has requirements for the convergence performance of the training results, and the actual training results sometimes cannot converge steadily or fail to reach the training goals, in this paper, the trained model reuse method was proposed, which can use the trained model generates Q(St, At) and can be used as a part of Deep Reinforcement Learning model, and this model was built based on the value function that could predict the Q value corresponding to the various actions performed in the environment state. In the Pygame platform, a simplified traffic simulation environment was set, it is observed that the Autonomous-Driving vehicle could run smoothly without collision in a fixed-length test simulation environment, and this trained model reuse method could help autonomous vehicle accelerate the learning process, obtain better simulation results during most of the training process, save simulation time and computing resources.
AB - Autonomous-Driving technology has begun to bring great convenience to daily trip, transportation, and surveying harsh environment. Considering that deep reinforcement learning has requirements for the convergence performance of the training results, and the actual training results sometimes cannot converge steadily or fail to reach the training goals, in this paper, the trained model reuse method was proposed, which can use the trained model generates Q(St, At) and can be used as a part of Deep Reinforcement Learning model, and this model was built based on the value function that could predict the Q value corresponding to the various actions performed in the environment state. In the Pygame platform, a simplified traffic simulation environment was set, it is observed that the Autonomous-Driving vehicle could run smoothly without collision in a fixed-length test simulation environment, and this trained model reuse method could help autonomous vehicle accelerate the learning process, obtain better simulation results during most of the training process, save simulation time and computing resources.
KW - Deep Reinforcement Learning
KW - Experience Replay
KW - Pygame Platform
KW - Trained Model Reuse
UR - http://www.scopus.com/inward/record.url?scp=85091396240&partnerID=8YFLogxK
U2 - 10.23919/CCC50068.2020.9188547
DO - 10.23919/CCC50068.2020.9188547
M3 - Conference contribution
AN - SCOPUS:85091396240
T3 - Chinese Control Conference, CCC
SP - 5660
EP - 5664
BT - Proceedings of the 39th Chinese Control Conference, CCC 2020
A2 - Fu, Jun
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 39th Chinese Control Conference, CCC 2020
Y2 - 27 July 2020 through 29 July 2020
ER -