Smooth Actor-Critic Algorithm for End-to-End Autonomous Driving

Wenjie Song*, Shixian Liu, Yujun Li, Yi Yang, Changle Xiang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

For the intelligent sequential decision-making tasks like autonomous driving, decisions or actions made by the agent in a short period of time should be smooth enough or not too choppy. In order to help the agent learn smooth actions (steering, accelerating, braking) for autonomous driving, this paper proposes the smooth actor-critic algorithm for both deterministic policy and stochastic policy systems. Specifically, a regularization term is added to the objective function of actorcritic methods to constrain the difference between neighbouring actions in a small region without affecting the convergence performance of the whole system. Then, the theoretical analysis and proof for the modified methods are conducted so that it can be theoretically guaranteed in terms of iterative improvements. Moreover, experiments in different simulation systems also prove that the methods can generate much smoother actions and obtain more robust performance for reinforcement learning-based End-to-End autonomous driving.

Original languageEnglish
Title of host publication2020 American Control Conference, ACC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3242-3248
Number of pages7
ISBN (Electronic)9781538682661
DOIs
Publication statusPublished - Jul 2020
Event2020 American Control Conference, ACC 2020 - Denver, United States
Duration: 1 Jul 20203 Jul 2020

Publication series

NameProceedings of the American Control Conference
Volume2020-July
ISSN (Print)0743-1619

Conference

Conference2020 American Control Conference, ACC 2020
Country/TerritoryUnited States
CityDenver
Period1/07/203/07/20

Fingerprint

Dive into the research topics of 'Smooth Actor-Critic Algorithm for End-to-End Autonomous Driving'. Together they form a unique fingerprint.

Cite this