Abstract
This paper presents a deep reinforcement learning-based control strategy for underwater manipulator systems, addressing the challenges of environmental disturbances and actuator faults, which is of vital importance for enhancing the human-computer interaction experience. A dynamic model of a single-joint underwater robotic arm is established using Newton-Euler formalism, incorporating the effects of added mass and hydrodynamic drag. A TD3 (Twin Delayed Deep Deterministic Policy Gradient) algorithm is developed for adaptive tracking, integrating a carefully designed reward function that balances tracking precision, control smoothness, and energy efficiency. Simulation results under five typical conditions, including disturbances and actuator faults, demonstrate that the proposed TD3 controller achieves high-precision tracking with strong adaptability and fault tolerance.
| Original language | English |
|---|---|
| Pages (from-to) | 430-435 |
| Number of pages | 6 |
| Journal | IFAC-PapersOnLine |
| Volume | 59 |
| Issue number | 35 |
| DOIs | |
| Publication status | Published - 2025 |
| Externally published | Yes |
| Event | 16th IFAC Symposium on Analysis, Design and Evaluation of Human-Machine Systems, HMS 2025 - Beijing, China Duration: 18 Nov 2025 → 21 Nov 2025 |
Keywords
- Underwater manipulator
- adaptive control
- deep reinforcement learning
- fault tolerant control
- twin delayed deep deterministic policy gradient
Fingerprint
Dive into the research topics of 'Deep Reinforcement Learning-Based Control Strategy for Underwater Manipulator Systems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver