TY - GEN
T1 - Reinforcement-Learning-Enabled Beam Alignment for Water-Air Direct Optical Wireless Communications
AU - Liu, Jiayue
AU - Mao, Tianqi
AU - He, Dongxuan
AU - Yang, Yang
AU - Gao, Zhen
AU - Zheng, Dezhi
AU - Zhang, Jun
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The escalating interests on underwater exploration/ reconnaissance applications have motivated high-rate data transmission from underwater to airborne relaying platforms, especially under high-sea scenarios. Thanks to its broad bandwidth and superior confidentiality, Optical wireless communication has become one promising candidate for water-air transmission. However, the optical signals inevitably suffer from deviations when crossing the highly-dynamic water-air interfaces in the absence of relaying ships/buoys. To address the issue, this article proposes one novel beam alignment strategy based on deep reinforcement learning (DRL) for water-air direct optical wireless communications. Specifically, the dynamic water-air interface is mathematically modeled using sea-wave spectrum analysis, followed by characterization of the propagation channel with ray-tracing techniques. Then the deep deterministic policy gradient (DDPG) scheme is introduced for DRL-based transceiving beam alignment. A logarithm-exponential (LE) nonlinear reward function with respect to the received signal strength is designed for high-resolution rewarding between different actions. Simulation results validate the superiority of the proposed DRL-based beam alignment scheme.
AB - The escalating interests on underwater exploration/ reconnaissance applications have motivated high-rate data transmission from underwater to airborne relaying platforms, especially under high-sea scenarios. Thanks to its broad bandwidth and superior confidentiality, Optical wireless communication has become one promising candidate for water-air transmission. However, the optical signals inevitably suffer from deviations when crossing the highly-dynamic water-air interfaces in the absence of relaying ships/buoys. To address the issue, this article proposes one novel beam alignment strategy based on deep reinforcement learning (DRL) for water-air direct optical wireless communications. Specifically, the dynamic water-air interface is mathematically modeled using sea-wave spectrum analysis, followed by characterization of the propagation channel with ray-tracing techniques. Then the deep deterministic policy gradient (DDPG) scheme is introduced for DRL-based transceiving beam alignment. A logarithm-exponential (LE) nonlinear reward function with respect to the received signal strength is designed for high-resolution rewarding between different actions. Simulation results validate the superiority of the proposed DRL-based beam alignment scheme.
KW - deep reinforcement learning (DRL)
KW - dynamic water surface
KW - optical wireless communications (OWC)
KW - Water-air direct communications
UR - http://www.scopus.com/inward/record.url?scp=85206482912&partnerID=8YFLogxK
U2 - 10.1109/ICCC62479.2024.10681690
DO - 10.1109/ICCC62479.2024.10681690
M3 - Conference contribution
AN - SCOPUS:85206482912
T3 - 2024 IEEE/CIC International Conference on Communications in China, ICCC 2024
SP - 138
EP - 143
BT - 2024 IEEE/CIC International Conference on Communications in China, ICCC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE/CIC International Conference on Communications in China, ICCC 2024
Y2 - 7 August 2024 through 9 August 2024
ER -