TY - JOUR
T1 - Reinforcement-Learning-Based Overhead Reduction for Online Fountain Codes With Limited Feedback
AU - Qin, Zijun
AU - Fei, Zesong
AU - Huang, Jingxuan
AU - Wang, Yeliang
AU - Xiao, Ming
AU - Yuan, Jinhong
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/7/1
Y1 - 2023/7/1
N2 - We investigate the application of reinforcement learning (RL) on online fountain codes, and propose two schemes to reduce the full-recovery overhead with limited feedback. First, we use RL in determining the optimal degree of coded symbols for a given number of feedback, and propose the RL-based degree determination (RL-DD), with the help of theoretical analysis of the relationship between recovery rate and buffer occupancy. Then we propose online fountain codes with no build-up phase using sectioned distribution (OFCNB-SD), where the encoder sends symbols whose degrees are sampled from different sections of an overall distribution, and the decoder is improved to utilize coded symbols that are not immediately decodable. We present theoretical analysis of OFCNB-SD, and introduce RL-based sectioned distribution (RL-SD) scheme where the sectioning of the overall distribution is optimized with RL. Simulation results show that our proposed schemes could achieve lower full-recovery overhead with limited feedback compared to existing schemes.
AB - We investigate the application of reinforcement learning (RL) on online fountain codes, and propose two schemes to reduce the full-recovery overhead with limited feedback. First, we use RL in determining the optimal degree of coded symbols for a given number of feedback, and propose the RL-based degree determination (RL-DD), with the help of theoretical analysis of the relationship between recovery rate and buffer occupancy. Then we propose online fountain codes with no build-up phase using sectioned distribution (OFCNB-SD), where the encoder sends symbols whose degrees are sampled from different sections of an overall distribution, and the decoder is improved to utilize coded symbols that are not immediately decodable. We present theoretical analysis of OFCNB-SD, and introduce RL-based sectioned distribution (RL-SD) scheme where the sectioning of the overall distribution is optimized with RL. Simulation results show that our proposed schemes could achieve lower full-recovery overhead with limited feedback compared to existing schemes.
KW - Online fountain codes
KW - feedback
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85159794979&partnerID=8YFLogxK
U2 - 10.1109/TCOMM.2023.3273413
DO - 10.1109/TCOMM.2023.3273413
M3 - Article
AN - SCOPUS:85159794979
SN - 1558-0857
VL - 71
SP - 3977
EP - 3991
JO - IEEE Transactions on Communications
JF - IEEE Transactions on Communications
IS - 7
ER -