TY - JOUR
T1 - HiBid
T2 - A Cross-Channel Constrained Bidding System With Budget Allocation by Hierarchical Offline Deep Reinforcement Learning
AU - Wang, Hao
AU - Tang, Bo
AU - Liu, Chi Harold
AU - Mao, Shangqin
AU - Zhou, Jiahong
AU - Dai, Zipeng
AU - Sun, Yaqi
AU - Xie, Qianlong
AU - Wang, Xingxing
AU - Wang, Dong
N1 - Publisher Copyright:
© 1968-2012 IEEE.
PY - 2024/3/1
Y1 - 2024/3/1
N2 - Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests cross multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single channel bidding, we explicitly consider cross-channel constrained bidding with budget allocation. Specifically, we propose a hierarchical offline deep reinforcement learning (DRL) framework called 'HiBid', consisted of a high-level planner equipped with auxiliary loss for non-competitive budget allocation, and a data augmentation enhanced low-level executor for adaptive bidding strategy in response to allocated budgets. Additionally, a CPC-guided action selection mechanism is introduced to satisfy the cross-channel CPC constraint. Through extensive experiments on both the large-scale log data and online A/B testing, we confirm that HiBid outperforms six baselines in terms of the number of clicks, CPC satisfactory ratio, and return-on-investment (ROI). We also deploy HiBid on Meituan advertising platform to already service tens of thousands of advertisers every day.
AB - Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests cross multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single channel bidding, we explicitly consider cross-channel constrained bidding with budget allocation. Specifically, we propose a hierarchical offline deep reinforcement learning (DRL) framework called 'HiBid', consisted of a high-level planner equipped with auxiliary loss for non-competitive budget allocation, and a data augmentation enhanced low-level executor for adaptive bidding strategy in response to allocated budgets. Additionally, a CPC-guided action selection mechanism is introduced to satisfy the cross-channel CPC constraint. Through extensive experiments on both the large-scale log data and online A/B testing, we confirm that HiBid outperforms six baselines in terms of the number of clicks, CPC satisfactory ratio, and return-on-investment (ROI). We also deploy HiBid on Meituan advertising platform to already service tens of thousands of advertisers every day.
KW - Real-time bidding systems
KW - cross-channel bidding
KW - deep reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85180345224&partnerID=8YFLogxK
U2 - 10.1109/TC.2023.3343111
DO - 10.1109/TC.2023.3343111
M3 - Article
AN - SCOPUS:85180345224
SN - 0018-9340
VL - 73
SP - 815
EP - 828
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 3
ER -