HiBid: A Cross-Channel Constrained Bidding System With Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

Hao Wang; Bo Tang; Chi Harold Liu; Shangqin Mao; Jiahong Zhou; Zipeng Dai; Yaqi Sun; Qianlong Xie; Xingxing Wang; Dong Wang

doi:10.1109/TC.2023.3343111

HiBid: A Cross-Channel Constrained Bidding System With Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

Hao Wang, Bo Tang, Chi Harold Liu^*, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests cross multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single channel bidding, we explicitly consider cross-channel constrained bidding with budget allocation. Specifically, we propose a hierarchical offline deep reinforcement learning (DRL) framework called 'HiBid', consisted of a high-level planner equipped with auxiliary loss for non-competitive budget allocation, and a data augmentation enhanced low-level executor for adaptive bidding strategy in response to allocated budgets. Additionally, a CPC-guided action selection mechanism is introduced to satisfy the cross-channel CPC constraint. Through extensive experiments on both the large-scale log data and online A/B testing, we confirm that HiBid outperforms six baselines in terms of the number of clicks, CPC satisfactory ratio, and return-on-investment (ROI). We also deploy HiBid on Meituan advertising platform to already service tens of thousands of advertisers every day.

Original language	English
Pages (from-to)	815-828
Number of pages	14
Journal	IEEE Transactions on Computers
Volume	73
Issue number	3
DOIs	https://doi.org/10.1109/TC.2023.3343111
Publication status	Published - 1 Mar 2024

Keywords

Real-time bidding systems
cross-channel bidding
deep reinforcement learning

Access to Document

10.1109/TC.2023.3343111

Cite this

@article{c362008b454c44059693f88909a47ca3,

title = "HiBid: A Cross-Channel Constrained Bidding System With Budget Allocation by Hierarchical Offline Deep Reinforcement Learning",

abstract = "Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests cross multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single channel bidding, we explicitly consider cross-channel constrained bidding with budget allocation. Specifically, we propose a hierarchical offline deep reinforcement learning (DRL) framework called 'HiBid', consisted of a high-level planner equipped with auxiliary loss for non-competitive budget allocation, and a data augmentation enhanced low-level executor for adaptive bidding strategy in response to allocated budgets. Additionally, a CPC-guided action selection mechanism is introduced to satisfy the cross-channel CPC constraint. Through extensive experiments on both the large-scale log data and online A/B testing, we confirm that HiBid outperforms six baselines in terms of the number of clicks, CPC satisfactory ratio, and return-on-investment (ROI). We also deploy HiBid on Meituan advertising platform to already service tens of thousands of advertisers every day.",

keywords = "Real-time bidding systems, cross-channel bidding, deep reinforcement learning",

author = "Hao Wang and Bo Tang and Liu, {Chi Harold} and Shangqin Mao and Jiahong Zhou and Zipeng Dai and Yaqi Sun and Qianlong Xie and Xingxing Wang and Dong Wang",

note = "Publisher Copyright: {\textcopyright} 1968-2012 IEEE.",

year = "2024",

month = mar,

day = "1",

doi = "10.1109/TC.2023.3343111",

language = "English",

volume = "73",

pages = "815--828",

journal = "IEEE Transactions on Computers",

issn = "0018-9340",

publisher = "IEEE Computer Society",

number = "3",

}

TY - JOUR

T1 - HiBid

T2 - A Cross-Channel Constrained Bidding System With Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

AU - Wang, Hao

AU - Tang, Bo

AU - Liu, Chi Harold

AU - Mao, Shangqin

AU - Zhou, Jiahong

AU - Dai, Zipeng

AU - Sun, Yaqi

AU - Xie, Qianlong

AU - Wang, Xingxing

AU - Wang, Dong

PY - 2024/3/1

Y1 - 2024/3/1

N2 - Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests cross multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single channel bidding, we explicitly consider cross-channel constrained bidding with budget allocation. Specifically, we propose a hierarchical offline deep reinforcement learning (DRL) framework called 'HiBid', consisted of a high-level planner equipped with auxiliary loss for non-competitive budget allocation, and a data augmentation enhanced low-level executor for adaptive bidding strategy in response to allocated budgets. Additionally, a CPC-guided action selection mechanism is introduced to satisfy the cross-channel CPC constraint. Through extensive experiments on both the large-scale log data and online A/B testing, we confirm that HiBid outperforms six baselines in terms of the number of clicks, CPC satisfactory ratio, and return-on-investment (ROI). We also deploy HiBid on Meituan advertising platform to already service tens of thousands of advertisers every day.

AB - Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests cross multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single channel bidding, we explicitly consider cross-channel constrained bidding with budget allocation. Specifically, we propose a hierarchical offline deep reinforcement learning (DRL) framework called 'HiBid', consisted of a high-level planner equipped with auxiliary loss for non-competitive budget allocation, and a data augmentation enhanced low-level executor for adaptive bidding strategy in response to allocated budgets. Additionally, a CPC-guided action selection mechanism is introduced to satisfy the cross-channel CPC constraint. Through extensive experiments on both the large-scale log data and online A/B testing, we confirm that HiBid outperforms six baselines in terms of the number of clicks, CPC satisfactory ratio, and return-on-investment (ROI). We also deploy HiBid on Meituan advertising platform to already service tens of thousands of advertisers every day.

KW - Real-time bidding systems

KW - cross-channel bidding

KW - deep reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85180345224&partnerID=8YFLogxK

U2 - 10.1109/TC.2023.3343111

DO - 10.1109/TC.2023.3343111

M3 - Article

AN - SCOPUS:85180345224

SN - 0018-9340

VL - 73

SP - 815

EP - 828

JO - IEEE Transactions on Computers

JF - IEEE Transactions on Computers

IS - 3

ER -

HiBid: A Cross-Channel Constrained Bidding System With Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this