R-AQM: Reverse ACK Active Queue Management in Multi-tenant Data Centers

Xinle Du, Tong Li*, Lei Xu, Kai Zheng, Meng Shen, Bo Wu, Ke Xu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

TCP incast has become a practical problem for high-bandwidth, low-latency transmissions, resulting in throughput degradation of up to 90% and delays of hundreds of milliseconds, severely impacting application performance. However, in virtualized multi-tenant data centers, host-based advancements in the TCP stack are hard to deploy from the operators perspective. Operators only provide infrastructure in the form of virtual machines, in which only tenants can directly modify the end-host TCP stack. In this paper, we present R-AQM, a switch-powered reverse ACK active queue management (R-AQM) mechanism for enhancing ACK-clocking effects through assisting legacy TCP. Specifically, R-AQM proactively intercepts ACKs and paces the ACK-clocked in-flight data packets, preventing TCP from suffering incast collapse. We implement and evaluate R-AQM in NS-3 simulation and NetFPGA-based hardware switch. Both simulation and testbed results show that R-AQM greatly improves TCP performance under heavy incast workloads by significantly lowering packet loss rate, reducing retransmission timeouts, and supporting 16 times (i.e., 60 → 1000) more senders. Meanwhile, the forward queuing delays are also reduced by 4.6 times.

Original languageEnglish
Title of host publication2021 IEEE 29th International Conference on Network Protocols, ICNP 2021
PublisherIEEE Computer Society
ISBN (Electronic)9781665441315
DOIs
Publication statusPublished - 2021
Event29th IEEE International Conference on Network Protocols, ICNP 2021 - Virtual, Online, United States
Duration: 1 Nov 20215 Nov 2021

Publication series

NameProceedings - International Conference on Network Protocols, ICNP
Volume2021-November
ISSN (Print)1092-1648

Conference

Conference29th IEEE International Conference on Network Protocols, ICNP 2021
Country/TerritoryUnited States
CityVirtual, Online
Period1/11/215/11/21

Fingerprint

Dive into the research topics of 'R-AQM: Reverse ACK Active Queue Management in Multi-tenant Data Centers'. Together they form a unique fingerprint.

Cite this