摘要
Emerging In-Network Computing (INC) technique provides a new opportunity to improve application’s performance by using network programmability, computational capability, and storage capacity enabled by programmable switches. One typical application is Distributed Machine Learning (DML), which accelerates machine learning training by employing multiple works to train model parallelly. This paper introduces INC-based DML systems, analyzes performance improvement from using INC, and overviews current studies of INC-based DML systems. We also propose potential research directions for applying INC to DML systems.
源语言 | 英语 |
---|---|
页(从-至) | 1 |
页数 | 1 |
期刊 | IEEE Network |
DOI | |
出版状态 | 已接受/待刊 - 2024 |