TY - GEN
T1 - SINGA-Easy
T2 - 29th ACM International Conference on Multimedia, MM 2021
AU - Xing, Naili
AU - Yeung, Sai Ho
AU - Cai, Cheng Hao
AU - Ng, Teck Khim
AU - Wang, Wei
AU - Yang, Kaiyuan
AU - Yang, Nan
AU - Zhang, Meihui
AU - Chen, Gang
AU - Ooi, Beng Chin
N1 - Publisher Copyright:
© 2021 Owner/Author.
PY - 2021/10/17
Y1 - 2021/10/17
N2 - Deep learning has achieved great success in a wide spectrum of multimedia applications such as image classification, natural language processing and multimodal data analysis. Recent years have seen the development of many deep learning frameworks that provide a high-level programming interface for users to design models, conduct training and deploy inference. However, it remains challenging to build an efficient end-to-end multimedia application with most existing frameworks. Specifically, in terms of usability, it is demanding for non-experts to implement deep learning models, obtain the right settings for the entire machine learning pipeline, manage models and datasets, and exploit external data sources all together. Further, in terms of adaptability, elastic computation solutions are much needed as the actual serving workload fluctuates constantly, and scaling the hardware resources to handle the fluctuating workload is typically infeasible. To address these challenges, we introduce SINGA-Easy, a new deep learning framework that provides distributed hyper-parameter tuning at the training stage, dynamic computational cost control at the inference stage, and intuitive user interactions with multimedia contents facilitated by model explanation. Our experiments on the training and deployment of multi-modality data analysis applications show that the framework is both usable and adaptable to dynamic inference loads. We implement SINGA-Easy on top of Apache SINGA and demonstrate our system with the entire machine learning life cycle.
AB - Deep learning has achieved great success in a wide spectrum of multimedia applications such as image classification, natural language processing and multimodal data analysis. Recent years have seen the development of many deep learning frameworks that provide a high-level programming interface for users to design models, conduct training and deploy inference. However, it remains challenging to build an efficient end-to-end multimedia application with most existing frameworks. Specifically, in terms of usability, it is demanding for non-experts to implement deep learning models, obtain the right settings for the entire machine learning pipeline, manage models and datasets, and exploit external data sources all together. Further, in terms of adaptability, elastic computation solutions are much needed as the actual serving workload fluctuates constantly, and scaling the hardware resources to handle the fluctuating workload is typically infeasible. To address these challenges, we introduce SINGA-Easy, a new deep learning framework that provides distributed hyper-parameter tuning at the training stage, dynamic computational cost control at the inference stage, and intuitive user interactions with multimedia contents facilitated by model explanation. Our experiments on the training and deployment of multi-modality data analysis applications show that the framework is both usable and adaptable to dynamic inference loads. We implement SINGA-Easy on top of Apache SINGA and demonstrate our system with the entire machine learning life cycle.
KW - data analytics
KW - deep learning
KW - distributed training
KW - dynamic inference
KW - multimedia application
UR - http://www.scopus.com/inward/record.url?scp=85119337317&partnerID=8YFLogxK
U2 - 10.1145/3474085.3475176
DO - 10.1145/3474085.3475176
M3 - Conference contribution
AN - SCOPUS:85119337317
T3 - MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia
SP - 1293
EP - 1302
BT - MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
Y2 - 20 October 2021 through 24 October 2021
ER -