Rafiki: Machine learning as an analytics service system

Wei Wang, Jinyang Gao, Meihui Zhang, Sheng Wang, Gang Chen, Teck Khim Ng, Beng Chin Ooi, Jie Shao, Moaz Reyad

科研成果: 期刊稿件会议文章同行评审

51 引用 (Scopus)

摘要

Big data analytics is gaining massive momentum in the last few years. Applying machine learning models to big data has become an implicit requirement or an expectation for most analysis tasks, especially on high-stakes applications. Typical applications include sentiment analysis against reviews for analyzing on-line products, image classification in food logging applications for monitoring user's daily intake, and stock movement prediction. Extending traditional database systems to support the above analysis is intriguing but challenging. First, it is almost impossible to implement all machine learning models in the database engines. Second, expert knowledge is required to optimize the training and inference procedures in terms of efficiency and effectiveness, which imposes heavy burden on the system users. In this paper, we develop and present a system, called Rafiki, to provide the training and inference service of machine learning models. Rafiki provides distributed hyper-parameter tuning for the training service, and online ensemble modeling for the inference service which trades off between latency and accuracy. Experimental results confirm the efficiency, effectiveness, scalability and usability of Rafiki.

源语言英语
页(从-至)128-140
页数13
期刊Proceedings of the VLDB Endowment
12
2
DOI
出版状态已出版 - 2018
已对外发布
活动45th International Conference on Very Large Data Bases, VLDB 2019 - Los Angeles, 美国
期限: 26 8月 201730 8月 2017

指纹

探究 'Rafiki: Machine learning as an analytics service system' 的科研主题。它们共同构成独一无二的指纹。

引用此