Work-in-Progress: Maximizing Model Accuracy in Real-time and Iterative Machine Learning

Rui Han; Fan Zhang; Lydia Y. Chen; Jianfeng Zhan

doi:10.1109/RTSS.2017.00055

Work-in-Progress: Maximizing Model Accuracy in Real-time and Iterative Machine Learning

Rui Han, Fan Zhang, Lydia Y. Chen, Jianfeng Zhan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Citations (Scopus)

Abstract

As iterative machine learning (ML) (e.g. neural network based supervised learning and k-means clustering) becomes more ubiquitous in our daily life, it is becoming increasingly important to complete model training quickly to support real-time decision making, while still achieving high model accuracy (e.g. low prediction errors) that is critical for profits of ML tasks. Motivated by the observation that the small proportions of accuracy-critical input data can contribute to large parts of model accuracy in many iterative ML applications, this paper introduces a system middleware to maximize model accuracy by spending the limited time budget on the most accuracy-related input data. To achieve this, our approach employs a fast method to divide the input data into multiple parts of similar points and represents each part with an aggregated data point. Using these points, it quickly estimates the correlations between different parts and model accuracy, thus allowing ML tasks to process the most accuracy-related parts first. We incorporate our approach with two popular supervised and unsupervised ML algorithms on Spark and demonstrate its benefits in providing high model accuracy under short deadlines.

Original language	English
Title of host publication	Proceedings - 2017 IEEE Real-Time Systems Symposium, RTSS 2017
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	351-353
Number of pages	3
ISBN (Electronic)	9781538614143
DOIs	https://doi.org/10.1109/RTSS.2017.00055
Publication status	Published - 2 Jul 2017
Externally published	Yes
Event	38th IEEE Real-Time Systems Symposium, RTSS 2017 - Paris, France Duration: 5 Oct 2017 → 8 Oct 2017

Publication series

Name	Proceedings - Real-Time Systems Symposium
Volume	2018-January
ISSN (Print)	1052-8725

Conference

Conference	38th IEEE Real-Time Systems Symposium, RTSS 2017
Country/Territory	France
City	Paris
Period	5/10/17 → 8/10/17

Keywords

Accuracy-aware-processing
Machine-learning

Access to Document

10.1109/RTSS.2017.00055

Cite this

Han, R., Zhang, F., Chen, L. Y., & Zhan, J. (2017). Work-in-Progress: Maximizing Model Accuracy in Real-time and Iterative Machine Learning. In Proceedings - 2017 IEEE Real-Time Systems Symposium, RTSS 2017 (pp. 351-353). (Proceedings - Real-Time Systems Symposium; Vol. 2018-January). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/RTSS.2017.00055

@inproceedings{14698aefa64e4dd68742e968d0ba4782,

title = "Work-in-Progress: Maximizing Model Accuracy in Real-time and Iterative Machine Learning",

abstract = "As iterative machine learning (ML) (e.g. neural network based supervised learning and k-means clustering) becomes more ubiquitous in our daily life, it is becoming increasingly important to complete model training quickly to support real-time decision making, while still achieving high model accuracy (e.g. low prediction errors) that is critical for profits of ML tasks. Motivated by the observation that the small proportions of accuracy-critical input data can contribute to large parts of model accuracy in many iterative ML applications, this paper introduces a system middleware to maximize model accuracy by spending the limited time budget on the most accuracy-related input data. To achieve this, our approach employs a fast method to divide the input data into multiple parts of similar points and represents each part with an aggregated data point. Using these points, it quickly estimates the correlations between different parts and model accuracy, thus allowing ML tasks to process the most accuracy-related parts first. We incorporate our approach with two popular supervised and unsupervised ML algorithms on Spark and demonstrate its benefits in providing high model accuracy under short deadlines.",

keywords = "Accuracy-aware-processing, Machine-learning",

author = "Rui Han and Fan Zhang and Chen, {Lydia Y.} and Jianfeng Zhan",

note = "Publisher Copyright: {\textcopyright} 2017 IEEE.; 38th IEEE Real-Time Systems Symposium, RTSS 2017 ; Conference date: 05-10-2017 Through 08-10-2017",

year = "2017",

month = jul,

day = "2",

doi = "10.1109/RTSS.2017.00055",

language = "English",

series = "Proceedings - Real-Time Systems Symposium",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "351--353",

booktitle = "Proceedings - 2017 IEEE Real-Time Systems Symposium, RTSS 2017",

address = "United States",

}

Han, R, Zhang, F, Chen, LY & Zhan, J 2017, Work-in-Progress: Maximizing Model Accuracy in Real-time and Iterative Machine Learning. in Proceedings - 2017 IEEE Real-Time Systems Symposium, RTSS 2017. Proceedings - Real-Time Systems Symposium, vol. 2018-January, Institute of Electrical and Electronics Engineers Inc., pp. 351-353, 38th IEEE Real-Time Systems Symposium, RTSS 2017, Paris, France, 5/10/17. https://doi.org/10.1109/RTSS.2017.00055

Work-in-Progress: Maximizing Model Accuracy in Real-time and Iterative Machine Learning. / Han, Rui; Zhang, Fan; Chen, Lydia Y. et al.
Proceedings - 2017 IEEE Real-Time Systems Symposium, RTSS 2017. Institute of Electrical and Electronics Engineers Inc., 2017. p. 351-353 (Proceedings - Real-Time Systems Symposium; Vol. 2018-January).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Work-in-Progress

T2 - 38th IEEE Real-Time Systems Symposium, RTSS 2017

AU - Han, Rui

AU - Zhang, Fan

AU - Chen, Lydia Y.

AU - Zhan, Jianfeng

PY - 2017/7/2

Y1 - 2017/7/2

N2 - As iterative machine learning (ML) (e.g. neural network based supervised learning and k-means clustering) becomes more ubiquitous in our daily life, it is becoming increasingly important to complete model training quickly to support real-time decision making, while still achieving high model accuracy (e.g. low prediction errors) that is critical for profits of ML tasks. Motivated by the observation that the small proportions of accuracy-critical input data can contribute to large parts of model accuracy in many iterative ML applications, this paper introduces a system middleware to maximize model accuracy by spending the limited time budget on the most accuracy-related input data. To achieve this, our approach employs a fast method to divide the input data into multiple parts of similar points and represents each part with an aggregated data point. Using these points, it quickly estimates the correlations between different parts and model accuracy, thus allowing ML tasks to process the most accuracy-related parts first. We incorporate our approach with two popular supervised and unsupervised ML algorithms on Spark and demonstrate its benefits in providing high model accuracy under short deadlines.

AB - As iterative machine learning (ML) (e.g. neural network based supervised learning and k-means clustering) becomes more ubiquitous in our daily life, it is becoming increasingly important to complete model training quickly to support real-time decision making, while still achieving high model accuracy (e.g. low prediction errors) that is critical for profits of ML tasks. Motivated by the observation that the small proportions of accuracy-critical input data can contribute to large parts of model accuracy in many iterative ML applications, this paper introduces a system middleware to maximize model accuracy by spending the limited time budget on the most accuracy-related input data. To achieve this, our approach employs a fast method to divide the input data into multiple parts of similar points and represents each part with an aggregated data point. Using these points, it quickly estimates the correlations between different parts and model accuracy, thus allowing ML tasks to process the most accuracy-related parts first. We incorporate our approach with two popular supervised and unsupervised ML algorithms on Spark and demonstrate its benefits in providing high model accuracy under short deadlines.

KW - Accuracy-aware-processing

KW - Machine-learning

UR - http://www.scopus.com/inward/record.url?scp=85046357376&partnerID=8YFLogxK

U2 - 10.1109/RTSS.2017.00055

DO - 10.1109/RTSS.2017.00055

M3 - Conference contribution

AN - SCOPUS:85046357376

T3 - Proceedings - Real-Time Systems Symposium

SP - 351

EP - 353

BT - Proceedings - 2017 IEEE Real-Time Systems Symposium, RTSS 2017

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 5 October 2017 through 8 October 2017

ER -

Work-in-Progress: Maximizing Model Accuracy in Real-time and Iterative Machine Learning

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this