TY - GEN
T1 - An enforcement of real time scheduling in Spark Streaming
AU - Liao, Xinyi
AU - Gao, Zhiwei
AU - Ji, Weixing
AU - Wang, Yizhuo
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/1/26
Y1 - 2016/1/26
N2 - With the exponential growth in continuous data streams, real time streaming processing has been gaining a lot of popularity. Spark Streaming is one of the open source frameworks for reliable, high-throughput and low latency stream processing. Though it is a near real time stream processing framework running on commodity hardware, real time event processing is not guaranteed in its scheduling system. Profiling results indicate that the total delay time of events with unstable inputs is more volatile and presents big fluctuations. In this paper, we propose a simple, yet effective scheduling strategy to reduce the worst case event processing time by dynamic adjusting the time window of batch intervals. It is a real time enhancement to Spark Streaming based on Spark's framework. The proposed strategy is evaluated using two streaming benchmarks and our preliminary results demonstrate the feasibility of our approach with unstable event streams.
AB - With the exponential growth in continuous data streams, real time streaming processing has been gaining a lot of popularity. Spark Streaming is one of the open source frameworks for reliable, high-throughput and low latency stream processing. Though it is a near real time stream processing framework running on commodity hardware, real time event processing is not guaranteed in its scheduling system. Profiling results indicate that the total delay time of events with unstable inputs is more volatile and presents big fluctuations. In this paper, we propose a simple, yet effective scheduling strategy to reduce the worst case event processing time by dynamic adjusting the time window of batch intervals. It is a real time enhancement to Spark Streaming based on Spark's framework. The proposed strategy is evaluated using two streaming benchmarks and our preliminary results demonstrate the feasibility of our approach with unstable event streams.
KW - Spark Streaming
KW - big data
KW - real time scheduling
KW - streaming processing
UR - http://www.scopus.com/inward/record.url?scp=84962859057&partnerID=8YFLogxK
U2 - 10.1109/IGCC.2015.7393730
DO - 10.1109/IGCC.2015.7393730
M3 - Conference contribution
AN - SCOPUS:84962859057
T3 - 2015 6th International Green and Sustainable Computing Conference
BT - 2015 6th International Green and Sustainable Computing Conference
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th International Green and Sustainable Computing Conference, IGSC 2015
Y2 - 14 December 2015 through 16 December 2015
ER -