A compilation and run-time framework for maximizing performance of self-scheduling algorithms

Yizhuo Wang, Laleh Aghababaie Beni, Alexandru Nicolau, Alexander V. Veidenbaum, Rosario Cammarota

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Ordinary programs contain many parallel loops which account for a significant portion of these programs' completion time. The parallel executions of such loops can significantly speedup performance of modern multi-core systems. We propose a new framework - Locality Aware Self-scheduling (LASS) - for scheduling parallel loops to multi-core systems and boost up performance of known self-scheduling algorithms in diverse execution conditions. LASS enforces data locality, by forcing the execution of consecutive chunks of iterations to the same core, and favours load balancing with the introduction of a work-stealing mechanism. LASS is evaluated on a set of kernels on a multi-core system with 16 cores. Two execution scenarios are considered. In the first scenario our application runs alone on top of the operating system. In the second scenario our application runs in conjunction with an interfering parallel job. The average speedup achieved by LASS for first execution scenario is 11% and for the second one is 31%.

Original languageEnglish
Title of host publicationNetwork and Parallel Computing - 11th IFIP WG 10.3 International Conference, NPC 2014, Proceedings
PublisherSpringer Verlag
Pages459-470
Number of pages12
ISBN (Print)9783662449165
DOIs
Publication statusPublished - 2014
Event11th IFIP WG 10.3 International Conference on Network and Parallel Computing, NPC 2014 - Ilan, Taiwan, Province of China
Duration: 18 Sept 201420 Sept 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8707 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th IFIP WG 10.3 International Conference on Network and Parallel Computing, NPC 2014
Country/TerritoryTaiwan, Province of China
CityIlan
Period18/09/1420/09/14

Keywords

  • loop scheduling
  • random forest
  • self-scheduling

Fingerprint

Dive into the research topics of 'A compilation and run-time framework for maximizing performance of self-scheduling algorithms'. Together they form a unique fingerprint.

Cite this