Highly Parallel SPARQL Engine for RDF

Fan Feng, Weikang Zhou, Ding Zhang, Jinhui Pang*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

In this paper, a highly parallel batch processing engine is designed for SPARQL queries. Machine learning algorithms were applied to make time predictions of queries and reasonably group them, and further make reasonable estimates of the memory footprint of the queries to arrange the order of each group of queries. Finally, the query is processed in parallel by introducing pthreads. Based on the above three points, a spall time prediction algorithm was proposed, including data processing, to better deal with batch SPARQL queries, and the introduction of pthread can make our query processing faster. Since data processing was added to query time prediction, the method can be implemented in any set of data-queries. Experiments show that the engine can optimize time and maximize the use of memory when processing batch SPARQL queries.

Original languageEnglish
Title of host publicationData Science - 6th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2020, Proceedings
EditorsJianchao Zeng, Weipeng Jing, Xianhua Song, Zeguang Lu
PublisherSpringer
Pages61-71
Number of pages11
ISBN (Print)9789811579806
DOIs
Publication statusPublished - 2020
Event6th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2020 - Taiyuan, China
Duration: 18 Sept 202021 Sept 2020

Publication series

NameCommunications in Computer and Information Science
Volume1257 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference6th International Conference of Pioneering Computer Scientists, Engineers and Educators, ICPCSEE 2020
Country/TerritoryChina
CityTaiyuan
Period18/09/2021/09/20

Keywords

  • Multithreading
  • Performance prediction
  • Pthread
  • SPARQL

Fingerprint

Dive into the research topics of 'Highly Parallel SPARQL Engine for RDF'. Together they form a unique fingerprint.

Cite this