DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Autorschaft

  • Farah Karim
  • Ioanna Lytra
  • Christian Mader
  • Sören Auer
  • Maria Esther Vidal

Externe Organisationen

  • Rheinische Friedrich-Wilhelms-Universität Bonn
  • Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme (IAIS)
  • Technische Informationsbibliothek (TIB) Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Seiten (von - bis)373-397
Seitenumfang25
FachzeitschriftInternational Journal of Semantic Computing
Jahrgang12
Ausgabenummer3
PublikationsstatusVeröffentlicht - Sept. 2018

Abstract

The Internet of Things (IoT) has been rapidly adopted in many domains ranging from household appliances e.g. ventilation, lighting, and heating, to industrial manufacturing and transport networks. Despite the, enormous benefits of optimization, monitoring, and maintenance rendered by IoT devices, an ample amount of data is generated continuously. Semantically describing IoT generated data using ontologies enables a precise interpretation of this data. However, ontology-based descriptions tremendously increase the size of IoT data and in presence of repeated sensor measurements, a large amount of the data are duplicates that do not contribute to new insights during query processing or IoT data analytics. In order to ensure that only required ontology-based descriptions are generated, we devise a knowledge-driven approach named DESERT that is able to on-D––emand factorizE–– and S––emantically E––nrich stR––eam daT––a. DESERT resorts to a knowledge graph to describe IoT stream data; it utilizes only the data that is required to answer an input continuous SPARQL query and applies a novel method of data factorization to reduce duplicated measurements in the knowledge graph. The performance of DESERT is empirically studied on a collection of continuous SPARQL queries from SRBench, a benchmark of IoT stream data and continuous SPARQL queries. Furthermore, data streams with various combinations of uniform and varying data stream speeds and streaming window size dimensions are considered in the study. Experimental results suggest that DESERT is capable of speeding up continuous query processing while creates knowledge graphs that include no replications.

ASJC Scopus Sachgebiete

Zitieren

DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering. / Karim, Farah; Lytra, Ioanna; Mader, Christian et al.
in: International Journal of Semantic Computing, Jahrgang 12, Nr. 3, 09.2018, S. 373-397.

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Karim, F, Lytra, I, Mader, C, Auer, S & Vidal, ME 2018, 'DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering', International Journal of Semantic Computing, Jg. 12, Nr. 3, S. 373-397. https://doi.org/10.1142/S1793351X18400172
Karim, F., Lytra, I., Mader, C., Auer, S., & Vidal, M. E. (2018). DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering. International Journal of Semantic Computing, 12(3), 373-397. https://doi.org/10.1142/S1793351X18400172
Karim F, Lytra I, Mader C, Auer S, Vidal ME. DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering. International Journal of Semantic Computing. 2018 Sep;12(3):373-397. doi: 10.1142/S1793351X18400172
Karim, Farah ; Lytra, Ioanna ; Mader, Christian et al. / DESERT : A Continuous SPARQL Query Engine for On-Demand Query Answering. in: International Journal of Semantic Computing. 2018 ; Jahrgang 12, Nr. 3. S. 373-397.
Download
@article{4fbf287430a94032acf2f4f8903b2f10,
title = "DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering",
abstract = "The Internet of Things (IoT) has been rapidly adopted in many domains ranging from household appliances e.g. ventilation, lighting, and heating, to industrial manufacturing and transport networks. Despite the, enormous benefits of optimization, monitoring, and maintenance rendered by IoT devices, an ample amount of data is generated continuously. Semantically describing IoT generated data using ontologies enables a precise interpretation of this data. However, ontology-based descriptions tremendously increase the size of IoT data and in presence of repeated sensor measurements, a large amount of the data are duplicates that do not contribute to new insights during query processing or IoT data analytics. In order to ensure that only required ontology-based descriptions are generated, we devise a knowledge-driven approach named DESERT that is able to on-D––emand factorizE–– and S––emantically E––nrich stR––eam daT––a. DESERT resorts to a knowledge graph to describe IoT stream data; it utilizes only the data that is required to answer an input continuous SPARQL query and applies a novel method of data factorization to reduce duplicated measurements in the knowledge graph. The performance of DESERT is empirically studied on a collection of continuous SPARQL queries from SRBench, a benchmark of IoT stream data and continuous SPARQL queries. Furthermore, data streams with various combinations of uniform and varying data stream speeds and streaming window size dimensions are considered in the study. Experimental results suggest that DESERT is capable of speeding up continuous query processing while creates knowledge graphs that include no replications.",
keywords = "continuous SPARQL query, Internet of things, semantic enrichment, stream data",
author = "Farah Karim and Ioanna Lytra and Christian Mader and S{\"o}ren Auer and Vidal, {Maria Esther}",
note = "(c) 2018 World Scientific Publishing Company",
year = "2018",
month = sep,
doi = "10.1142/S1793351X18400172",
language = "English",
volume = "12",
pages = "373--397",
number = "3",

}

Download

TY - JOUR

T1 - DESERT

T2 - A Continuous SPARQL Query Engine for On-Demand Query Answering

AU - Karim, Farah

AU - Lytra, Ioanna

AU - Mader, Christian

AU - Auer, Sören

AU - Vidal, Maria Esther

N1 - (c) 2018 World Scientific Publishing Company

PY - 2018/9

Y1 - 2018/9

N2 - The Internet of Things (IoT) has been rapidly adopted in many domains ranging from household appliances e.g. ventilation, lighting, and heating, to industrial manufacturing and transport networks. Despite the, enormous benefits of optimization, monitoring, and maintenance rendered by IoT devices, an ample amount of data is generated continuously. Semantically describing IoT generated data using ontologies enables a precise interpretation of this data. However, ontology-based descriptions tremendously increase the size of IoT data and in presence of repeated sensor measurements, a large amount of the data are duplicates that do not contribute to new insights during query processing or IoT data analytics. In order to ensure that only required ontology-based descriptions are generated, we devise a knowledge-driven approach named DESERT that is able to on-D––emand factorizE–– and S––emantically E––nrich stR––eam daT––a. DESERT resorts to a knowledge graph to describe IoT stream data; it utilizes only the data that is required to answer an input continuous SPARQL query and applies a novel method of data factorization to reduce duplicated measurements in the knowledge graph. The performance of DESERT is empirically studied on a collection of continuous SPARQL queries from SRBench, a benchmark of IoT stream data and continuous SPARQL queries. Furthermore, data streams with various combinations of uniform and varying data stream speeds and streaming window size dimensions are considered in the study. Experimental results suggest that DESERT is capable of speeding up continuous query processing while creates knowledge graphs that include no replications.

AB - The Internet of Things (IoT) has been rapidly adopted in many domains ranging from household appliances e.g. ventilation, lighting, and heating, to industrial manufacturing and transport networks. Despite the, enormous benefits of optimization, monitoring, and maintenance rendered by IoT devices, an ample amount of data is generated continuously. Semantically describing IoT generated data using ontologies enables a precise interpretation of this data. However, ontology-based descriptions tremendously increase the size of IoT data and in presence of repeated sensor measurements, a large amount of the data are duplicates that do not contribute to new insights during query processing or IoT data analytics. In order to ensure that only required ontology-based descriptions are generated, we devise a knowledge-driven approach named DESERT that is able to on-D––emand factorizE–– and S––emantically E––nrich stR––eam daT––a. DESERT resorts to a knowledge graph to describe IoT stream data; it utilizes only the data that is required to answer an input continuous SPARQL query and applies a novel method of data factorization to reduce duplicated measurements in the knowledge graph. The performance of DESERT is empirically studied on a collection of continuous SPARQL queries from SRBench, a benchmark of IoT stream data and continuous SPARQL queries. Furthermore, data streams with various combinations of uniform and varying data stream speeds and streaming window size dimensions are considered in the study. Experimental results suggest that DESERT is capable of speeding up continuous query processing while creates knowledge graphs that include no replications.

KW - continuous SPARQL query

KW - Internet of things

KW - semantic enrichment

KW - stream data

U2 - 10.1142/S1793351X18400172

DO - 10.1142/S1793351X18400172

M3 - Article

AN - SCOPUS:85053696463

VL - 12

SP - 373

EP - 397

JO - International Journal of Semantic Computing

JF - International Journal of Semantic Computing

SN - 1793-351X

IS - 3

ER -

Von denselben Autoren