DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering

Farah Karim; Ioanna Lytra; Christian Mader; Sören Auer; Maria Esther Vidal

doi:10.1142/S1793351X18400172

Details

Original language	English
Pages (from - to)	373-397
Number of pages	25
Journal	International Journal of Semantic Computing
Volume	12
Issue number	3
Publication status	Published - Sept. 2018

Abstract

The Internet of Things (IoT) has been rapidly adopted in many domains, ranging from household appliances (e.g., ventilation, lighting, and heating) to industrial manufacturing and transport networks. Despite the enormous benefits of optimization, monitoring, and maintenance rendered by IoT devices, these devices continuously generate an ample amount of data. Semantically describing IoT-generated data using ontologies enables a precise interpretation of this data. However, ontology-based descriptions tremendously increase the size of IoT data, and in the presence of repeated sensor measurements, a large amount of the data are duplicates that do not contribute new insights during query processing or IoT data analytics. In order to ensure that only required ontology-based descriptions are generated, we devise a knowledge-driven approach named DESERT that is able to on-Demand factorizE and Semantically Enrich stReam daTa. DESERT resorts to a knowledge graph to describe IoT stream data; it utilizes only the data that is required to answer an input continuous SPARQL query and applies a novel method of data factorization to reduce duplicated measurements in the knowledge graph. The performance of DESERT is empirically studied on a collection of continuous SPARQL queries from SRBench, a benchmark of IoT stream data and continuous SPARQL queries. Furthermore, data streams with various combinations of uniform and varying data stream speeds and streaming window size dimensions are considered in the study. Experimental results suggest that DESERT is capable of speeding up continuous query processing while creating knowledge graphs that include no replications.
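
The factorization idea in the abstract can be illustrated with a minimal Python sketch. This is not the DESERT implementation: the `Observation` type, the `factorize` helper, the node identifiers, and the sample window are all hypothetical, chosen only to show how repeated measurements might be described once and then referenced by the observations that share them.

```python
from typing import NamedTuple


class Observation(NamedTuple):
    """Hypothetical stream record; field names are assumptions."""
    sensor: str
    timestamp: int
    value: float
    unit: str


def factorize(observations):
    """Describe each distinct (value, unit) once; observations point to it."""
    measurements = {}  # (value, unit) -> measurement node id
    links = []         # (sensor, timestamp, measurement node id)
    for obs in observations:
        key = (obs.value, obs.unit)
        if key not in measurements:
            # First time this measurement is seen: create one shared node.
            measurements[key] = f"m{len(measurements)}"
        links.append((obs.sensor, obs.timestamp, measurements[key]))
    return measurements, links


# Illustrative window of four observations, three of which repeat a value.
window = [
    Observation("s1", 1, 21.5, "C"),
    Observation("s1", 2, 21.5, "C"),
    Observation("s2", 2, 21.5, "C"),
    Observation("s2", 3, 22.0, "C"),
]
measurements, links = factorize(window)
print(len(window), "observations ->", len(measurements), "measurement descriptions")
```

In this sketch the four observations share two measurement descriptions, so the semantic description of a repeated value is stored once rather than once per observation.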

ASJC Scopus subject areas

Computer Science (all)
Software
Computer Science (all)
Information Systems
Social Sciences (all)
Linguistics and Language
Computer Science (all)
Computer Science Applications
Computer Science (all)
Computer Networks and Communications
Computer Science (all)
Artificial Intelligence

Cite this

DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering. / Karim, Farah; Lytra, Ioanna; Mader, Christian et al.
In: International Journal of Semantic Computing, Vol. 12, No. 3, 09.2018, p. 373-397.

Research output: Contribution to journal › Article › Research › Peer review

Karim, F, Lytra, I, Mader, C, Auer, S & Vidal, ME 2018, 'DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering', International Journal of Semantic Computing, vol. 12, no. 3, pp. 373-397. https://doi.org/10.1142/S1793351X18400172

Karim, F., Lytra, I., Mader, C., Auer, S., & Vidal, M. E. (2018). DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering. International Journal of Semantic Computing, 12(3), 373-397. https://doi.org/10.1142/S1793351X18400172

Karim F, Lytra I, Mader C, Auer S, Vidal ME. DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering. International Journal of Semantic Computing. 2018 Sep;12(3):373-397. doi: 10.1142/S1793351X18400172

Karim, Farah ; Lytra, Ioanna ; Mader, Christian et al. / DESERT : A Continuous SPARQL Query Engine for On-Demand Query Answering. In: International Journal of Semantic Computing. 2018 ; Vol. 12, No. 3. pp. 373-397.


@article{4fbf287430a94032acf2f4f8903b2f10,
title = "DESERT: A Continuous SPARQL Query Engine for On-Demand Query Answering",
abstract = "The Internet of Things (IoT) has been rapidly adopted in many domains, ranging from household appliances (e.g., ventilation, lighting, and heating) to industrial manufacturing and transport networks. Despite the enormous benefits of optimization, monitoring, and maintenance rendered by IoT devices, these devices continuously generate an ample amount of data. Semantically describing IoT-generated data using ontologies enables a precise interpretation of this data. However, ontology-based descriptions tremendously increase the size of IoT data, and in the presence of repeated sensor measurements, a large amount of the data are duplicates that do not contribute new insights during query processing or IoT data analytics. In order to ensure that only required ontology-based descriptions are generated, we devise a knowledge-driven approach named DESERT that is able to on-Demand factorizE and Semantically Enrich stReam daTa. DESERT resorts to a knowledge graph to describe IoT stream data; it utilizes only the data that is required to answer an input continuous SPARQL query and applies a novel method of data factorization to reduce duplicated measurements in the knowledge graph. The performance of DESERT is empirically studied on a collection of continuous SPARQL queries from SRBench, a benchmark of IoT stream data and continuous SPARQL queries. Furthermore, data streams with various combinations of uniform and varying data stream speeds and streaming window size dimensions are considered in the study. Experimental results suggest that DESERT is capable of speeding up continuous query processing while creating knowledge graphs that include no replications.",
keywords = "continuous SPARQL query, Internet of things, semantic enrichment, stream data",
author = "Farah Karim and Ioanna Lytra and Christian Mader and S{\"o}ren Auer and Vidal, {Maria Esther}",
year = "2018",
month = sep,
doi = "10.1142/S1793351X18400172",
language = "English",
journal = "International Journal of Semantic Computing",
issn = "1793-351X",
volume = "12",
pages = "373--397",
number = "3",
}


TY  - JOUR
T1  - DESERT
T2  - A Continuous SPARQL Query Engine for On-Demand Query Answering
AU  - Karim, Farah
AU  - Lytra, Ioanna
AU  - Mader, Christian
AU  - Auer, Sören
AU  - Vidal, Maria Esther
PY  - 2018/9
Y1  - 2018/9
AB  - The Internet of Things (IoT) has been rapidly adopted in many domains, ranging from household appliances (e.g., ventilation, lighting, and heating) to industrial manufacturing and transport networks. Despite the enormous benefits of optimization, monitoring, and maintenance rendered by IoT devices, these devices continuously generate an ample amount of data. Semantically describing IoT-generated data using ontologies enables a precise interpretation of this data. However, ontology-based descriptions tremendously increase the size of IoT data, and in the presence of repeated sensor measurements, a large amount of the data are duplicates that do not contribute new insights during query processing or IoT data analytics. In order to ensure that only required ontology-based descriptions are generated, we devise a knowledge-driven approach named DESERT that is able to on-Demand factorizE and Semantically Enrich stReam daTa. DESERT resorts to a knowledge graph to describe IoT stream data; it utilizes only the data that is required to answer an input continuous SPARQL query and applies a novel method of data factorization to reduce duplicated measurements in the knowledge graph. The performance of DESERT is empirically studied on a collection of continuous SPARQL queries from SRBench, a benchmark of IoT stream data and continuous SPARQL queries. Furthermore, data streams with various combinations of uniform and varying data stream speeds and streaming window size dimensions are considered in the study. Experimental results suggest that DESERT is capable of speeding up continuous query processing while creating knowledge graphs that include no replications.
KW  - continuous SPARQL query
KW  - Internet of things
KW  - semantic enrichment
KW  - stream data
U2  - 10.1142/S1793351X18400172
DO  - 10.1142/S1793351X18400172
M3  - Article
AN  - SCOPUS:85053696463
VL  - 12
SP  - 373
EP  - 397
JO  - International Journal of Semantic Computing
JF  - International Journal of Semantic Computing
SN  - 1793-351X
IS  - 3
ER  -

