SHACL constraint validation during SPARQL query processing

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autorschaft

  • Philipp D. Rohde
  • Maria Esther Vidal

Organisationseinheiten

Externe Organisationen

  • Technische Informationsbibliothek (TIB) Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksVLDB 2021 PhD Workshop
UntertitelProceedings of the VLDB 2021 PhD Workshop co-located with the 47th International Conference on Very Large Databases (VLDB 2021)
Seitenumfang4
PublikationsstatusVeröffentlicht - 2021
Veranstaltung2021 International Conference on Very Large Databases PhD Workshop, VLDB-PhD 2021 - Copenhagen, Dänemark
Dauer: 16 Aug. 2021 → …

Publikationsreihe

NameCEUR Workshop Proceedings
Herausgeber (Verlag)CEUR Workshop Proceedings
Band2971
ISSN (Print)1613-0073

Abstract

The importance of knowledge graphs is increasing. Due to their application in more and more real-world use-cases the data quality issue has to be addressed. The Shapes Constraint Language (SHACL) is the W3C recommendation language for defining integrity constraints over knowledge graphs expressed in the Resource Description Framework (RDF). Annotating SPARQL query results with metadata from the SHACL validation provides a better understanding of the knowledge graph and its data quality. We propose a query engine that is able to efficiently evaluate which instances in the knowledge graph fulfill the requirements from the SHACL shape schema and annotate the SPARQL query result with this metadata. Hence, adding the dimension of explainability to SPARQL query processing. Our preliminary analysis shows that the proposed optimizations performed for SHACL validation during SPARQL query processing increase the performance compared to a naive approach. However, in some queries the naive approach outperforms the optimizations. This shows that more work needs to be done in this topic to fully comprehend all impacting factors and to identify the amount of overhead added to the query execution.

ASJC Scopus Sachgebiete

Zitieren

SHACL constraint validation during SPARQL query processing. / Rohde, Philipp D.; Vidal, Maria Esther.
VLDB 2021 PhD Workshop: Proceedings of the VLDB 2021 PhD Workshop co-located with the 47th International Conference on Very Large Databases (VLDB 2021). 2021. (CEUR Workshop Proceedings; Band 2971).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Rohde, PD & Vidal, ME 2021, SHACL constraint validation during SPARQL query processing. in VLDB 2021 PhD Workshop: Proceedings of the VLDB 2021 PhD Workshop co-located with the 47th International Conference on Very Large Databases (VLDB 2021). CEUR Workshop Proceedings, Bd. 2971, 2021 International Conference on Very Large Databases PhD Workshop, VLDB-PhD 2021, Copenhagen, Dänemark, 16 Aug. 2021. <https://ceur-ws.org/Vol-2971/paper05.pdf>
Rohde, P. D., & Vidal, M. E. (2021). SHACL constraint validation during SPARQL query processing. In VLDB 2021 PhD Workshop: Proceedings of the VLDB 2021 PhD Workshop co-located with the 47th International Conference on Very Large Databases (VLDB 2021) (CEUR Workshop Proceedings; Band 2971). https://ceur-ws.org/Vol-2971/paper05.pdf
Rohde PD, Vidal ME. SHACL constraint validation during SPARQL query processing. in VLDB 2021 PhD Workshop: Proceedings of the VLDB 2021 PhD Workshop co-located with the 47th International Conference on Very Large Databases (VLDB 2021). 2021. (CEUR Workshop Proceedings).
Rohde, Philipp D. ; Vidal, Maria Esther. / SHACL constraint validation during SPARQL query processing. VLDB 2021 PhD Workshop: Proceedings of the VLDB 2021 PhD Workshop co-located with the 47th International Conference on Very Large Databases (VLDB 2021). 2021. (CEUR Workshop Proceedings).
Download
@inproceedings{655ae4deee75429290e2ba505a87f110,
title = "SHACL constraint validation during SPARQL query processing",
abstract = "The importance of knowledge graphs is increasing. Due to their application in more and more real-world use-cases the data quality issue has to be addressed. The Shapes Constraint Language (SHACL) is the W3C recommendation language for defining integrity constraints over knowledge graphs expressed in the Resource Description Framework (RDF). Annotating SPARQL query results with metadata from the SHACL validation provides a better understanding of the knowledge graph and its data quality. We propose a query engine that is able to efficiently evaluate which instances in the knowledge graph fulfill the requirements from the SHACL shape schema and annotate the SPARQL query result with this metadata. Hence, adding the dimension of explainability to SPARQL query processing. Our preliminary analysis shows that the proposed optimizations performed for SHACL validation during SPARQL query processing increase the performance compared to a naive approach. However, in some queries the naive approach outperforms the optimizations. This shows that more work needs to be done in this topic to fully comprehend all impacting factors and to identify the amount of overhead added to the query execution.",
author = "Rohde, {Philipp D.} and Vidal, {Maria Esther}",
note = "Funding Information: This work has been partially supported by the EU H2020 RIA funded projects QualiChain (No 822404) and CLARIFY (No 875160), and the ERAMed project P4-LUCAT (No 53000015). ; 2021 International Conference on Very Large Databases PhD Workshop, VLDB-PhD 2021 ; Conference date: 16-08-2021",
year = "2021",
language = "English",
series = "CEUR Workshop Proceedings",
publisher = "CEUR Workshop Proceedings",
booktitle = "VLDB 2021 PhD Workshop",

}

Download

TY - GEN

T1 - SHACL constraint validation during SPARQL query processing

AU - Rohde, Philipp D.

AU - Vidal, Maria Esther

N1 - Funding Information: This work has been partially supported by the EU H2020 RIA funded projects QualiChain (No 822404) and CLARIFY (No 875160), and the ERAMed project P4-LUCAT (No 53000015).

PY - 2021

Y1 - 2021

N2 - The importance of knowledge graphs is increasing. Due to their application in more and more real-world use-cases the data quality issue has to be addressed. The Shapes Constraint Language (SHACL) is the W3C recommendation language for defining integrity constraints over knowledge graphs expressed in the Resource Description Framework (RDF). Annotating SPARQL query results with metadata from the SHACL validation provides a better understanding of the knowledge graph and its data quality. We propose a query engine that is able to efficiently evaluate which instances in the knowledge graph fulfill the requirements from the SHACL shape schema and annotate the SPARQL query result with this metadata. Hence, adding the dimension of explainability to SPARQL query processing. Our preliminary analysis shows that the proposed optimizations performed for SHACL validation during SPARQL query processing increase the performance compared to a naive approach. However, in some queries the naive approach outperforms the optimizations. This shows that more work needs to be done in this topic to fully comprehend all impacting factors and to identify the amount of overhead added to the query execution.

AB - The importance of knowledge graphs is increasing. Due to their application in more and more real-world use-cases the data quality issue has to be addressed. The Shapes Constraint Language (SHACL) is the W3C recommendation language for defining integrity constraints over knowledge graphs expressed in the Resource Description Framework (RDF). Annotating SPARQL query results with metadata from the SHACL validation provides a better understanding of the knowledge graph and its data quality. We propose a query engine that is able to efficiently evaluate which instances in the knowledge graph fulfill the requirements from the SHACL shape schema and annotate the SPARQL query result with this metadata. Hence, adding the dimension of explainability to SPARQL query processing. Our preliminary analysis shows that the proposed optimizations performed for SHACL validation during SPARQL query processing increase the performance compared to a naive approach. However, in some queries the naive approach outperforms the optimizations. This shows that more work needs to be done in this topic to fully comprehend all impacting factors and to identify the amount of overhead added to the query execution.

UR - http://www.scopus.com/inward/record.url?scp=85117054341&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85117054341

T3 - CEUR Workshop Proceedings

BT - VLDB 2021 PhD Workshop

T2 - 2021 International Conference on Very Large Databases PhD Workshop, VLDB-PhD 2021

Y2 - 16 August 2021

ER -