Trav-SHACL: Efficiently validating networks of SHACL constraints

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • MA³nica Figuera
  • Philipp D. Rohde
  • Maria Esther Vidal

Research Organisations

External Research Organisations

  • University of Bonn
  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationThe Web Conference 2021
Subtitle of host publicationProceedings of the World Wide Web Conference, WWW 2021
Pages3337-3348
Number of pages12
ISBN (electronic)9781450383127
Publication statusPublished - 19 Apr 2021
EventWorld Wide Web Conference (WWW 2021) - Ljubljana, Slovenia
Duration: 19 Apr 202123 Apr 2021
Conference number: 30

Abstract

Knowledge graphs have emerged as expressive data structures for Web data. Knowledge graph potential and the demand for ecosystems to facilitate their creation, curation, and understanding, is testified in diverse domains, e.g., biomedicine. The Shapes Constraint Language (SHACL) is the W3C recommendation language for integrity constraints over RDF knowledge graphs. Enabling quality assements of knowledge graphs, SHACL is rapidly gaining attention in real-world scenarios. SHACL models integrity constraints as a network of shapes, where a shape contains the constraints to be fullfiled by the same entities. The validation of a SHACL shape schema can face the issue of tractability during validation. To facilitate full adoption, efficient computational methods are required. We present Trav-SHACL, a SHACL engine capable of planning the traversal and execution of a shape schema in a way that invalid entities are detected early and needless validations are minimized. Trav-SHACL reorders the shapes in a shape schema for efficient validation and rewrites target and constraint queries for fast detection of invalid entities. Trav-SHACL is empirically evaluated on 27 testbeds executed against knowledge graphs of up to 34M triples. Our experimental results suggest that Trav-SHACL exhibits high performance gradually and reduces validation time by a factor of up to 28.93 compared to the state of the art.

Keywords

    Knowledge Graph Constraints, Quality Assessment, SHACL Validation

ASJC Scopus subject areas

Cite this

Trav-SHACL: Efficiently validating networks of SHACL constraints. / Figuera, MA³nica; Rohde, Philipp D.; Vidal, Maria Esther.
The Web Conference 2021: Proceedings of the World Wide Web Conference, WWW 2021. 2021. p. 3337-3348.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Figuera, MA, Rohde, PD & Vidal, ME 2021, Trav-SHACL: Efficiently validating networks of SHACL constraints. in The Web Conference 2021: Proceedings of the World Wide Web Conference, WWW 2021. pp. 3337-3348, World Wide Web Conference (WWW 2021), Ljubljana, Slovenia, 19 Apr 2021. https://doi.org/10.1145/3442381.3449877
Figuera, MA., Rohde, P. D., & Vidal, M. E. (2021). Trav-SHACL: Efficiently validating networks of SHACL constraints. In The Web Conference 2021: Proceedings of the World Wide Web Conference, WWW 2021 (pp. 3337-3348) https://doi.org/10.1145/3442381.3449877
Figuera MA, Rohde PD, Vidal ME. Trav-SHACL: Efficiently validating networks of SHACL constraints. In The Web Conference 2021: Proceedings of the World Wide Web Conference, WWW 2021. 2021. p. 3337-3348 doi: 10.1145/3442381.3449877
Figuera, MA³nica ; Rohde, Philipp D. ; Vidal, Maria Esther. / Trav-SHACL : Efficiently validating networks of SHACL constraints. The Web Conference 2021: Proceedings of the World Wide Web Conference, WWW 2021. 2021. pp. 3337-3348
Download
@inproceedings{89975de91c8443789f522088bbd08539,
title = "Trav-SHACL: Efficiently validating networks of SHACL constraints",
abstract = "Knowledge graphs have emerged as expressive data structures for Web data. Knowledge graph potential and the demand for ecosystems to facilitate their creation, curation, and understanding, is testified in diverse domains, e.g., biomedicine. The Shapes Constraint Language (SHACL) is the W3C recommendation language for integrity constraints over RDF knowledge graphs. Enabling quality assements of knowledge graphs, SHACL is rapidly gaining attention in real-world scenarios. SHACL models integrity constraints as a network of shapes, where a shape contains the constraints to be fullfiled by the same entities. The validation of a SHACL shape schema can face the issue of tractability during validation. To facilitate full adoption, efficient computational methods are required. We present Trav-SHACL, a SHACL engine capable of planning the traversal and execution of a shape schema in a way that invalid entities are detected early and needless validations are minimized. Trav-SHACL reorders the shapes in a shape schema for efficient validation and rewrites target and constraint queries for fast detection of invalid entities. Trav-SHACL is empirically evaluated on 27 testbeds executed against knowledge graphs of up to 34M triples. Our experimental results suggest that Trav-SHACL exhibits high performance gradually and reduces validation time by a factor of up to 28.93 compared to the state of the art. ",
keywords = "Knowledge Graph Constraints, Quality Assessment, SHACL Validation",
author = "MA³nica Figuera and Rohde, {Philipp D.} and Vidal, {Maria Esther}",
note = "Funding Information: This work has been partially supported by the EU H2020 RIA funded projects QualiChain (No 822404) and iASiS (No 727658).; World Wide Web Conference (WWW 2021), WWW 2021 ; Conference date: 19-04-2021 Through 23-04-2021",
year = "2021",
month = apr,
day = "19",
doi = "10.1145/3442381.3449877",
language = "English",
pages = "3337--3348",
booktitle = "The Web Conference 2021",

}

Download

TY - GEN

T1 - Trav-SHACL

T2 - World Wide Web Conference (WWW 2021)

AU - Figuera, MA³nica

AU - Rohde, Philipp D.

AU - Vidal, Maria Esther

N1 - Conference code: 30

PY - 2021/4/19

Y1 - 2021/4/19

N2 - Knowledge graphs have emerged as expressive data structures for Web data. Knowledge graph potential and the demand for ecosystems to facilitate their creation, curation, and understanding, is testified in diverse domains, e.g., biomedicine. The Shapes Constraint Language (SHACL) is the W3C recommendation language for integrity constraints over RDF knowledge graphs. Enabling quality assements of knowledge graphs, SHACL is rapidly gaining attention in real-world scenarios. SHACL models integrity constraints as a network of shapes, where a shape contains the constraints to be fullfiled by the same entities. The validation of a SHACL shape schema can face the issue of tractability during validation. To facilitate full adoption, efficient computational methods are required. We present Trav-SHACL, a SHACL engine capable of planning the traversal and execution of a shape schema in a way that invalid entities are detected early and needless validations are minimized. Trav-SHACL reorders the shapes in a shape schema for efficient validation and rewrites target and constraint queries for fast detection of invalid entities. Trav-SHACL is empirically evaluated on 27 testbeds executed against knowledge graphs of up to 34M triples. Our experimental results suggest that Trav-SHACL exhibits high performance gradually and reduces validation time by a factor of up to 28.93 compared to the state of the art.

AB - Knowledge graphs have emerged as expressive data structures for Web data. Knowledge graph potential and the demand for ecosystems to facilitate their creation, curation, and understanding, is testified in diverse domains, e.g., biomedicine. The Shapes Constraint Language (SHACL) is the W3C recommendation language for integrity constraints over RDF knowledge graphs. Enabling quality assements of knowledge graphs, SHACL is rapidly gaining attention in real-world scenarios. SHACL models integrity constraints as a network of shapes, where a shape contains the constraints to be fullfiled by the same entities. The validation of a SHACL shape schema can face the issue of tractability during validation. To facilitate full adoption, efficient computational methods are required. We present Trav-SHACL, a SHACL engine capable of planning the traversal and execution of a shape schema in a way that invalid entities are detected early and needless validations are minimized. Trav-SHACL reorders the shapes in a shape schema for efficient validation and rewrites target and constraint queries for fast detection of invalid entities. Trav-SHACL is empirically evaluated on 27 testbeds executed against knowledge graphs of up to 34M triples. Our experimental results suggest that Trav-SHACL exhibits high performance gradually and reduces validation time by a factor of up to 28.93 compared to the state of the art.

KW - Knowledge Graph Constraints

KW - Quality Assessment

KW - SHACL Validation

UR - http://www.scopus.com/inward/record.url?scp=85107646461&partnerID=8YFLogxK

U2 - 10.1145/3442381.3449877

DO - 10.1145/3442381.3449877

M3 - Conference contribution

AN - SCOPUS:85107646461

SP - 3337

EP - 3348

BT - The Web Conference 2021

Y2 - 19 April 2021 through 23 April 2021

ER -