KnowlyBERT: Hybrid Query Answering over Language Models and Knowledge Graphs

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

External Research Organisations

  • Technische Universität Braunschweig
View graph of relations

Details

Original languageEnglish
Title of host publicationThe Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings
EditorsJeff Z. Pan, Valentina Tamma, Claudia d’Amato, Krzysztof Janowicz, Bo Fu, Axel Polleres, Oshani Seneviratne, Lalana Kagal
Pages294-310
Number of pages17
Publication statusPublished - 2020
Externally publishedYes

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12506 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Providing a plethora of entity-centric information, Knowledge Graphs have become a vital building block for a variety of intelligent applications. Indeed, modern knowledge graphs like Wikidata already capture several billions of RDF triples, yet they still lack a good coverage for most relations. On the other hand, recent developments in NLP research show that neural language models can easily be queried for relational knowledge without requiring massive amounts of training data. In this work, we leverage this idea by creating a hybrid query answering system on top of knowledge graphs in combination with the masked language model BERT to complete query results. We thus incorporate valuable structural and semantic information from knowledge graphs with textual knowledge from language models to achieve high precision query results. Standard techniques for dealing with incomplete knowledge graphs are either (1) relation extraction which requires massive amounts of training data or (2) knowledge graph embeddings which have problems to succeed beyond simple baseline datasets. Our hybrid system KnowlyBERT requires only small amounts of training data, while outperforming state-of-the-art techniques by boosting their precision by over 30% in our large Wikidata experiment.

Keywords

    Knowledge graphs, Language models, Query answering

ASJC Scopus subject areas

Cite this

KnowlyBERT: Hybrid Query Answering over Language Models and Knowledge Graphs. / Kalo, Jan-Christoph; Fichtel, Leandra; Ehler, Philipp et al.
The Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings. ed. / Jeff Z. Pan; Valentina Tamma; Claudia d’Amato; Krzysztof Janowicz; Bo Fu; Axel Polleres; Oshani Seneviratne; Lalana Kagal. 2020. p. 294-310 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12506 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Kalo, J-C, Fichtel, L, Ehler, P & Balke, W-T 2020, KnowlyBERT: Hybrid Query Answering over Language Models and Knowledge Graphs. in JZ Pan, V Tamma, C d’Amato, K Janowicz, B Fu, A Polleres, O Seneviratne & L Kagal (eds), The Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12506 LNCS, pp. 294-310. https://doi.org/10.1007/978-3-030-62419-4_17
Kalo, J.-C., Fichtel, L., Ehler, P., & Balke, W.-T. (2020). KnowlyBERT: Hybrid Query Answering over Language Models and Knowledge Graphs. In J. Z. Pan, V. Tamma, C. d’Amato, K. Janowicz, B. Fu, A. Polleres, O. Seneviratne, & L. Kagal (Eds.), The Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings (pp. 294-310). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12506 LNCS). https://doi.org/10.1007/978-3-030-62419-4_17
Kalo JC, Fichtel L, Ehler P, Balke WT. KnowlyBERT: Hybrid Query Answering over Language Models and Knowledge Graphs. In Pan JZ, Tamma V, d’Amato C, Janowicz K, Fu B, Polleres A, Seneviratne O, Kagal L, editors, The Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings. 2020. p. 294-310. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-62419-4_17
Kalo, Jan-Christoph ; Fichtel, Leandra ; Ehler, Philipp et al. / KnowlyBERT : Hybrid Query Answering over Language Models and Knowledge Graphs. The Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings. editor / Jeff Z. Pan ; Valentina Tamma ; Claudia d’Amato ; Krzysztof Janowicz ; Bo Fu ; Axel Polleres ; Oshani Seneviratne ; Lalana Kagal. 2020. pp. 294-310 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{63f125c5d0dc49059329012f89162838,
title = "KnowlyBERT: Hybrid Query Answering over Language Models and Knowledge Graphs",
abstract = "Providing a plethora of entity-centric information, Knowledge Graphs have become a vital building block for a variety of intelligent applications. Indeed, modern knowledge graphs like Wikidata already capture several billions of RDF triples, yet they still lack a good coverage for most relations. On the other hand, recent developments in NLP research show that neural language models can easily be queried for relational knowledge without requiring massive amounts of training data. In this work, we leverage this idea by creating a hybrid query answering system on top of knowledge graphs in combination with the masked language model BERT to complete query results. We thus incorporate valuable structural and semantic information from knowledge graphs with textual knowledge from language models to achieve high precision query results. Standard techniques for dealing with incomplete knowledge graphs are either (1) relation extraction which requires massive amounts of training data or (2) knowledge graph embeddings which have problems to succeed beyond simple baseline datasets. Our hybrid system KnowlyBERT requires only small amounts of training data, while outperforming state-of-the-art techniques by boosting their precision by over 30% in our large Wikidata experiment.",
keywords = "Knowledge graphs, Language models, Query answering",
author = "Jan-Christoph Kalo and Leandra Fichtel and Philipp Ehler and Wolf-Tilo Balke",
year = "2020",
doi = "10.1007/978-3-030-62419-4_17",
language = "English",
isbn = "9783030624187",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "294--310",
editor = "Pan, {Jeff Z.} and Valentina Tamma and Claudia d{\textquoteright}Amato and Krzysztof Janowicz and Bo Fu and Axel Polleres and Oshani Seneviratne and Lalana Kagal",
booktitle = "The Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings",

}

Download

TY - GEN

T1 - KnowlyBERT

T2 - Hybrid Query Answering over Language Models and Knowledge Graphs

AU - Kalo, Jan-Christoph

AU - Fichtel, Leandra

AU - Ehler, Philipp

AU - Balke, Wolf-Tilo

PY - 2020

Y1 - 2020

N2 - Providing a plethora of entity-centric information, Knowledge Graphs have become a vital building block for a variety of intelligent applications. Indeed, modern knowledge graphs like Wikidata already capture several billions of RDF triples, yet they still lack a good coverage for most relations. On the other hand, recent developments in NLP research show that neural language models can easily be queried for relational knowledge without requiring massive amounts of training data. In this work, we leverage this idea by creating a hybrid query answering system on top of knowledge graphs in combination with the masked language model BERT to complete query results. We thus incorporate valuable structural and semantic information from knowledge graphs with textual knowledge from language models to achieve high precision query results. Standard techniques for dealing with incomplete knowledge graphs are either (1) relation extraction which requires massive amounts of training data or (2) knowledge graph embeddings which have problems to succeed beyond simple baseline datasets. Our hybrid system KnowlyBERT requires only small amounts of training data, while outperforming state-of-the-art techniques by boosting their precision by over 30% in our large Wikidata experiment.

AB - Providing a plethora of entity-centric information, Knowledge Graphs have become a vital building block for a variety of intelligent applications. Indeed, modern knowledge graphs like Wikidata already capture several billions of RDF triples, yet they still lack a good coverage for most relations. On the other hand, recent developments in NLP research show that neural language models can easily be queried for relational knowledge without requiring massive amounts of training data. In this work, we leverage this idea by creating a hybrid query answering system on top of knowledge graphs in combination with the masked language model BERT to complete query results. We thus incorporate valuable structural and semantic information from knowledge graphs with textual knowledge from language models to achieve high precision query results. Standard techniques for dealing with incomplete knowledge graphs are either (1) relation extraction which requires massive amounts of training data or (2) knowledge graph embeddings which have problems to succeed beyond simple baseline datasets. Our hybrid system KnowlyBERT requires only small amounts of training data, while outperforming state-of-the-art techniques by boosting their precision by over 30% in our large Wikidata experiment.

KW - Knowledge graphs

KW - Language models

KW - Query answering

UR - http://www.scopus.com/inward/record.url?scp=85096523582&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-62419-4_17

DO - 10.1007/978-3-030-62419-4_17

M3 - Conference contribution

SN - 9783030624187

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 294

EP - 310

BT - The Semantic Web – ISWC 2020 - 19th International Semantic Web Conference, 2020, Proceedings

A2 - Pan, Jeff Z.

A2 - Tamma, Valentina

A2 - d’Amato, Claudia

A2 - Janowicz, Krzysztof

A2 - Fu, Bo

A2 - Polleres, Axel

A2 - Seneviratne, Oshani

A2 - Kagal, Lalana

ER -

By the same author(s)