Falcon 2.0: An Entity and Relation Linking Tool over Wikidata

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Ahmad Sakor
  • Kuldeep Singh
  • Anery Patel
  • Maria Esther Vidal

Research Organisations

External Research Organisations

  • German National Library of Science and Technology (TIB)
  • Zerotha-Research and Cerence GmbH
View graph of relations

Details

Original languageEnglish
Title of host publicationCIKM 2020
Subtitle of host publicationProceedings of the 29th ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery (ACM)
Pages3141-3148
Number of pages8
ISBN (electronic)9781450368599
Publication statusPublished - 19 Oct 2020
Event29th ACM International Conference on Information and Knowledge Management, CIKM 2020 - online, Virtual, Online, Ireland
Duration: 19 Oct 202023 Oct 2020

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Abstract

The Natural Language Processing (NLP) community has significantly contributed to the solutions for entity and relation recognition from a natural language text, and possibly linking them to proper matches in Knowledge Graphs (KGs). Considering Wikidata as the background KG, there are still limited tools to link knowledge within the text to Wikidata. In this paper, we present Falcon 2.0, the first joint entity and relation linking tool over Wikidata. It receives a short natural language text in the English language and outputs a ranked list of entities and relations annotated with the proper candidates in Wikidata. The candidates are represented by their Internationalized Resource Identifier (IRI) in Wikidata. Falcon 2.0 resorts to the English language model for the recognition task (e.g., N-Gram tiling and N-Gram splitting), and then an optimization approach for the linking task. We have empirically studied the performance of Falcon 2.0 on Wikidata and concluded that it outperforms all the existing baselines. Falcon 2.0 is open source and can be reused by the community; all the required instructions of Falcon 2.0 are well-documented at our GitHub repository (https://github.com/SDM-TIB/falcon2.0). We also demonstrate an online API, which can be run without any technical expertise. Falcon 2.0 and its background knowledge bases are available as resources at https://labs.tib.eu/falcon/falcon2/.

Keywords

    background knowledge, dbpedia, english morphology, entity linking, nlp, relation linking, wikidata

ASJC Scopus subject areas

Cite this

Falcon 2.0: An Entity and Relation Linking Tool over Wikidata. / Sakor, Ahmad; Singh, Kuldeep; Patel, Anery et al.
CIKM 2020: Proceedings of the 29th ACM International Conference on Information and Knowledge Management. Association for Computing Machinery (ACM), 2020. p. 3141-3148 (International Conference on Information and Knowledge Management, Proceedings).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Sakor, A, Singh, K, Patel, A & Vidal, ME 2020, Falcon 2.0: An Entity and Relation Linking Tool over Wikidata. in CIKM 2020: Proceedings of the 29th ACM International Conference on Information and Knowledge Management. International Conference on Information and Knowledge Management, Proceedings, Association for Computing Machinery (ACM), pp. 3141-3148, 29th ACM International Conference on Information and Knowledge Management, CIKM 2020, Virtual, Online, Ireland, 19 Oct 2020. https://doi.org/10.1145/3340531.3412777
Sakor, A., Singh, K., Patel, A., & Vidal, M. E. (2020). Falcon 2.0: An Entity and Relation Linking Tool over Wikidata. In CIKM 2020: Proceedings of the 29th ACM International Conference on Information and Knowledge Management (pp. 3141-3148). (International Conference on Information and Knowledge Management, Proceedings). Association for Computing Machinery (ACM). https://doi.org/10.1145/3340531.3412777
Sakor A, Singh K, Patel A, Vidal ME. Falcon 2.0: An Entity and Relation Linking Tool over Wikidata. In CIKM 2020: Proceedings of the 29th ACM International Conference on Information and Knowledge Management. Association for Computing Machinery (ACM). 2020. p. 3141-3148. (International Conference on Information and Knowledge Management, Proceedings). doi: 10.1145/3340531.3412777
Sakor, Ahmad ; Singh, Kuldeep ; Patel, Anery et al. / Falcon 2.0 : An Entity and Relation Linking Tool over Wikidata. CIKM 2020: Proceedings of the 29th ACM International Conference on Information and Knowledge Management. Association for Computing Machinery (ACM), 2020. pp. 3141-3148 (International Conference on Information and Knowledge Management, Proceedings).
Download
@inproceedings{166eef9f27604467abf54a3fdda1d127,
title = "Falcon 2.0: An Entity and Relation Linking Tool over Wikidata",
abstract = "The Natural Language Processing (NLP) community has significantly contributed to the solutions for entity and relation recognition from a natural language text, and possibly linking them to proper matches in Knowledge Graphs (KGs). Considering Wikidata as the background KG, there are still limited tools to link knowledge within the text to Wikidata. In this paper, we present Falcon 2.0, the first joint entity and relation linking tool over Wikidata. It receives a short natural language text in the English language and outputs a ranked list of entities and relations annotated with the proper candidates in Wikidata. The candidates are represented by their Internationalized Resource Identifier (IRI) in Wikidata. Falcon 2.0 resorts to the English language model for the recognition task (e.g., N-Gram tiling and N-Gram splitting), and then an optimization approach for the linking task. We have empirically studied the performance of Falcon 2.0 on Wikidata and concluded that it outperforms all the existing baselines. Falcon 2.0 is open source and can be reused by the community; all the required instructions of Falcon 2.0 are well-documented at our GitHub repository (https://github.com/SDM-TIB/falcon2.0). We also demonstrate an online API, which can be run without any technical expertise. Falcon 2.0 and its background knowledge bases are available as resources at https://labs.tib.eu/falcon/falcon2/.",
keywords = "background knowledge, dbpedia, english morphology, entity linking, nlp, relation linking, wikidata",
author = "Ahmad Sakor and Kuldeep Singh and Anery Patel and Vidal, {Maria Esther}",
note = "Funding Information: This work has received funding from the EU H2020 Project No. 727658 (IASIS).; 29th ACM International Conference on Information and Knowledge Management, CIKM 2020 ; Conference date: 19-10-2020 Through 23-10-2020",
year = "2020",
month = oct,
day = "19",
doi = "10.1145/3340531.3412777",
language = "English",
series = "International Conference on Information and Knowledge Management, Proceedings",
publisher = "Association for Computing Machinery (ACM)",
pages = "3141--3148",
booktitle = "CIKM 2020",
address = "United States",

}

Download

TY - GEN

T1 - Falcon 2.0

T2 - 29th ACM International Conference on Information and Knowledge Management, CIKM 2020

AU - Sakor, Ahmad

AU - Singh, Kuldeep

AU - Patel, Anery

AU - Vidal, Maria Esther

N1 - Funding Information: This work has received funding from the EU H2020 Project No. 727658 (IASIS).

PY - 2020/10/19

Y1 - 2020/10/19

N2 - The Natural Language Processing (NLP) community has significantly contributed to the solutions for entity and relation recognition from a natural language text, and possibly linking them to proper matches in Knowledge Graphs (KGs). Considering Wikidata as the background KG, there are still limited tools to link knowledge within the text to Wikidata. In this paper, we present Falcon 2.0, the first joint entity and relation linking tool over Wikidata. It receives a short natural language text in the English language and outputs a ranked list of entities and relations annotated with the proper candidates in Wikidata. The candidates are represented by their Internationalized Resource Identifier (IRI) in Wikidata. Falcon 2.0 resorts to the English language model for the recognition task (e.g., N-Gram tiling and N-Gram splitting), and then an optimization approach for the linking task. We have empirically studied the performance of Falcon 2.0 on Wikidata and concluded that it outperforms all the existing baselines. Falcon 2.0 is open source and can be reused by the community; all the required instructions of Falcon 2.0 are well-documented at our GitHub repository (https://github.com/SDM-TIB/falcon2.0). We also demonstrate an online API, which can be run without any technical expertise. Falcon 2.0 and its background knowledge bases are available as resources at https://labs.tib.eu/falcon/falcon2/.

AB - The Natural Language Processing (NLP) community has significantly contributed to the solutions for entity and relation recognition from a natural language text, and possibly linking them to proper matches in Knowledge Graphs (KGs). Considering Wikidata as the background KG, there are still limited tools to link knowledge within the text to Wikidata. In this paper, we present Falcon 2.0, the first joint entity and relation linking tool over Wikidata. It receives a short natural language text in the English language and outputs a ranked list of entities and relations annotated with the proper candidates in Wikidata. The candidates are represented by their Internationalized Resource Identifier (IRI) in Wikidata. Falcon 2.0 resorts to the English language model for the recognition task (e.g., N-Gram tiling and N-Gram splitting), and then an optimization approach for the linking task. We have empirically studied the performance of Falcon 2.0 on Wikidata and concluded that it outperforms all the existing baselines. Falcon 2.0 is open source and can be reused by the community; all the required instructions of Falcon 2.0 are well-documented at our GitHub repository (https://github.com/SDM-TIB/falcon2.0). We also demonstrate an online API, which can be run without any technical expertise. Falcon 2.0 and its background knowledge bases are available as resources at https://labs.tib.eu/falcon/falcon2/.

KW - background knowledge

KW - dbpedia

KW - english morphology

KW - entity linking

KW - nlp

KW - relation linking

KW - wikidata

UR - http://www.scopus.com/inward/record.url?scp=85095866274&partnerID=8YFLogxK

U2 - 10.1145/3340531.3412777

DO - 10.1145/3340531.3412777

M3 - Conference contribution

AN - SCOPUS:85095866274

T3 - International Conference on Information and Knowledge Management, Proceedings

SP - 3141

EP - 3148

BT - CIKM 2020

PB - Association for Computing Machinery (ACM)

Y2 - 19 October 2020 through 23 October 2020

ER -