GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Ignacio Traverso-Ribón
  • Maria Esther Vidal

Research Organisations

External Research Organisations

  • Universidad de Cadiz
  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationDatabase and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings
EditorsHui Ma, Roland R. Wagner, Sven Hartmann, Gunther Pernul, Abdelkader Hameurlain
PublisherSpringer Verlag
Pages169-183
Number of pages15
ISBN (print)9783319988085
Publication statusE-pub ahead of print - 9 Aug 2018
Event29th International Conference on Database and Expert Systems Applications, DEXA 2018 - Regensburg, Germany
Duration: 3 Sept 20186 Sept 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11029 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.

ASJC Scopus subject areas

Cite this

GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. / Traverso-Ribón, Ignacio; Vidal, Maria Esther.
Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. ed. / Hui Ma; Roland R. Wagner; Sven Hartmann; Gunther Pernul; Abdelkader Hameurlain. Springer Verlag, 2018. p. 169-183 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11029 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Traverso-Ribón, I & Vidal, ME 2018, GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. in H Ma, RR Wagner, S Hartmann, G Pernul & A Hameurlain (eds), Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11029 LNCS, Springer Verlag, pp. 169-183, 29th International Conference on Database and Expert Systems Applications, DEXA 2018, Regensburg, Germany, 3 Sept 2018. https://doi.org/10.1007/978-3-319-98809-2_11
Traverso-Ribón, I., & Vidal, M. E. (2018). GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. In H. Ma, R. R. Wagner, S. Hartmann, G. Pernul, & A. Hameurlain (Eds.), Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings (pp. 169-183). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11029 LNCS). Springer Verlag. Advance online publication. https://doi.org/10.1007/978-3-319-98809-2_11
Traverso-Ribón I, Vidal ME. GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. In Ma H, Wagner RR, Hartmann S, Pernul G, Hameurlain A, editors, Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. Springer Verlag. 2018. p. 169-183. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). Epub 2018 Aug 9. doi: 10.1007/978-3-319-98809-2_11
Traverso-Ribón, Ignacio ; Vidal, Maria Esther. / GARUM : A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. editor / Hui Ma ; Roland R. Wagner ; Sven Hartmann ; Gunther Pernul ; Abdelkader Hameurlain. Springer Verlag, 2018. pp. 169-183 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{6c76890edade44cb90a1f714c0ad7b38,
title = "GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics",
abstract = "Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.",
author = "Ignacio Traverso-Rib{\'o}n and Vidal, {Maria Esther}",
year = "2018",
month = aug,
day = "9",
doi = "10.1007/978-3-319-98809-2_11",
language = "English",
isbn = "9783319988085",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "169--183",
editor = "Hui Ma and Wagner, {Roland R.} and Sven Hartmann and Gunther Pernul and Abdelkader Hameurlain",
booktitle = "Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings",
address = "Germany",
note = "29th International Conference on Database and Expert Systems Applications, DEXA 2018 ; Conference date: 03-09-2018 Through 06-09-2018",

}

Download

TY - GEN

T1 - GARUM

T2 - 29th International Conference on Database and Expert Systems Applications, DEXA 2018

AU - Traverso-Ribón, Ignacio

AU - Vidal, Maria Esther

PY - 2018/8/9

Y1 - 2018/8/9

N2 - Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.

AB - Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.

UR - http://www.scopus.com/inward/record.url?scp=85052092006&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-98809-2_11

DO - 10.1007/978-3-319-98809-2_11

M3 - Conference contribution

AN - SCOPUS:85052092006

SN - 9783319988085

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 169

EP - 183

BT - Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings

A2 - Ma, Hui

A2 - Wagner, Roland R.

A2 - Hartmann, Sven

A2 - Pernul, Gunther

A2 - Hameurlain, Abdelkader

PB - Springer Verlag

Y2 - 3 September 2018 through 6 September 2018

ER -