GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics

Ignacio Traverso-Ribón; Maria Esther Vidal

doi:10.1007/978-3-319-98809-2_11

Details

Originalsprache	Englisch
Titel des Sammelwerks	Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings
Herausgeber/-innen	Hui Ma, Roland R. Wagner, Sven Hartmann, Gunther Pernul, Abdelkader Hameurlain
Herausgeber (Verlag)	Springer Verlag
Seiten	169-183
Seitenumfang	15
ISBN (Print)	9783319988085
Publikationsstatus	Elektronisch veröffentlicht (E-Pub) - 9 Aug. 2018
Veranstaltung	29th International Conference on Database and Expert Systems Applications, DEXA 2018 - Regensburg, Deutschland Dauer: 3 Sept. 2018 → 6 Sept. 2018

Publikationsreihe

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band	11029 LNCS
ISSN (Print)	0302-9743
ISSN (elektronisch)	1611-3349

Abstract

Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.

ASJC Scopus Sachgebiete

Mathematik (insg.)
Theoretische Informatik
Informatik (insg.)
Allgemeine Computerwissenschaft

Zitieren

GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. / Traverso-Ribón, Ignacio; Vidal, Maria Esther.
Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. Hrsg. / Hui Ma; Roland R. Wagner; Sven Hartmann; Gunther Pernul; Abdelkader Hameurlain. Springer Verlag, 2018. S. 169-183 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 11029 LNCS).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review

Traverso-Ribón, I & Vidal, ME 2018, GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. in H Ma, RR Wagner, S Hartmann, G Pernul & A Hameurlain (Hrsg.), Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Bd. 11029 LNCS, Springer Verlag, S. 169-183, 29th International Conference on Database and Expert Systems Applications, DEXA 2018, Regensburg, Deutschland, 3 Sept. 2018. https://doi.org/10.1007/978-3-319-98809-2_11

Traverso-Ribón, I., & Vidal, M. E. (2018). GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. In H. Ma, R. R. Wagner, S. Hartmann, G. Pernul, & A. Hameurlain (Hrsg.), Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings (S. 169-183). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 11029 LNCS). Springer Verlag. Vorabveröffentlichung online. https://doi.org/10.1007/978-3-319-98809-2_11

Traverso-Ribón I, Vidal ME. GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. in Ma H, Wagner RR, Hartmann S, Pernul G, Hameurlain A, Hrsg., Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. Springer Verlag. 2018. S. 169-183. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). Epub 2018 Aug 9. doi: 10.1007/978-3-319-98809-2_11

Traverso-Ribón, Ignacio ; Vidal, Maria Esther. / GARUM : A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics. Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings. Hrsg. / Hui Ma ; Roland R. Wagner ; Sven Hartmann ; Gunther Pernul ; Abdelkader Hameurlain. Springer Verlag, 2018. S. 169-183 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

Download

@inproceedings{6c76890edade44cb90a1f714c0ad7b38,

title = "GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics",

abstract = "Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.",

author = "Ignacio Traverso-Rib{\'o}n and Vidal, {Maria Esther}",

year = "2018",

month = aug,

day = "9",

doi = "10.1007/978-3-319-98809-2_11",

language = "English",

isbn = "9783319988085",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "169--183",

editor = "Hui Ma and Wagner, {Roland R.} and Sven Hartmann and Gunther Pernul and Abdelkader Hameurlain",

booktitle = "Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings",

address = "Germany",

note = "29th International Conference on Database and Expert Systems Applications, DEXA 2018 ; Conference date: 03-09-2018 Through 06-09-2018",

}

Download

TY - GEN

T1 - GARUM

T2 - 29th International Conference on Database and Expert Systems Applications, DEXA 2018

AU - Traverso-Ribón, Ignacio

AU - Vidal, Maria Esther

PY - 2018/8/9

Y1 - 2018/8/9

N2 - Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.

AB - Knowledge graphs encode semantics that describes entities in terms of several characteristics, e.g., attributes, neighbors, class hierarchies, or association degrees. Several data-driven tasks, e.g., ranking, clustering, or link discovery, require for determining the relatedness between knowledge graph entities. However, state-of-the-art similarity measures may not consider all the characteristics of an entity to determine entity relatedness. We address the problem of similarity assessment between knowledge graph entities and devise GARUM, a semantic similarity measure for knowledge graphs. GARUM relies on similarities of entity characteristics and computes similarity values considering simultaneously several entity characteristics. This combination can be manually or automatically defined with the help of a machine learning approach. We empirically evaluate the accuracy of GARUM on knowledge graphs from different domains, e.g., networks of proteins and media news. In the experimental study, GARUM exhibits higher correlation with gold standards than studied existing approaches. Thus, these results suggest that similarity measures should not consider entity characteristics in isolation; contrary, combinations of these characteristics are required to precisely determine relatedness among entities in a knowledge graph. Further, the combination functions found by a machine learning approach outperform the results obtained by the manually defined aggregation functions.

UR - http://www.scopus.com/inward/record.url?scp=85052092006&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-98809-2_11

DO - 10.1007/978-3-319-98809-2_11

M3 - Conference contribution

AN - SCOPUS:85052092006

SN - 9783319988085

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 169

EP - 183

BT - Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Proceedings

A2 - Ma, Hui

A2 - Wagner, Roland R.

A2 - Hartmann, Sven

A2 - Pernul, Gunther

A2 - Hameurlain, Abdelkader

PB - Springer Verlag

Y2 - 3 September 2018 through 6 September 2018

ER -

Research@Leibniz University

GARUM: A Semantic Similarity Measure Based on Machine Learning and Entity Characteristics

Autorschaft

Organisationseinheiten

Externe Organisationen

Details

Publikationsreihe

Abstract

ASJC Scopus Sachgebiete

Zitieren