Probabilistic entity linkage for heterogeneous information spaces

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publicationAdvanced Information Systems Engineering
Subtitle of host publication20th International Conference, CAiSE 2008, Proceedings
Pages556-570
Number of pages15
ISBN (electronic)978-3-540-69534-9
Publication statusPublished - 2008
Event20th International Conference on Advanced Information Systems Engineering, CAiSE 2008 - Montpellier, France
Duration: 16 Jun 200820 Jun 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5074 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Heterogeneous information spaces are typically created by merging data from a variety of different applications and information sources. These sources often use different identifiers for data that describe the same real-word entity (for example an artist, a conference, an organization). In this paper we propose a new probabilistic Entity Linkage algorithm for identifying and linking data that refer to the same real-world entity. Our approach focuses on managing entity linkage information in heterogeneous information spaces using probabilistic methods. We use a Bayesian network to model evidences which support the possible object matches along with the interdependencies between them. This enables us to flexibly update the network when new information becomes available, and to cope with the different requirements imposed by applications build on top of information spaces.

Keywords

    Data integration, Entity linkage, Metadata management

ASJC Scopus subject areas

Cite this

Probabilistic entity linkage for heterogeneous information spaces. / Ioannou, Ekaterini; Niederée, Claudia; Nejdl, Wolfgang.
Advanced Information Systems Engineering : 20th International Conference, CAiSE 2008, Proceedings. 2008. p. 556-570 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5074 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Ioannou, E, Niederée, C & Nejdl, W 2008, Probabilistic entity linkage for heterogeneous information spaces. in Advanced Information Systems Engineering : 20th International Conference, CAiSE 2008, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5074 LNCS, pp. 556-570, 20th International Conference on Advanced Information Systems Engineering, CAiSE 2008, Montpellier, France, 16 Jun 2008. https://doi.org/10.1007/978-3-540-69534-9_41
Ioannou, E., Niederée, C., & Nejdl, W. (2008). Probabilistic entity linkage for heterogeneous information spaces. In Advanced Information Systems Engineering : 20th International Conference, CAiSE 2008, Proceedings (pp. 556-570). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5074 LNCS). https://doi.org/10.1007/978-3-540-69534-9_41
Ioannou E, Niederée C, Nejdl W. Probabilistic entity linkage for heterogeneous information spaces. In Advanced Information Systems Engineering : 20th International Conference, CAiSE 2008, Proceedings. 2008. p. 556-570. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-540-69534-9_41
Ioannou, Ekaterini ; Niederée, Claudia ; Nejdl, Wolfgang. / Probabilistic entity linkage for heterogeneous information spaces. Advanced Information Systems Engineering : 20th International Conference, CAiSE 2008, Proceedings. 2008. pp. 556-570 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{a8e27b1d239c4323945fe7604f74e572,
title = "Probabilistic entity linkage for heterogeneous information spaces",
abstract = "Heterogeneous information spaces are typically created by merging data from a variety of different applications and information sources. These sources often use different identifiers for data that describe the same real-word entity (for example an artist, a conference, an organization). In this paper we propose a new probabilistic Entity Linkage algorithm for identifying and linking data that refer to the same real-world entity. Our approach focuses on managing entity linkage information in heterogeneous information spaces using probabilistic methods. We use a Bayesian network to model evidences which support the possible object matches along with the interdependencies between them. This enables us to flexibly update the network when new information becomes available, and to cope with the different requirements imposed by applications build on top of information spaces.",
keywords = "Data integration, Entity linkage, Metadata management",
author = "Ekaterini Ioannou and Claudia Nieder{\'e}e and Wolfgang Nejdl",
year = "2008",
doi = "10.1007/978-3-540-69534-9_41",
language = "English",
isbn = "978-3-540-69533-2",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "556--570",
booktitle = "Advanced Information Systems Engineering",
note = "20th International Conference on Advanced Information Systems Engineering, CAiSE 2008 ; Conference date: 16-06-2008 Through 20-06-2008",

}

Download

TY - GEN

T1 - Probabilistic entity linkage for heterogeneous information spaces

AU - Ioannou, Ekaterini

AU - Niederée, Claudia

AU - Nejdl, Wolfgang

PY - 2008

Y1 - 2008

N2 - Heterogeneous information spaces are typically created by merging data from a variety of different applications and information sources. These sources often use different identifiers for data that describe the same real-word entity (for example an artist, a conference, an organization). In this paper we propose a new probabilistic Entity Linkage algorithm for identifying and linking data that refer to the same real-world entity. Our approach focuses on managing entity linkage information in heterogeneous information spaces using probabilistic methods. We use a Bayesian network to model evidences which support the possible object matches along with the interdependencies between them. This enables us to flexibly update the network when new information becomes available, and to cope with the different requirements imposed by applications build on top of information spaces.

AB - Heterogeneous information spaces are typically created by merging data from a variety of different applications and information sources. These sources often use different identifiers for data that describe the same real-word entity (for example an artist, a conference, an organization). In this paper we propose a new probabilistic Entity Linkage algorithm for identifying and linking data that refer to the same real-world entity. Our approach focuses on managing entity linkage information in heterogeneous information spaces using probabilistic methods. We use a Bayesian network to model evidences which support the possible object matches along with the interdependencies between them. This enables us to flexibly update the network when new information becomes available, and to cope with the different requirements imposed by applications build on top of information spaces.

KW - Data integration

KW - Entity linkage

KW - Metadata management

UR - http://www.scopus.com/inward/record.url?scp=45849137843&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-69534-9_41

DO - 10.1007/978-3-540-69534-9_41

M3 - Conference contribution

AN - SCOPUS:45849137843

SN - 978-3-540-69533-2

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 556

EP - 570

BT - Advanced Information Systems Engineering

T2 - 20th International Conference on Advanced Information Systems Engineering, CAiSE 2008

Y2 - 16 June 2008 through 20 June 2008

ER -

By the same author(s)