How to trace and revise identities

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publicationThe Semantic Web
Subtitle of host publicationResearch and Applications - 6th European Semantic Web Conference, ESWC 2009, Proceedings
Pages414-428
Number of pages15
ISBN (electronic)978-3-642-02121-3
Publication statusPublished - 2009
Event6th European Semantic Web Conference, ESWC 2009 - Heraklion, Crete, Greece
Duration: 31 May 20094 Jun 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5554 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

The Entity Name System (ENS) is a service aiming at providing globally unique URIs for all kinds of real-world entities such as persons, locations and products, based on descriptions of such entities. Because entity descriptions available to the ENS for deciding on entity identity-Do two entity descriptions refer to the same real-world entity?-are changing over time, the system has to revise its past decisions: One entity has been given two different URIs or two entities have been attributed the same URI. The question we have to investigate in this context is then: How do we propagate entity decision revisions to the clients which make use of the URIs provided by the ENS? In this paper we propose a solution which relies on labelling the IDs with additional history information. These labels allow clients to locally detect deprecated URIs they are using and also merge IDs referring to the same real-world entity without needing to consult the ENS. Making update requests to the ENS only for the IDs detected as deprecated considerably reduces the number of update requests, at the cost of a decrease in uniqueness quality. We investigate how much the number of update requests decreases using ID history labelling, as well as how this impacts the uniqueness of the IDs on the client. For the experiments we use both artificially generated entity revision histories as well as a real case study based on the revision history of the Dutch and Simple English Wikipedia.

ASJC Scopus subject areas

Cite this

How to trace and revise identities. / Gaugaz, Julien; Zakrzewski, Jakub; Demartini, Gianluca et al.
The Semantic Web: Research and Applications - 6th European Semantic Web Conference, ESWC 2009, Proceedings. 2009. p. 414-428 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5554 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Gaugaz, J, Zakrzewski, J, Demartini, G & Nejdl, W 2009, How to trace and revise identities. in The Semantic Web: Research and Applications - 6th European Semantic Web Conference, ESWC 2009, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5554 LNCS, pp. 414-428, 6th European Semantic Web Conference, ESWC 2009, Heraklion, Crete, Greece, 31 May 2009. https://doi.org/10.1007/978-3-642-02121-3_32
Gaugaz, J., Zakrzewski, J., Demartini, G., & Nejdl, W. (2009). How to trace and revise identities. In The Semantic Web: Research and Applications - 6th European Semantic Web Conference, ESWC 2009, Proceedings (pp. 414-428). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5554 LNCS). https://doi.org/10.1007/978-3-642-02121-3_32
Gaugaz J, Zakrzewski J, Demartini G, Nejdl W. How to trace and revise identities. In The Semantic Web: Research and Applications - 6th European Semantic Web Conference, ESWC 2009, Proceedings. 2009. p. 414-428. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-02121-3_32
Gaugaz, Julien ; Zakrzewski, Jakub ; Demartini, Gianluca et al. / How to trace and revise identities. The Semantic Web: Research and Applications - 6th European Semantic Web Conference, ESWC 2009, Proceedings. 2009. pp. 414-428 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{a16f9490b0ae4010881038a72a4c6200,
title = "How to trace and revise identities",
abstract = "The Entity Name System (ENS) is a service aiming at providing globally unique URIs for all kinds of real-world entities such as persons, locations and products, based on descriptions of such entities. Because entity descriptions available to the ENS for deciding on entity identity-Do two entity descriptions refer to the same real-world entity?-are changing over time, the system has to revise its past decisions: One entity has been given two different URIs or two entities have been attributed the same URI. The question we have to investigate in this context is then: How do we propagate entity decision revisions to the clients which make use of the URIs provided by the ENS? In this paper we propose a solution which relies on labelling the IDs with additional history information. These labels allow clients to locally detect deprecated URIs they are using and also merge IDs referring to the same real-world entity without needing to consult the ENS. Making update requests to the ENS only for the IDs detected as deprecated considerably reduces the number of update requests, at the cost of a decrease in uniqueness quality. We investigate how much the number of update requests decreases using ID history labelling, as well as how this impacts the uniqueness of the IDs on the client. For the experiments we use both artificially generated entity revision histories as well as a real case study based on the revision history of the Dutch and Simple English Wikipedia.",
author = "Julien Gaugaz and Jakub Zakrzewski and Gianluca Demartini and Wolfgang Nejdl",
note = "Funding Information: This work is partially supported by the FP7 EU Large-Scale Integrating Project OKKAM Enabling a Web of Entities (contract no. ICT-215032).; 6th European Semantic Web Conference, ESWC 2009 ; Conference date: 31-05-2009 Through 04-06-2009",
year = "2009",
doi = "10.1007/978-3-642-02121-3_32",
language = "English",
isbn = "978-3-642-02120-6",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "414--428",
booktitle = "The Semantic Web",

}

Download

TY - GEN

T1 - How to trace and revise identities

AU - Gaugaz, Julien

AU - Zakrzewski, Jakub

AU - Demartini, Gianluca

AU - Nejdl, Wolfgang

N1 - Funding Information: This work is partially supported by the FP7 EU Large-Scale Integrating Project OKKAM Enabling a Web of Entities (contract no. ICT-215032).

PY - 2009

Y1 - 2009

N2 - The Entity Name System (ENS) is a service aiming at providing globally unique URIs for all kinds of real-world entities such as persons, locations and products, based on descriptions of such entities. Because entity descriptions available to the ENS for deciding on entity identity-Do two entity descriptions refer to the same real-world entity?-are changing over time, the system has to revise its past decisions: One entity has been given two different URIs or two entities have been attributed the same URI. The question we have to investigate in this context is then: How do we propagate entity decision revisions to the clients which make use of the URIs provided by the ENS? In this paper we propose a solution which relies on labelling the IDs with additional history information. These labels allow clients to locally detect deprecated URIs they are using and also merge IDs referring to the same real-world entity without needing to consult the ENS. Making update requests to the ENS only for the IDs detected as deprecated considerably reduces the number of update requests, at the cost of a decrease in uniqueness quality. We investigate how much the number of update requests decreases using ID history labelling, as well as how this impacts the uniqueness of the IDs on the client. For the experiments we use both artificially generated entity revision histories as well as a real case study based on the revision history of the Dutch and Simple English Wikipedia.

AB - The Entity Name System (ENS) is a service aiming at providing globally unique URIs for all kinds of real-world entities such as persons, locations and products, based on descriptions of such entities. Because entity descriptions available to the ENS for deciding on entity identity-Do two entity descriptions refer to the same real-world entity?-are changing over time, the system has to revise its past decisions: One entity has been given two different URIs or two entities have been attributed the same URI. The question we have to investigate in this context is then: How do we propagate entity decision revisions to the clients which make use of the URIs provided by the ENS? In this paper we propose a solution which relies on labelling the IDs with additional history information. These labels allow clients to locally detect deprecated URIs they are using and also merge IDs referring to the same real-world entity without needing to consult the ENS. Making update requests to the ENS only for the IDs detected as deprecated considerably reduces the number of update requests, at the cost of a decrease in uniqueness quality. We investigate how much the number of update requests decreases using ID history labelling, as well as how this impacts the uniqueness of the IDs on the client. For the experiments we use both artificially generated entity revision histories as well as a real case study based on the revision history of the Dutch and Simple English Wikipedia.

UR - http://www.scopus.com/inward/record.url?scp=69949096915&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-02121-3_32

DO - 10.1007/978-3-642-02121-3_32

M3 - Conference contribution

AN - SCOPUS:69949096915

SN - 978-3-642-02120-6

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 414

EP - 428

BT - The Semantic Web

T2 - 6th European Semantic Web Conference, ESWC 2009

Y2 - 31 May 2009 through 4 June 2009

ER -

By the same author(s)