An architecture for finding entities on the web

Gianluca Demartini; Claudiu S. Firan; Mihai Georgescu; Tereza Iofciu; Ralf Krestel; Wolfgang Nejdl

doi:10.1109/LA-WEB.2009.14

Details

Originalsprache	Englisch
Titel des Sammelwerks	2009 Latin American Web Congress
Untertitel	Joint LA-WEB/CLIHC Conference
Seiten	230-237
Seitenumfang	8
Publikationsstatus	Veröffentlicht - 1 Dez. 2009
Veranstaltung	2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference - Merida, Yucatan, Mexiko Dauer: 9 Nov. 2009 → 11 Nov. 2009

Publikationsreihe

Name	2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference

Abstract

Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.

ASJC Scopus Sachgebiete

Informatik (insg.)
Computernetzwerke und -kommunikation
Informatik (insg.)
Software

Zitieren

An architecture for finding entities on the web. / Demartini, Gianluca; Firan, Claudiu S.; Georgescu, Mihai et al.
2009 Latin American Web Congress: Joint LA-WEB/CLIHC Conference. 2009. S. 230-237 5341521 (2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review

Demartini, G, Firan, CS, Georgescu, M, Iofciu, T, Krestel, R & Nejdl, W 2009, An architecture for finding entities on the web. in 2009 Latin American Web Congress: Joint LA-WEB/CLIHC Conference., 5341521, 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference, S. 230-237, 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference, Merida, Yucatan, Mexiko, 9 Nov. 2009. https://doi.org/10.1109/LA-WEB.2009.14

Demartini, G., Firan, C. S., Georgescu, M., Iofciu, T., Krestel, R., & Nejdl, W. (2009). An architecture for finding entities on the web. In 2009 Latin American Web Congress: Joint LA-WEB/CLIHC Conference (S. 230-237). Artikel 5341521 (2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference). https://doi.org/10.1109/LA-WEB.2009.14

Demartini G, Firan CS, Georgescu M, Iofciu T, Krestel R, Nejdl W. An architecture for finding entities on the web. in 2009 Latin American Web Congress: Joint LA-WEB/CLIHC Conference. 2009. S. 230-237. 5341521. (2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference). doi: 10.1109/LA-WEB.2009.14

Demartini, Gianluca ; Firan, Claudiu S. ; Georgescu, Mihai et al. / An architecture for finding entities on the web. 2009 Latin American Web Congress: Joint LA-WEB/CLIHC Conference. 2009. S. 230-237 (2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference).

Download

@inproceedings{828d3be6e9454552b97b0447294410a3,

title = "An architecture for finding entities on the web",

abstract = "Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.",

keywords = "Entity retrieval, Natural language processing, Web search",

author = "Gianluca Demartini and Firan, {Claudiu S.} and Mihai Georgescu and Tereza Iofciu and Ralf Krestel and Wolfgang Nejdl",

year = "2009",

month = dec,

day = "1",

doi = "10.1109/LA-WEB.2009.14",

language = "English",

isbn = "9780769538563",

series = "2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference",

pages = "230--237",

booktitle = "2009 Latin American Web Congress",

note = "2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference ; Conference date: 09-11-2009 Through 11-11-2009",

}

Download

TY - GEN

T1 - An architecture for finding entities on the web

AU - Demartini, Gianluca

AU - Firan, Claudiu S.

AU - Georgescu, Mihai

AU - Iofciu, Tereza

AU - Krestel, Ralf

AU - Nejdl, Wolfgang

PY - 2009/12/1

Y1 - 2009/12/1

N2 - Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.

AB - Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.

KW - Entity retrieval

KW - Natural language processing

KW - Web search

UR - http://www.scopus.com/inward/record.url?scp=72449182171&partnerID=8YFLogxK

U2 - 10.1109/LA-WEB.2009.14

DO - 10.1109/LA-WEB.2009.14

M3 - Conference contribution

AN - SCOPUS:72449182171

SN - 9780769538563

T3 - 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference

SP - 230

EP - 237

BT - 2009 Latin American Web Congress

T2 - 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference

Y2 - 9 November 2009 through 11 November 2009

ER -

Research@Leibniz University

An architecture for finding entities on the web

Autoren

Organisationseinheiten

Details

Publikationsreihe

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Robust Fusion of Time Series and Image Data for Improved Multimodal Clinical Prediction

Harnessing Empathy and Ethics for Relevance Detection and Information Categorization in Climate and COVID-19 Tweets

Open benchmark for filtering techniques in entity resolution

Beyond Accuracy: Investigating Error Types in GPT-4 Responses to USMLE Questions

An artificial intelligence-assisted clinical framework to facilitate diagnostics and translational discovery in hematologic neoplasia