Details
Original language | English |
---|---|
Title of host publication | 2009 Latin American Web Congress |
Subtitle of host publication | Joint LA-WEB/CLIHC Conference |
Pages | 230-237 |
Number of pages | 8 |
Publication status | Published - 1 Dec 2009 |
Event | 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference - Merida, Yucatan, Mexico Duration: 9 Nov 2009 → 11 Nov 2009 |
Publication series
Name | 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference |
---|
Abstract
Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.
Keywords
- Entity retrieval, Natural language processing, Web search
ASJC Scopus subject areas
- Computer Science(all)
- Computer Networks and Communications
- Computer Science(all)
- Software
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
2009 Latin American Web Congress: Joint LA-WEB/CLIHC Conference. 2009. p. 230-237 5341521 (2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - An architecture for finding entities on the web
AU - Demartini, Gianluca
AU - Firan, Claudiu S.
AU - Georgescu, Mihai
AU - Iofciu, Tereza
AU - Krestel, Ralf
AU - Nejdl, Wolfgang
PY - 2009/12/1
Y1 - 2009/12/1
N2 - Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.
AB - Recent progress in research fields such as Information Extraction and Information Retrieval enables the creation of systems providing better search experiences to web users. For example, systems that retrieve entities instead of just documents have been built. In this paper we present an approach for large-scale Entity Retrieval using web collections as underlying corpus. We propose an architecture for entity extraction and entity ranking starting from web documents. This is obtained (1) using an existing web document index and (2) creating an entity centric index. We describe advantages and feasibility of our approach using state-of-the-art tools.
KW - Entity retrieval
KW - Natural language processing
KW - Web search
UR - http://www.scopus.com/inward/record.url?scp=72449182171&partnerID=8YFLogxK
U2 - 10.1109/LA-WEB.2009.14
DO - 10.1109/LA-WEB.2009.14
M3 - Conference contribution
AN - SCOPUS:72449182171
SN - 9780769538563
T3 - 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference
SP - 230
EP - 237
BT - 2009 Latin American Web Congress
T2 - 2009 Latin American Web Congress - Joint LA-WEB/CLIHC Conference
Y2 - 9 November 2009 through 11 November 2009
ER -