Details
Original language | English |
---|---|
Title of host publication | CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management |
Publisher | Association for Computing Machinery (ACM) |
Pages | 337-346 |
Number of pages | 10 |
ISBN (electronic) | 9781450340731 |
Publication status | Published - Oct 2016 |
Event | 25th ACM International Conference on Information and Knowledge Management, CIKM 2016 - Indianapolis, United States Duration: 24 Oct 2016 → 28 Oct 2016 |
Publication series
Name | International Conference on Information and Knowledge Management, Proceedings |
---|---|
Volume | 24-28-October-2016 |
Abstract
An important editing policy in Wikipedia is to provide citations for added statements in Wikipedia pages, where statements can be arbitrary pieces of text, ranging from a sentence to a paragraph. In many cases citations are either outdated or missing altogether. In this work we address the problem of finding and updating news citations for statements in entity pages. We propose a two-stage supervised approach for this problem. In the first step, we construct a classifier to find out whether statements need a news citation or other kinds of citations (web, book, journal, etc.). In the second step, we develop a news citation algorithm for Wikipedia statements, which recommends appropriate citations from a given news collection. Apart from IR techniques that use the statement to query the news collection, we also formalize three properties of an appropriate citation, namely: (i) the citation should entail the Wikipedia statement, (ii) the statement should be central to the citation, and (iii) the citation should be from an authoritative source. We perform an extensive evaluation of both steps, using 20 million articles from a real-world news collection. Our results are quite promising, and show that we can perform this task with high precision and at scale.
ASJC Scopus subject areas
- Business, Management and Accounting(all)
- General Business,Management and Accounting
- Decision Sciences(all)
- General Decision Sciences
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management. Association for Computing Machinery (ACM), 2016. p. 337-346 (International Conference on Information and Knowledge Management, Proceedings; Vol. 24-28-October-2016).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - Finding News Citations for Wikipedia
AU - Fetahu, Besnik
AU - Markert, Katja
AU - Nejdl, Wolfgang
AU - Anand, Avishek
N1 - Funding information: This work is funded by the ERC Advanced Grant ALEXANDRIA (grant no. 339233).
PY - 2016/10
Y1 - 2016/10
N2 - An important editing policy in Wikipedia is to provide citations for added statements in Wikipedia pages, where statements can be arbitrary pieces of text, ranging from a sentence to a paragraph. In many cases citations are either outdated or missing altogether. In this work we address the problem of finding and updating news citations for statements in entity pages. We propose a two-stage supervised approach for this problem. In the first step, we construct a classifier to find out whether statements need a news citation or other kinds of citations (web, book, journal, etc.). In the second step, we develop a news citation algorithm for Wikipedia statements, which recommends appropriate citations from a given news collection. Apart from IR techniques that use the statement to query the news collection, we also formalize three properties of an appropriate citation, namely: (i) the citation should entail the Wikipedia statement, (ii) the statement should be central to the citation, and (iii) the citation should be from an authoritative source. We perform an extensive evaluation of both steps, using 20 million articles from a real-world news collection. Our results are quite promising, and show that we can perform this task with high precision and at scale.
AB - An important editing policy in Wikipedia is to provide citations for added statements in Wikipedia pages, where statements can be arbitrary pieces of text, ranging from a sentence to a paragraph. In many cases citations are either outdated or missing altogether. In this work we address the problem of finding and updating news citations for statements in entity pages. We propose a two-stage supervised approach for this problem. In the first step, we construct a classifier to find out whether statements need a news citation or other kinds of citations (web, book, journal, etc.). In the second step, we develop a news citation algorithm for Wikipedia statements, which recommends appropriate citations from a given news collection. Apart from IR techniques that use the statement to query the news collection, we also formalize three properties of an appropriate citation, namely: (i) the citation should entail the Wikipedia statement, (ii) the statement should be central to the citation, and (iii) the citation should be from an authoritative source. We perform an extensive evaluation of both steps, using 20 million articles from a real-world news collection. Our results are quite promising, and show that we can perform this task with high precision and at scale.
UR - http://www.scopus.com/inward/record.url?scp=84996490260&partnerID=8YFLogxK
U2 - 10.1145/2983323.2983808
DO - 10.1145/2983323.2983808
M3 - Conference contribution
AN - SCOPUS:84996490260
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 337
EP - 346
BT - CIKM 2016 - Proceedings of the 2016 ACM Conference on Information and Knowledge Management
PB - Association for Computing Machinery (ACM)
T2 - 25th ACM International Conference on Information and Knowledge Management, CIKM 2016
Y2 - 24 October 2016 through 28 October 2016
ER -