Fine-Grained Citation Span Detection for References in Wikipedia

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Besnik Fetahu
  • Katja Markert
  • Avishek Anand

Organisationseinheiten

Externe Organisationen

  • Ruprecht-Karls-Universität Heidelberg
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksEMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings
Seiten1990-1999
Seitenumfang10
ISBN (elektronisch)9781945626838
PublikationsstatusVeröffentlicht - Sept. 2017
Veranstaltung2017 Conference on Empirical Methods in Natural Language Processing - Copenhagen, Dänemark
Dauer: 7 Sept. 201711 Sept. 2017

Abstract

Verifiability is one of the core editing principles in Wikipedia, editors being encouraged to provide citations for the added content. For a Wikipedia article, determining the citation span of a citation, i.e. what content is covered by a citation, is important as it helps decide for which content citations are still missing. We are the first to address the problem of determining the citation span in Wikipedia articles. We approach this problem by classifying which textual fragments in an article are covered by a citation. We propose a sequence classification approach where for a paragraph and a citation, we determine the citation span at a fine-grained level. We provide a thorough experimental evaluation and compare our approach against baselines adopted from the scientific domain, where we show improvement for all evaluation metrics.

ASJC Scopus Sachgebiete

Zitieren

Fine-Grained Citation Span Detection for References in Wikipedia. / Fetahu, Besnik; Markert, Katja; Anand, Avishek.
EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings. 2017. S. 1990-1999.

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Fetahu, B, Markert, K & Anand, A 2017, Fine-Grained Citation Span Detection for References in Wikipedia. in EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings. S. 1990-1999, 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Dänemark, 7 Sept. 2017. https://doi.org/10.18653/v1/D17-1
Fetahu, B., Markert, K., & Anand, A. (2017). Fine-Grained Citation Span Detection for References in Wikipedia. In EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (S. 1990-1999) https://doi.org/10.18653/v1/D17-1
Fetahu B, Markert K, Anand A. Fine-Grained Citation Span Detection for References in Wikipedia. in EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings. 2017. S. 1990-1999 doi: 10.18653/v1/D17-1
Fetahu, Besnik ; Markert, Katja ; Anand, Avishek. / Fine-Grained Citation Span Detection for References in Wikipedia. EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings. 2017. S. 1990-1999
Download
@inproceedings{89941dc7559044fe933259ee4e20d83f,
title = "Fine-Grained Citation Span Detection for References in Wikipedia",
abstract = "Verifiability is one of the core editing principles in Wikipedia, editors being encouraged to provide citations for the added content. For a Wikipedia article, determining the citation span of a citation, i.e. what content is covered by a citation, is important as it helps decide for which content citations are still missing. We are the first to address the problem of determining the citation span in Wikipedia articles. We approach this problem by classifying which textual fragments in an article are covered by a citation. We propose a sequence classification approach where for a paragraph and a citation, we determine the citation span at a fine-grained level. We provide a thorough experimental evaluation and compare our approach against baselines adopted from the scientific domain, where we show improvement for all evaluation metrics.",
author = "Besnik Fetahu and Katja Markert and Avishek Anand",
note = "Funding information: This work is funded by the ERC Advanced Grant ALEXANDRIA (grant no. 339233), and H2020 AFEL project (grant no. 687916).; 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017 ; Conference date: 07-09-2017 Through 11-09-2017",
year = "2017",
month = sep,
doi = "10.18653/v1/D17-1",
language = "English",
pages = "1990--1999",
booktitle = "EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings",

}

Download

TY - GEN

T1 - Fine-Grained Citation Span Detection for References in Wikipedia

AU - Fetahu, Besnik

AU - Markert, Katja

AU - Anand, Avishek

N1 - Funding information: This work is funded by the ERC Advanced Grant ALEXANDRIA (grant no. 339233), and H2020 AFEL project (grant no. 687916).

PY - 2017/9

Y1 - 2017/9

N2 - Verifiability is one of the core editing principles in Wikipedia, editors being encouraged to provide citations for the added content. For a Wikipedia article, determining the citation span of a citation, i.e. what content is covered by a citation, is important as it helps decide for which content citations are still missing. We are the first to address the problem of determining the citation span in Wikipedia articles. We approach this problem by classifying which textual fragments in an article are covered by a citation. We propose a sequence classification approach where for a paragraph and a citation, we determine the citation span at a fine-grained level. We provide a thorough experimental evaluation and compare our approach against baselines adopted from the scientific domain, where we show improvement for all evaluation metrics.

AB - Verifiability is one of the core editing principles in Wikipedia, editors being encouraged to provide citations for the added content. For a Wikipedia article, determining the citation span of a citation, i.e. what content is covered by a citation, is important as it helps decide for which content citations are still missing. We are the first to address the problem of determining the citation span in Wikipedia articles. We approach this problem by classifying which textual fragments in an article are covered by a citation. We propose a sequence classification approach where for a paragraph and a citation, we determine the citation span at a fine-grained level. We provide a thorough experimental evaluation and compare our approach against baselines adopted from the scientific domain, where we show improvement for all evaluation metrics.

UR - http://www.scopus.com/inward/record.url?scp=85066893853&partnerID=8YFLogxK

U2 - 10.18653/v1/D17-1

DO - 10.18653/v1/D17-1

M3 - Conference contribution

AN - SCOPUS:85066893853

SP - 1990

EP - 1999

BT - EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings

T2 - 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017

Y2 - 7 September 2017 through 11 September 2017

ER -