Annotation uncertainty in the context of grammatical change

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Autoren

  • Marie Luis Merten
  • Marcel Wever
  • Michaela Geierhos
  • Doris Tophinke
  • Eyke Hüllermeier

Externe Organisationen

  • Universität Zürich (UZH)
  • Universität der Bundeswehr München
  • Universität Paderborn
  • Ludwig-Maximilians-Universität München (LMU)
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Seiten (von - bis)430-459
Seitenumfang30
FachzeitschriftInternational Journal of Corpus Linguistics
Jahrgang28
Ausgabenummer3
PublikationsstatusVeröffentlicht - 19 Juli 2023
Extern publiziertJa

Abstract

This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

ASJC Scopus Sachgebiete

Zitieren

Annotation uncertainty in the context of grammatical change. / Merten, Marie Luis; Wever, Marcel; Geierhos, Michaela et al.
in: International Journal of Corpus Linguistics, Jahrgang 28, Nr. 3, 19.07.2023, S. 430-459.

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Merten ML, Wever M, Geierhos M, Tophinke D, Hüllermeier E. Annotation uncertainty in the context of grammatical change. International Journal of Corpus Linguistics. 2023 Jul 19;28(3):430-459. doi: 10.1075/ijcl.20113.mer
Merten, Marie Luis ; Wever, Marcel ; Geierhos, Michaela et al. / Annotation uncertainty in the context of grammatical change. in: International Journal of Corpus Linguistics. 2023 ; Jahrgang 28, Nr. 3. S. 430-459.
Download
@article{35cf913042ec4117aa6ec473f1e7450c,
title = "Annotation uncertainty in the context of grammatical change",
abstract = "This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.",
keywords = "annotation, fuzziness, grammatical change, uncertainty",
author = "Merten, {Marie Luis} and Marcel Wever and Michaela Geierhos and Doris Tophinke and Eyke H{\"u}llermeier",
note = "Publisher Copyright: {\textcopyright} 2023 John Benjamins Publishing Company.",
year = "2023",
month = jul,
day = "19",
doi = "10.1075/ijcl.20113.mer",
language = "English",
volume = "28",
pages = "430--459",
journal = "International Journal of Corpus Linguistics",
issn = "1384-6655",
publisher = "John Benjamins Publishing Company",
number = "3",

}

Download

TY - JOUR

T1 - Annotation uncertainty in the context of grammatical change

AU - Merten, Marie Luis

AU - Wever, Marcel

AU - Geierhos, Michaela

AU - Tophinke, Doris

AU - Hüllermeier, Eyke

N1 - Publisher Copyright: © 2023 John Benjamins Publishing Company.

PY - 2023/7/19

Y1 - 2023/7/19

N2 - This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

AB - This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.

KW - annotation

KW - fuzziness

KW - grammatical change

KW - uncertainty

UR - http://www.scopus.com/inward/record.url?scp=85168533774&partnerID=8YFLogxK

U2 - 10.1075/ijcl.20113.mer

DO - 10.1075/ijcl.20113.mer

M3 - Article

AN - SCOPUS:85168533774

VL - 28

SP - 430

EP - 459

JO - International Journal of Corpus Linguistics

JF - International Journal of Corpus Linguistics

SN - 1384-6655

IS - 3

ER -

Von denselben Autoren