Details
Originalsprache | Englisch |
---|---|
Seiten (von - bis) | 430-459 |
Seitenumfang | 30 |
Fachzeitschrift | International Journal of Corpus Linguistics |
Jahrgang | 28 |
Ausgabenummer | 3 |
Publikationsstatus | Veröffentlicht - 19 Juli 2023 |
Extern publiziert | Ja |
Abstract
This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.
ASJC Scopus Sachgebiete
- Geisteswissenschaftliche Fächer (insg.)
- Sprache und Linguistik
- Sozialwissenschaften (insg.)
- Linguistik und Sprache
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
in: International Journal of Corpus Linguistics, Jahrgang 28, Nr. 3, 19.07.2023, S. 430-459.
Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review
}
TY - JOUR
T1 - Annotation uncertainty in the context of grammatical change
AU - Merten, Marie Luis
AU - Wever, Marcel
AU - Geierhos, Michaela
AU - Tophinke, Doris
AU - Hüllermeier, Eyke
N1 - Publisher Copyright: © 2023 John Benjamins Publishing Company.
PY - 2023/7/19
Y1 - 2023/7/19
N2 - This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.
AB - This paper elaborates on the notion of uncertainty in the context of annotation in large text corpora, specifically focusing on (but not limited to) historical languages. Such uncertainty might be due to inherent properties of the language, for example, linguistic ambiguity and overlapping categories of linguistic description, but could also be caused by a lack of annotation expertise. By examining annotation uncertainty in more detail, we identify the sources, deepen our understanding of the nature and different types of uncertainty encountered in daily annotation practice, and discuss practical implications of our theoretical findings. This paper can be seen as an attempt to reconcile the perspectives of the main scientific disciplines involved in corpus projects, linguistics and computer science, to develop a unified view and to highlight the potential synergies between these disciplines.
KW - annotation
KW - fuzziness
KW - grammatical change
KW - uncertainty
UR - http://www.scopus.com/inward/record.url?scp=85168533774&partnerID=8YFLogxK
U2 - 10.1075/ijcl.20113.mer
DO - 10.1075/ijcl.20113.mer
M3 - Article
AN - SCOPUS:85168533774
VL - 28
SP - 430
EP - 459
JO - International Journal of Corpus Linguistics
JF - International Journal of Corpus Linguistics
SN - 1384-6655
IS - 3
ER -