Combining Textual Features for the Detection of Hateful and Offensive Language

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Sherzod Hakimov
  • Ralph Ewerth

Organisationseinheiten

Externe Organisationen

  • Technische Informationsbibliothek (TIB) Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksFIRE 2021 Working Notes
UntertitelWorking Notes of FIRE 2021 - Forum for Information Retrieval Evaluation
Seiten412-418
Seitenumfang7
PublikationsstatusVeröffentlicht - Dez. 2021
VeranstaltungWorking Notes of FIRE - 13th Forum for Information Retrieval Evaluation, FIRE-WN 2021 - Gandhinagar, Indien
Dauer: 13 Dez. 202117 Dez. 2021

Publikationsreihe

NameCEUR Workshop Proceedings
Herausgeber (Verlag)CEUR Workshop Proceedings
Band3159
ISSN (Print)1613-0073

Abstract

The detection of offensive, hateful and profane language has become a critical challenge since many users in social networks are exposed to cyberbullying activities on a daily basis. In this paper, we present an analysis of combining different textual features for the detection of hateful or offensive posts on Twitter. We provide a detailed experimental evaluation to understand the impact of each building block in a neural network architecture. The proposed architecture is evaluated on the English Subtask 1A: Identifying Hate, offensive and profane content from the post datasets of HASOC-2021 dataset under the team name TIB-VA. We compared different variants of the contextual word embeddings combined with the character level embeddings and the encoding of collected hate terms.

ASJC Scopus Sachgebiete

Zitieren

Combining Textual Features for the Detection of Hateful and Offensive Language. / Hakimov, Sherzod; Ewerth, Ralph.
FIRE 2021 Working Notes: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation. 2021. S. 412-418 (CEUR Workshop Proceedings; Band 3159).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Hakimov, S & Ewerth, R 2021, Combining Textual Features for the Detection of Hateful and Offensive Language. in FIRE 2021 Working Notes: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation. CEUR Workshop Proceedings, Bd. 3159, S. 412-418, Working Notes of FIRE - 13th Forum for Information Retrieval Evaluation, FIRE-WN 2021, Gandhinagar, Indien, 13 Dez. 2021. https://doi.org/10.48550/arXiv.2112.04803
Hakimov, S., & Ewerth, R. (2021). Combining Textual Features for the Detection of Hateful and Offensive Language. In FIRE 2021 Working Notes: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation (S. 412-418). (CEUR Workshop Proceedings; Band 3159). https://doi.org/10.48550/arXiv.2112.04803
Hakimov S, Ewerth R. Combining Textual Features for the Detection of Hateful and Offensive Language. in FIRE 2021 Working Notes: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation. 2021. S. 412-418. (CEUR Workshop Proceedings). doi: 10.48550/arXiv.2112.04803
Hakimov, Sherzod ; Ewerth, Ralph. / Combining Textual Features for the Detection of Hateful and Offensive Language. FIRE 2021 Working Notes: Working Notes of FIRE 2021 - Forum for Information Retrieval Evaluation. 2021. S. 412-418 (CEUR Workshop Proceedings).
Download
@inproceedings{f20bf371926c4dc7b58f733d50ed37ad,
title = "Combining Textual Features for the Detection of Hateful and Offensive Language",
abstract = "The detection of offensive, hateful and profane language has become a critical challenge since many users in social networks are exposed to cyberbullying activities on a daily basis. In this paper, we present an analysis of combining different textual features for the detection of hateful or offensive posts on Twitter. We provide a detailed experimental evaluation to understand the impact of each building block in a neural network architecture. The proposed architecture is evaluated on the English Subtask 1A: Identifying Hate, offensive and profane content from the post datasets of HASOC-2021 dataset under the team name TIB-VA. We compared different variants of the contextual word embeddings combined with the character level embeddings and the encoding of collected hate terms.",
keywords = "abusive language detection, hate speech detection, offensive language detection, social media mining",
author = "Sherzod Hakimov and Ralph Ewerth",
year = "2021",
month = dec,
doi = "10.48550/arXiv.2112.04803",
language = "English",
series = "CEUR Workshop Proceedings",
publisher = "CEUR Workshop Proceedings",
pages = "412--418",
booktitle = "FIRE 2021 Working Notes",
note = "Working Notes of FIRE - 13th Forum for Information Retrieval Evaluation, FIRE-WN 2021 ; Conference date: 13-12-2021 Through 17-12-2021",

}

Download

TY - GEN

T1 - Combining Textual Features for the Detection of Hateful and Offensive Language

AU - Hakimov, Sherzod

AU - Ewerth, Ralph

PY - 2021/12

Y1 - 2021/12

N2 - The detection of offensive, hateful and profane language has become a critical challenge since many users in social networks are exposed to cyberbullying activities on a daily basis. In this paper, we present an analysis of combining different textual features for the detection of hateful or offensive posts on Twitter. We provide a detailed experimental evaluation to understand the impact of each building block in a neural network architecture. The proposed architecture is evaluated on the English Subtask 1A: Identifying Hate, offensive and profane content from the post datasets of HASOC-2021 dataset under the team name TIB-VA. We compared different variants of the contextual word embeddings combined with the character level embeddings and the encoding of collected hate terms.

AB - The detection of offensive, hateful and profane language has become a critical challenge since many users in social networks are exposed to cyberbullying activities on a daily basis. In this paper, we present an analysis of combining different textual features for the detection of hateful or offensive posts on Twitter. We provide a detailed experimental evaluation to understand the impact of each building block in a neural network architecture. The proposed architecture is evaluated on the English Subtask 1A: Identifying Hate, offensive and profane content from the post datasets of HASOC-2021 dataset under the team name TIB-VA. We compared different variants of the contextual word embeddings combined with the character level embeddings and the encoding of collected hate terms.

KW - abusive language detection

KW - hate speech detection

KW - offensive language detection

KW - social media mining

UR - http://www.scopus.com/inward/record.url?scp=85134272740&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2112.04803

DO - 10.48550/arXiv.2112.04803

M3 - Conference contribution

AN - SCOPUS:85134272740

T3 - CEUR Workshop Proceedings

SP - 412

EP - 418

BT - FIRE 2021 Working Notes

T2 - Working Notes of FIRE - 13th Forum for Information Retrieval Evaluation, FIRE-WN 2021

Y2 - 13 December 2021 through 17 December 2021

ER -