Detecting Media Bias in News Articles using Gaussian Bias Distributions

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

External Research Organisations

  • Paderborn University
  • Bauhaus-Universität Weimar
View graph of relations

Details

Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics
Subtitle of host publicationEMNLP 2020
Pages4290-4300
Number of pages11
Publication statusPublished - Nov 2020
Externally publishedYes
EventFindings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020 - Virtual, Online
Duration: 16 Nov 202020 Nov 2020

Abstract

Media plays an important role in shaping public opinion. Biased media can influence people in undesirable directions and hence should be unmasked as such. We observe that feature-based and neural text classification approaches which rely only on the distribution of low-level lexical information fail to detect media bias. This weakness becomes most noticeable for articles on new events, where words appear in new contexts and hence their “bias predictiveness” is unclear. In this paper, we therefore study how second-order information about biased statements in an article helps to improve detection effectiveness. In particular, we utilize the probability distributions of the frequency, positions, and sequential order of lexical and informational sentence-level bias in a Gaussian Mixture Model. On an existing media bias dataset, we find that the frequency and positions of biased statements strongly impact article-level bias, whereas their exact sequential order is secondary. Using a standard model for sentence-level bias detection, we provide empirical evidence that article-level bias detectors that use second-order information clearly outperform those without.

ASJC Scopus subject areas

Cite this

Detecting Media Bias in News Articles using Gaussian Bias Distributions. / Chen, Wei Fan; Al-Khatib, Khalid; Stein, Benno et al.
Findings of the Association for Computational Linguistics: EMNLP 2020. 2020. p. 4290-4300.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Chen, WF, Al-Khatib, K, Stein, B & Wachsmuth, H 2020, Detecting Media Bias in News Articles using Gaussian Bias Distributions. in Findings of the Association for Computational Linguistics: EMNLP 2020. pp. 4290-4300, Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020, Virtual, Online, 16 Nov 2020. https://doi.org/10.48550/arXiv.2010.10649, https://doi.org/10.18653/v1/2020.findings-emnlp.383
Chen, W. F., Al-Khatib, K., Stein, B., & Wachsmuth, H. (2020). Detecting Media Bias in News Articles using Gaussian Bias Distributions. In Findings of the Association for Computational Linguistics: EMNLP 2020 (pp. 4290-4300) https://doi.org/10.48550/arXiv.2010.10649, https://doi.org/10.18653/v1/2020.findings-emnlp.383
Chen WF, Al-Khatib K, Stein B, Wachsmuth H. Detecting Media Bias in News Articles using Gaussian Bias Distributions. In Findings of the Association for Computational Linguistics: EMNLP 2020. 2020. p. 4290-4300 doi: 10.48550/arXiv.2010.10649, 10.18653/v1/2020.findings-emnlp.383
Chen, Wei Fan ; Al-Khatib, Khalid ; Stein, Benno et al. / Detecting Media Bias in News Articles using Gaussian Bias Distributions. Findings of the Association for Computational Linguistics: EMNLP 2020. 2020. pp. 4290-4300
Download
@inproceedings{3e51ca0cdd3249f392fab5bba63b2f72,
title = "Detecting Media Bias in News Articles using Gaussian Bias Distributions",
abstract = "Media plays an important role in shaping public opinion. Biased media can influence people in undesirable directions and hence should be unmasked as such. We observe that feature-based and neural text classification approaches which rely only on the distribution of low-level lexical information fail to detect media bias. This weakness becomes most noticeable for articles on new events, where words appear in new contexts and hence their “bias predictiveness” is unclear. In this paper, we therefore study how second-order information about biased statements in an article helps to improve detection effectiveness. In particular, we utilize the probability distributions of the frequency, positions, and sequential order of lexical and informational sentence-level bias in a Gaussian Mixture Model. On an existing media bias dataset, we find that the frequency and positions of biased statements strongly impact article-level bias, whereas their exact sequential order is secondary. Using a standard model for sentence-level bias detection, we provide empirical evidence that article-level bias detectors that use second-order information clearly outperform those without.",
author = "Chen, {Wei Fan} and Khalid Al-Khatib and Benno Stein and Henning Wachsmuth",
note = "Funding Information: This work was partially supported by the German Research Foundation (DFG) within the Collaborative Research Center “On-The-Fly Computing” (SFB 901/3) under the project number 160364472.; Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020 ; Conference date: 16-11-2020 Through 20-11-2020",
year = "2020",
month = nov,
doi = "10.48550/arXiv.2010.10649",
language = "English",
pages = "4290--4300",
booktitle = "Findings of the Association for Computational Linguistics",

}

Download

TY - GEN

T1 - Detecting Media Bias in News Articles using Gaussian Bias Distributions

AU - Chen, Wei Fan

AU - Al-Khatib, Khalid

AU - Stein, Benno

AU - Wachsmuth, Henning

N1 - Funding Information: This work was partially supported by the German Research Foundation (DFG) within the Collaborative Research Center “On-The-Fly Computing” (SFB 901/3) under the project number 160364472.

PY - 2020/11

Y1 - 2020/11

N2 - Media plays an important role in shaping public opinion. Biased media can influence people in undesirable directions and hence should be unmasked as such. We observe that feature-based and neural text classification approaches which rely only on the distribution of low-level lexical information fail to detect media bias. This weakness becomes most noticeable for articles on new events, where words appear in new contexts and hence their “bias predictiveness” is unclear. In this paper, we therefore study how second-order information about biased statements in an article helps to improve detection effectiveness. In particular, we utilize the probability distributions of the frequency, positions, and sequential order of lexical and informational sentence-level bias in a Gaussian Mixture Model. On an existing media bias dataset, we find that the frequency and positions of biased statements strongly impact article-level bias, whereas their exact sequential order is secondary. Using a standard model for sentence-level bias detection, we provide empirical evidence that article-level bias detectors that use second-order information clearly outperform those without.

AB - Media plays an important role in shaping public opinion. Biased media can influence people in undesirable directions and hence should be unmasked as such. We observe that feature-based and neural text classification approaches which rely only on the distribution of low-level lexical information fail to detect media bias. This weakness becomes most noticeable for articles on new events, where words appear in new contexts and hence their “bias predictiveness” is unclear. In this paper, we therefore study how second-order information about biased statements in an article helps to improve detection effectiveness. In particular, we utilize the probability distributions of the frequency, positions, and sequential order of lexical and informational sentence-level bias in a Gaussian Mixture Model. On an existing media bias dataset, we find that the frequency and positions of biased statements strongly impact article-level bias, whereas their exact sequential order is secondary. Using a standard model for sentence-level bias detection, we provide empirical evidence that article-level bias detectors that use second-order information clearly outperform those without.

UR - http://www.scopus.com/inward/record.url?scp=85115719536&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2010.10649

DO - 10.48550/arXiv.2010.10649

M3 - Conference contribution

AN - SCOPUS:85115719536

SP - 4290

EP - 4300

BT - Findings of the Association for Computational Linguistics

T2 - Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020

Y2 - 16 November 2020 through 20 November 2020

ER -

By the same author(s)