DistrustRank: Spotting False News Domains

Vinicius Woloszyn; Wolfgang Nejdl

doi:10.1145/3201064.3201083

Details

Original language	English
Title of host publication	WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science
Pages	221-228
Number of pages	8
ISBN (electronic)	9781450355636
Publication status	Published - 15 May 2018
Event	10th ACM Conference on Web Science, WebSci 2018 - Amsterdam, Netherlands
Duration	27 May 2018 → 30 May 2018

Abstract

In this paper, we propose DistrustRank, a semi-supervised learning strategy that automatically separates fake news sources from reliable ones. We first select a small set of unreliable news sources, manually evaluated and classified by experts on fact-checking portals. Once this seed set is created, DistrustRank constructs a weighted graph in which nodes represent websites and an edge connects a pair of websites whenever their similarity reaches a minimum threshold. Next, it computes centrality using a biased PageRank, where the bias is applied to the selected seeds. The output of the model is a trust (or distrust) rank that can be used in two ways: a) as a counter-bias applied when news about a specific subject is ranked, in order to discount possible boosts achieved by false claims; and b) to help humans identify sources that are likely to publish fake news (or that are likely to be reputable), suggesting websites that should be examined more closely or avoided. In our experiments, DistrustRank outperforms the supervised approaches in both the ranking and the classification tasks.
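The pipeline described in the abstract (similarity graph, then PageRank biased toward seed nodes) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the similarity measure, the threshold, the damping factor, or the website features, so cosine similarity, a threshold of 0.5, a damping factor of 0.85, and the toy feature vectors below are all assumptions, and the function names `similarity_graph` and `biased_pagerank` are hypothetical.

```python
import numpy as np


def similarity_graph(features, min_similarity=0.5):
    """Weighted adjacency matrix: cosine similarity, kept only above a threshold.

    Cosine similarity and the 0.5 threshold are assumptions for illustration.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.where(norms == 0, 1.0, norms)
    sim = unit @ unit.T
    np.fill_diagonal(sim, 0.0)            # no self-loops
    sim[sim < min_similarity] = 0.0       # drop edges below the minimum similarity
    return sim


def biased_pagerank(weights, seeds, damping=0.85, tol=1e-10, max_iter=200):
    """PageRank whose teleport step always jumps to the seed nodes."""
    n = weights.shape[0]
    bias = np.zeros(n)
    bias[list(seeds)] = 1.0 / len(seeds)  # teleport mass concentrated on seeds

    col_sums = weights.sum(axis=0)
    dangling = col_sums == 0              # nodes without any edge
    transition = weights / np.where(dangling, 1.0, col_sums)

    rank = np.full(n, 1.0 / n)
    for _ in range(max_iter):
        # Score held by isolated nodes is sent back to the seeds.
        dangling_mass = rank[dangling].sum()
        new_rank = (damping * (transition @ rank + dangling_mass * bias)
                    + (1.0 - damping) * bias)
        converged = np.abs(new_rank - rank).sum() < tol
        rank = new_rank
        if converged:
            break
    return rank


# Toy data (made up): rows are websites, columns are arbitrary content features.
# Sites 0-2 resemble each other, sites 3-4 resemble each other.
features = np.array([
    [1.0, 0.0, 0.1],
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.1],
    [0.0, 1.0, 0.1],
    [0.1, 0.9, 0.0],
])
graph = similarity_graph(features)
scores = biased_pagerank(graph, seeds=[0])  # site 0 plays the known-unreliable seed
for site, score in enumerate(scores):
    print(f"site {site}: distrust score {score:.4f}")
```

In this toy setting, sites that are similar to the seed accumulate most of the score, while the two sites with no sufficiently similar neighbour among the seed's cluster end up with a score close to zero. How such scores would be turned into a ranking or a classification decision is left open here, since the abstract does not specify it.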

Keywords

Credibility analysis, Rumor detection, Text mining

ASJC Scopus subject areas

Computer Science(all)
Computer Networks and Communications

Cite this

DistrustRank: Spotting False News Domains. / Woloszyn, Vinicius; Nejdl, Wolfgang.
WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science. 2018. p. 221-228.

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Woloszyn, V & Nejdl, W 2018, DistrustRank: Spotting False News Domains. in WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science. pp. 221-228, 10th ACM Conference on Web Science, WebSci 2018, Amsterdam, Netherlands, 27 May 2018. https://doi.org/10.1145/3201064.3201083

Woloszyn, V., & Nejdl, W. (2018). DistrustRank: Spotting False News Domains. In WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science (pp. 221-228) https://doi.org/10.1145/3201064.3201083

Woloszyn V, Nejdl W. DistrustRank: Spotting False News Domains. In WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science. 2018. p. 221-228 doi: 10.1145/3201064.3201083

Woloszyn, Vinicius ; Nejdl, Wolfgang. / DistrustRank : Spotting False News Domains. WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science. 2018. pp. 221-228

Download

@inproceedings{f0335af072914352a29ac0d7821ebb2c,

title = "DistrustRank: Spotting False News Domains",

abstract = "In this paper, we propose DistrustRank, a semi-supervised learning strategy that automatically separates fake news sources from reliable ones. We first select a small set of unreliable news sources, manually evaluated and classified by experts on fact-checking portals. Once this seed set is created, DistrustRank constructs a weighted graph in which nodes represent websites and an edge connects a pair of websites whenever their similarity reaches a minimum threshold. Next, it computes centrality using a biased PageRank, where the bias is applied to the selected seeds. The output of the model is a trust (or distrust) rank that can be used in two ways: a) as a counter-bias applied when news about a specific subject is ranked, in order to discount possible boosts achieved by false claims; and b) to help humans identify sources that are likely to publish fake news (or that are likely to be reputable), suggesting websites that should be examined more closely or avoided. In our experiments, DistrustRank outperforms the supervised approaches in both the ranking and the classification tasks.",

keywords = "Credibility analysis, Rumor detection, Text mining",

author = "Vinicius Woloszyn and Wolfgang Nejdl",

note = "Funding information: This work was partially funded by the European Research Council under ALEXANDRIA (ERC 339233) and CAPES, a Brazilian government institution for scientific development.; 10th ACM Conference on Web Science, WebSci 2018 ; Conference date: 27-05-2018 Through 30-05-2018",

year = "2018",

month = may,

day = "15",

doi = "10.1145/3201064.3201083",

language = "English",

pages = "221--228",

booktitle = "WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science",

}

Download

TY - GEN

T1 - DistrustRank

T2 - 10th ACM Conference on Web Science, WebSci 2018

AU - Woloszyn, Vinicius

AU - Nejdl, Wolfgang

N1 - Funding information: This work was partially funded by the European Research Council under ALEXANDRIA (ERC 339233) and CAPES, a Brazilian government institution for scientific development.

PY - 2018/5/15

Y1 - 2018/5/15

N2 - In this paper, we propose DistrustRank, a semi-supervised learning strategy that automatically separates fake news sources from reliable ones. We first select a small set of unreliable news sources, manually evaluated and classified by experts on fact-checking portals. Once this seed set is created, DistrustRank constructs a weighted graph in which nodes represent websites and an edge connects a pair of websites whenever their similarity reaches a minimum threshold. Next, it computes centrality using a biased PageRank, where the bias is applied to the selected seeds. The output of the model is a trust (or distrust) rank that can be used in two ways: a) as a counter-bias applied when news about a specific subject is ranked, in order to discount possible boosts achieved by false claims; and b) to help humans identify sources that are likely to publish fake news (or that are likely to be reputable), suggesting websites that should be examined more closely or avoided. In our experiments, DistrustRank outperforms the supervised approaches in both the ranking and the classification tasks.

AB - In this paper, we propose DistrustRank, a semi-supervised learning strategy that automatically separates fake news sources from reliable ones. We first select a small set of unreliable news sources, manually evaluated and classified by experts on fact-checking portals. Once this seed set is created, DistrustRank constructs a weighted graph in which nodes represent websites and an edge connects a pair of websites whenever their similarity reaches a minimum threshold. Next, it computes centrality using a biased PageRank, where the bias is applied to the selected seeds. The output of the model is a trust (or distrust) rank that can be used in two ways: a) as a counter-bias applied when news about a specific subject is ranked, in order to discount possible boosts achieved by false claims; and b) to help humans identify sources that are likely to publish fake news (or that are likely to be reputable), suggesting websites that should be examined more closely or avoided. In our experiments, DistrustRank outperforms the supervised approaches in both the ranking and the classification tasks.

KW - Credibility analysis

KW - Rumor detection

KW - Text mining

UR - http://www.scopus.com/inward/record.url?scp=85049402001&partnerID=8YFLogxK

U2 - 10.1145/3201064.3201083

DO - 10.1145/3201064.3201083

M3 - Conference contribution

AN - SCOPUS:85049402001

SP - 221

EP - 228

BT - WebSci 2018 - Proceedings of the 10th ACM Conference on Web Science

Y2 - 27 May 2018 through 30 May 2018

ER -

Research@Leibniz University

By the same author(s)

Robust Fusion of Time Series and Image Data for Improved Multimodal Clinical Prediction

Harnessing Empathy and Ethics for Relevance Detection and Information Categorization in Climate and COVID-19 Tweets

Open benchmark for filtering techniques in entity resolution

A Systematic Evaluation of Single-Cell Foundation Models on Cell-Type Classification Task

Adaptive Dispatching of Mobile Charging Stations using Multi-Agent Graph Convolutional Cooperative-Competitive Reinforcement Learning
