How Early Can We Detect? Detecting Misinformation on Social Media Using User Profiling and Network Characteristics

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Shreya Ghosh
  • Prasenjit Mitra

Research Organisations

External Research Organisations

  • Pennsylvania State University
View graph of relations

Details

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases
Subtitle of host publicationApplied Data Science and Demo Track
EditorsGianmarco De Francisci Morales, Francesco Bonchi, Claudia Perlich, Natali Ruchansky, Nicolas Kourtellis, Elena Baralis
PublisherSpringer Science and Business Media Deutschland GmbH
Pages174-189
Number of pages16
ISBN (electronic)978-3-031-43427-3
ISBN (print)9783031434266
Publication statusPublished - 17 Sept 2023
EventEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023 - Turin, Italy
Duration: 18 Sept 202322 Sept 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14174 LNAI
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

The rise of social media has amplified the need for automated detection of misinformation. Current methods face limitations in early detection because crucial information that they rely on is unavailable during the initial phases of information dissemination. This paper presents an innovative model for the early detection of misinformation on social media through the classification of information propagation paths and using linguistic patterns. We have developed and incorporated a causal user attribute inference model to label users as potential misinformation propagators or believers. Our model is designed for early detection of false information and includes two auxiliary tasks: predicting the extent of misinformation dissemination and clustering similar nodes (or users) based on their attributes. We demonstrate that our proposed model can identify fake news on real-world datasets with 86.5% accuracy within 30 min of its initial distribution and before it reaches 50 retweets, outperforming existing state-of-the-art benchmarks.

Keywords

    discourse analysis, Misinformation, social network

ASJC Scopus subject areas

Cite this

How Early Can We Detect? Detecting Misinformation on Social Media Using User Profiling and Network Characteristics. / Ghosh, Shreya; Mitra, Prasenjit.
Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track . ed. / Gianmarco De Francisci Morales; Francesco Bonchi; Claudia Perlich; Natali Ruchansky; Nicolas Kourtellis; Elena Baralis. Springer Science and Business Media Deutschland GmbH, 2023. p. 174-189 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14174 LNAI).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Ghosh, S & Mitra, P 2023, How Early Can We Detect? Detecting Misinformation on Social Media Using User Profiling and Network Characteristics. in G De Francisci Morales, F Bonchi, C Perlich, N Ruchansky, N Kourtellis & E Baralis (eds), Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track . Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 14174 LNAI, Springer Science and Business Media Deutschland GmbH, pp. 174-189, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023, Turin, Italy, 18 Sept 2023. https://doi.org/10.1007/978-3-031-43427-3_11
Ghosh, S., & Mitra, P. (2023). How Early Can We Detect? Detecting Misinformation on Social Media Using User Profiling and Network Characteristics. In G. De Francisci Morales, F. Bonchi, C. Perlich, N. Ruchansky, N. Kourtellis, & E. Baralis (Eds.), Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track (pp. 174-189). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 14174 LNAI). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-43427-3_11
Ghosh S, Mitra P. How Early Can We Detect? Detecting Misinformation on Social Media Using User Profiling and Network Characteristics. In De Francisci Morales G, Bonchi F, Perlich C, Ruchansky N, Kourtellis N, Baralis E, editors, Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track . Springer Science and Business Media Deutschland GmbH. 2023. p. 174-189. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-43427-3_11
Ghosh, Shreya ; Mitra, Prasenjit. / How Early Can We Detect? Detecting Misinformation on Social Media Using User Profiling and Network Characteristics. Machine Learning and Knowledge Discovery in Databases: Applied Data Science and Demo Track . editor / Gianmarco De Francisci Morales ; Francesco Bonchi ; Claudia Perlich ; Natali Ruchansky ; Nicolas Kourtellis ; Elena Baralis. Springer Science and Business Media Deutschland GmbH, 2023. pp. 174-189 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{49c665cd3cdd44f0ba7ae1ac050e1ca0,
title = "How Early Can We Detect?: Detecting Misinformation on Social Media Using User Profiling and Network Characteristics",
abstract = "The rise of social media has amplified the need for automated detection of misinformation. Current methods face limitations in early detection because crucial information that they rely on is unavailable during the initial phases of information dissemination. This paper presents an innovative model for the early detection of misinformation on social media through the classification of information propagation paths and using linguistic patterns. We have developed and incorporated a causal user attribute inference model to label users as potential misinformation propagators or believers. Our model is designed for early detection of false information and includes two auxiliary tasks: predicting the extent of misinformation dissemination and clustering similar nodes (or users) based on their attributes. We demonstrate that our proposed model can identify fake news on real-world datasets with 86.5% accuracy within 30 min of its initial distribution and before it reaches 50 retweets, outperforming existing state-of-the-art benchmarks.",
keywords = "discourse analysis, Misinformation, social network",
author = "Shreya Ghosh and Prasenjit Mitra",
note = "Funding Information: This research was funded by the Federal Ministry of Education and Research (BMBF), Germany under the project LeibnizKILabor with grant No. 01DD20003.; European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023 ; Conference date: 18-09-2023 Through 22-09-2023",
year = "2023",
month = sep,
day = "17",
doi = "10.1007/978-3-031-43427-3_11",
language = "English",
isbn = "9783031434266",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "174--189",
editor = "{De Francisci Morales}, Gianmarco and Francesco Bonchi and Claudia Perlich and Natali Ruchansky and Nicolas Kourtellis and Elena Baralis",
booktitle = "Machine Learning and Knowledge Discovery in Databases",
address = "Germany",

}

Download

TY - GEN

T1 - How Early Can We Detect?

T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2023

AU - Ghosh, Shreya

AU - Mitra, Prasenjit

N1 - Funding Information: This research was funded by the Federal Ministry of Education and Research (BMBF), Germany under the project LeibnizKILabor with grant No. 01DD20003.

PY - 2023/9/17

Y1 - 2023/9/17

N2 - The rise of social media has amplified the need for automated detection of misinformation. Current methods face limitations in early detection because crucial information that they rely on is unavailable during the initial phases of information dissemination. This paper presents an innovative model for the early detection of misinformation on social media through the classification of information propagation paths and using linguistic patterns. We have developed and incorporated a causal user attribute inference model to label users as potential misinformation propagators or believers. Our model is designed for early detection of false information and includes two auxiliary tasks: predicting the extent of misinformation dissemination and clustering similar nodes (or users) based on their attributes. We demonstrate that our proposed model can identify fake news on real-world datasets with 86.5% accuracy within 30 min of its initial distribution and before it reaches 50 retweets, outperforming existing state-of-the-art benchmarks.

AB - The rise of social media has amplified the need for automated detection of misinformation. Current methods face limitations in early detection because crucial information that they rely on is unavailable during the initial phases of information dissemination. This paper presents an innovative model for the early detection of misinformation on social media through the classification of information propagation paths and using linguistic patterns. We have developed and incorporated a causal user attribute inference model to label users as potential misinformation propagators or believers. Our model is designed for early detection of false information and includes two auxiliary tasks: predicting the extent of misinformation dissemination and clustering similar nodes (or users) based on their attributes. We demonstrate that our proposed model can identify fake news on real-world datasets with 86.5% accuracy within 30 min of its initial distribution and before it reaches 50 retweets, outperforming existing state-of-the-art benchmarks.

KW - discourse analysis

KW - Misinformation

KW - social network

UR - http://www.scopus.com/inward/record.url?scp=85174437359&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-43427-3_11

DO - 10.1007/978-3-031-43427-3_11

M3 - Conference contribution

AN - SCOPUS:85174437359

SN - 9783031434266

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 174

EP - 189

BT - Machine Learning and Knowledge Discovery in Databases

A2 - De Francisci Morales, Gianmarco

A2 - Bonchi, Francesco

A2 - Perlich, Claudia

A2 - Ruchansky, Natali

A2 - Kourtellis, Nicolas

A2 - Baralis, Elena

PB - Springer Science and Business Media Deutschland GmbH

Y2 - 18 September 2023 through 22 September 2023

ER -