Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts

Research output: Contribution to journalArticleResearchpeer review

Authors

  • Ahmad Sakor
  • Kuldeep Singh
  • Maria Esther Vidal

Research Organisations

External Research Organisations

  • Zerotha Research
View graph of relations

Details

Original languageEnglish
Pages (from-to)115351-115371
Number of pages21
JournalIEEE ACCESS
Volume10
Early online date26 Oct 2022
Publication statusPublished - 2022

Abstract

Social media networks have become a prime source for sharing news, opinions, and research accomplishments in various domains, and hundreds of millions of posts are announced daily. Given this wealth of information in social media, finding related announcements has become a relevant task, particularly in trending news (e.g., COVID-19 or lung cancer). To facilitate the search of connected posts, social networks enable users to annotate their posts, e.g., with hashtags in tweets. Albeit effective, an annotation-based search is limited because results will only include the posts that share the same annotations. This paper focuses on retrieving context-related posts based on a specific topic, and presents PINYON, a knowledge-driven framework, that retrieves associated posts effectively. PINYON implements a two-fold pipeline. First, it encodes, in a graph, a CORPUS of posts and an input post; posts are annotated with entities for existing knowledge graphs and connected based on the similarity of their entities. In a decoding phase, the encoded graph is used to discover communities of related posts. We cast this problem into the Vertex Coloring Problem, where communities of similar posts include the posts annotated with entities colored with the same colors. Built on results reported in the graph theory, PINYON implements the decoding phase guided by a heuristic-based method that determines relatedness among posts based on contextual knowledge, and efficiently groups the most similar posts in the same communities. PINYON is empirically evaluated on various datasets and compared with state-of-the-art implementations of the decoding phase. The quality of the generated communities is also analyzed based on multiple metrics. The observed outcomes indicate that PINYON accurately identifies semantically related posts in different contexts. Moreover, the reported results put in perspective the impact of known properties about the optimality of existing heuristics for vertex graph coloring and their implications on PINYON scalability.

Keywords

    community detection, COVID-19, knowledge graph, knowledge retrieval, post relatedness, Social media networks

ASJC Scopus subject areas

Sustainable Development Goals

Cite this

Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts. / Sakor, Ahmad; Singh, Kuldeep; Vidal, Maria Esther.
In: IEEE ACCESS, Vol. 10, 2022, p. 115351-115371.

Research output: Contribution to journalArticleResearchpeer review

Sakor A, Singh K, Vidal ME. Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts. IEEE ACCESS. 2022;10:115351-115371. Epub 2022 Oct 26. doi: 10.1109/ACCESS.2022.3217492
Sakor, Ahmad ; Singh, Kuldeep ; Vidal, Maria Esther. / Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts. In: IEEE ACCESS. 2022 ; Vol. 10. pp. 115351-115371.
Download
@article{71f13089f6f041988c0f31d4ba9a4723,
title = "Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts",
abstract = "Social media networks have become a prime source for sharing news, opinions, and research accomplishments in various domains, and hundreds of millions of posts are announced daily. Given this wealth of information in social media, finding related announcements has become a relevant task, particularly in trending news (e.g., COVID-19 or lung cancer). To facilitate the search of connected posts, social networks enable users to annotate their posts, e.g., with hashtags in tweets. Albeit effective, an annotation-based search is limited because results will only include the posts that share the same annotations. This paper focuses on retrieving context-related posts based on a specific topic, and presents PINYON, a knowledge-driven framework, that retrieves associated posts effectively. PINYON implements a two-fold pipeline. First, it encodes, in a graph, a CORPUS of posts and an input post; posts are annotated with entities for existing knowledge graphs and connected based on the similarity of their entities. In a decoding phase, the encoded graph is used to discover communities of related posts. We cast this problem into the Vertex Coloring Problem, where communities of similar posts include the posts annotated with entities colored with the same colors. Built on results reported in the graph theory, PINYON implements the decoding phase guided by a heuristic-based method that determines relatedness among posts based on contextual knowledge, and efficiently groups the most similar posts in the same communities. PINYON is empirically evaluated on various datasets and compared with state-of-the-art implementations of the decoding phase. The quality of the generated communities is also analyzed based on multiple metrics. The observed outcomes indicate that PINYON accurately identifies semantically related posts in different contexts. Moreover, the reported results put in perspective the impact of known properties about the optimality of existing heuristics for vertex graph coloring and their implications on PINYON scalability.",
keywords = "community detection, COVID-19, knowledge graph, knowledge retrieval, post relatedness, Social media networks",
author = "Ahmad Sakor and Kuldeep Singh and Vidal, {Maria Esther}",
note = "Funding Information: This work was supported in part by the European Union (EU) H2020 Research and Innovation Action (RIA) Project CLARIFY under Agreement 875160; and in part by the Federal Ministry for Economic Affairs and Energy of Germany in the project Cognitive Economy Intelligence Plattform f{\"u}r die Resilienz wirtschaftlicher {\"O}kosysteme (CoyPu), Germany, under Grant 01MK21007[A-L]. The work of Maria-Esther Vidal was supported in part by the Leibniz Association in the program {"}Leibniz Best Minds: Programme for Women Professors,{"} Project TrustKG Transforming Data in Trustable Insights, under Grant P99/2020. ",
year = "2022",
doi = "10.1109/ACCESS.2022.3217492",
language = "English",
volume = "10",
pages = "115351--115371",
journal = "IEEE ACCESS",
issn = "2169-3536",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

Download

TY - JOUR

T1 - Resorting to Context-Aware Background Knowledge for Unveiling Semantically Related Social Media Posts

AU - Sakor, Ahmad

AU - Singh, Kuldeep

AU - Vidal, Maria Esther

N1 - Funding Information: This work was supported in part by the European Union (EU) H2020 Research and Innovation Action (RIA) Project CLARIFY under Agreement 875160; and in part by the Federal Ministry for Economic Affairs and Energy of Germany in the project Cognitive Economy Intelligence Plattform für die Resilienz wirtschaftlicher Ökosysteme (CoyPu), Germany, under Grant 01MK21007[A-L]. The work of Maria-Esther Vidal was supported in part by the Leibniz Association in the program "Leibniz Best Minds: Programme for Women Professors," Project TrustKG Transforming Data in Trustable Insights, under Grant P99/2020.

PY - 2022

Y1 - 2022

N2 - Social media networks have become a prime source for sharing news, opinions, and research accomplishments in various domains, and hundreds of millions of posts are announced daily. Given this wealth of information in social media, finding related announcements has become a relevant task, particularly in trending news (e.g., COVID-19 or lung cancer). To facilitate the search of connected posts, social networks enable users to annotate their posts, e.g., with hashtags in tweets. Albeit effective, an annotation-based search is limited because results will only include the posts that share the same annotations. This paper focuses on retrieving context-related posts based on a specific topic, and presents PINYON, a knowledge-driven framework, that retrieves associated posts effectively. PINYON implements a two-fold pipeline. First, it encodes, in a graph, a CORPUS of posts and an input post; posts are annotated with entities for existing knowledge graphs and connected based on the similarity of their entities. In a decoding phase, the encoded graph is used to discover communities of related posts. We cast this problem into the Vertex Coloring Problem, where communities of similar posts include the posts annotated with entities colored with the same colors. Built on results reported in the graph theory, PINYON implements the decoding phase guided by a heuristic-based method that determines relatedness among posts based on contextual knowledge, and efficiently groups the most similar posts in the same communities. PINYON is empirically evaluated on various datasets and compared with state-of-the-art implementations of the decoding phase. The quality of the generated communities is also analyzed based on multiple metrics. The observed outcomes indicate that PINYON accurately identifies semantically related posts in different contexts. Moreover, the reported results put in perspective the impact of known properties about the optimality of existing heuristics for vertex graph coloring and their implications on PINYON scalability.

AB - Social media networks have become a prime source for sharing news, opinions, and research accomplishments in various domains, and hundreds of millions of posts are announced daily. Given this wealth of information in social media, finding related announcements has become a relevant task, particularly in trending news (e.g., COVID-19 or lung cancer). To facilitate the search of connected posts, social networks enable users to annotate their posts, e.g., with hashtags in tweets. Albeit effective, an annotation-based search is limited because results will only include the posts that share the same annotations. This paper focuses on retrieving context-related posts based on a specific topic, and presents PINYON, a knowledge-driven framework, that retrieves associated posts effectively. PINYON implements a two-fold pipeline. First, it encodes, in a graph, a CORPUS of posts and an input post; posts are annotated with entities for existing knowledge graphs and connected based on the similarity of their entities. In a decoding phase, the encoded graph is used to discover communities of related posts. We cast this problem into the Vertex Coloring Problem, where communities of similar posts include the posts annotated with entities colored with the same colors. Built on results reported in the graph theory, PINYON implements the decoding phase guided by a heuristic-based method that determines relatedness among posts based on contextual knowledge, and efficiently groups the most similar posts in the same communities. PINYON is empirically evaluated on various datasets and compared with state-of-the-art implementations of the decoding phase. The quality of the generated communities is also analyzed based on multiple metrics. The observed outcomes indicate that PINYON accurately identifies semantically related posts in different contexts. Moreover, the reported results put in perspective the impact of known properties about the optimality of existing heuristics for vertex graph coloring and their implications on PINYON scalability.

KW - community detection

KW - COVID-19

KW - knowledge graph

KW - knowledge retrieval

KW - post relatedness

KW - Social media networks

UR - http://www.scopus.com/inward/record.url?scp=85141499426&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2022.3217492

DO - 10.1109/ACCESS.2022.3217492

M3 - Article

AN - SCOPUS:85141499426

VL - 10

SP - 115351

EP - 115371

JO - IEEE ACCESS

JF - IEEE ACCESS

SN - 2169-3536

ER -