Details
| Original language | English |
| --- | --- |
| Title of host publication | SIGIR 2021 |
| Subtitle of host publication | Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval |
| Pages | 2575-2579 |
| Number of pages | 5 |
| ISBN (electronic) | 9781450380379 |
| Publication status | Published - 11 Jul 2021 |
| Event | 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021 - Virtual, Online, Canada. Duration: 11 Jul 2021 → 15 Jul 2021 |
Publication series
| Name | SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval |
| --- | --- |
Abstract
The World Wide Web and social media platforms have become popular sources for news and information. Typically, multimodal information, e.g., image and text, is used to convey information more effectively and to attract attention. While in most cases image content is decorative or depicts additional information, it has also been leveraged to spread misinformation and rumors in recent years. In this paper, we present a web-based demo application that automatically quantifies the cross-modal relations of entities (persons, locations, and events) in image and text. The applications are manifold. For example, the system can help users to explore multimodal articles more efficiently, or can assist human assessors and fact-checking efforts in the verification of the credibility of news stories, tweets, or other multimodal documents.
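To make the idea of "quantifying cross-modal relations" more concrete, the following is a minimal, hypothetical sketch in Python: it measures, per entity type, how many entities mentioned in the text can also be verified in the image. This is only an illustration under simplifying assumptions, not the pipeline described in the paper; all function and variable names are placeholders.

```python
# Hypothetical sketch of a cross-modal consistency score: compare entities
# mentioned in the text with entities recognized in the image and report the
# overlap per entity type. This is NOT the QuTI pipeline, only an illustration;
# the entity dictionaries would in practice come from placeholder components
# such as a named-entity tagger and an image classifier.

from typing import Dict, Set

ENTITY_TYPES = ("person", "location", "event")

def consistency_scores(text_entities: Dict[str, Set[str]],
                       image_entities: Dict[str, Set[str]]) -> Dict[str, float]:
    """Fraction of textual entities of each type that are also found in the image."""
    scores = {}
    for etype in ENTITY_TYPES:
        mentioned = text_entities.get(etype, set())
        depicted = image_entities.get(etype, set())
        # If nothing of this type is mentioned, there is nothing to contradict.
        scores[etype] = len(mentioned & depicted) / len(mentioned) if mentioned else 1.0
    return scores

# Toy example: the article mentions two persons, the photo verifiably shows one.
text_ents = {"person": {"Angela Merkel", "Barack Obama"}, "location": {"Berlin"}}
image_ents = {"person": {"Angela Merkel"}, "location": {"Berlin"}}
print(consistency_scores(text_ents, image_ents))
# -> {'person': 0.5, 'location': 1.0, 'event': 1.0}
```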
Keywords
- cross-modal consistency
- deep learning
- image-text-relations
- multimodal documents
ASJC Scopus subject areas
- Computer Science (all)
- Software
- Computer Graphics and Computer-Aided Design
- Information Systems
Cite this
Springstein, M., Müller-Budack, E., & Ewerth, R. (2021). QuTI! Quantifying Text-Image Consistency in Multimodal Documents. In SIGIR 2021: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2575-2579). Article 3462796. (SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval). https://doi.org/10.48550/arXiv.2104.13748
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
TY - GEN
T1 - QuTI! Quantifying Text-Image Consistency in Multimodal Documents
AU - Springstein, Matthias
AU - Müller-Budack, Eric
AU - Ewerth, Ralph
PY - 2021/7/11
Y1 - 2021/7/11
N2 - The World Wide Web and social media platforms have become popular sources for news and information. Typically, multimodal information, e.g., image and text, is used to convey information more effectively and to attract attention. While in most cases image content is decorative or depicts additional information, it has also been leveraged to spread misinformation and rumors in recent years. In this paper, we present a web-based demo application that automatically quantifies the cross-modal relations of entities (persons, locations, and events) in image and text. The applications are manifold. For example, the system can help users to explore multimodal articles more efficiently, or can assist human assessors and fact-checking efforts in the verification of the credibility of news stories, tweets, or other multimodal documents.
AB - The World Wide Web and social media platforms have become popular sources for news and information. Typically, multimodal information, e.g., image and text, is used to convey information more effectively and to attract attention. While in most cases image content is decorative or depicts additional information, it has also been leveraged to spread misinformation and rumors in recent years. In this paper, we present a web-based demo application that automatically quantifies the cross-modal relations of entities (persons, locations, and events) in image and text. The applications are manifold. For example, the system can help users to explore multimodal articles more efficiently, or can assist human assessors and fact-checking efforts in the verification of the credibility of news stories, tweets, or other multimodal documents.
KW - cross-modal consistency
KW - deep learning
KW - image-text-relations
KW - multimodal documents
UR - http://www.scopus.com/inward/record.url?scp=85111632068&partnerID=8YFLogxK
U2 - 10.48550/arXiv.2104.13748
DO - 10.48550/arXiv.2104.13748
M3 - Conference contribution
AN - SCOPUS:85111632068
T3 - SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
SP - 2575
EP - 2579
BT - SIGIR 2021
T2 - 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2021
Y2 - 11 July 2021 through 15 July 2021
ER -