MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

External Research Organisations

  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationFrom Born-Physical to Born-Virtual
Subtitle of host publicationAugmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings
EditorsYuen-Hsien Tseng, Marie Katsurai, Hoa N. Nguyen
PublisherSpringer Science and Business Media Deutschland GmbH
Pages290-300
Number of pages11
ISBN (print)9783031217555
Publication statusPublished - 7 Dec 2022
Event24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022 - Hanoi, Viet Nam
Duration: 30 Nov 20222 Dec 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13636 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Information extraction from scholarly articles is a challenging task due to the sizable document length and implicit information hidden in text, figures, and citations. Scholarly information extraction has various applications in exploration, archival, and curation services for digital libraries and knowledge management systems. We present MORTY, an information extraction technique that creates structured summaries of text from scholarly articles. Our approach condenses the article’s full-text to property-value pairs as a segmented text snippet called structured summary. We also present a sizable scholarly dataset combining structured summaries retrieved from a scholarly knowledge graph and corresponding publicly available scientific articles, which we openly publish as a resource for the research community. Our results show that structured summarization is a suitable approach for targeted information extraction that complements other commonly used methods such as question answering and named entity recognition.

Keywords

    Information extraction, Literature review completion, Natural language processing, Scholarly knowledge, Summarization

ASJC Scopus subject areas

Cite this

MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles. / Jaradeh, Mohamad Yaser; Stocker, Markus; Auer, Sören.
From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. ed. / Yuen-Hsien Tseng; Marie Katsurai; Hoa N. Nguyen. Springer Science and Business Media Deutschland GmbH, 2022. p. 290-300 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13636 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Jaradeh, MY, Stocker, M & Auer, S 2022, MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles. in Y-H Tseng, M Katsurai & HN Nguyen (eds), From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13636 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 290-300, 24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022, Hanoi, Viet Nam, 30 Nov 2022. https://doi.org/10.48550/arXiv.2212.05429, https://doi.org/10.1007/978-3-031-21756-2_23
Jaradeh, M. Y., Stocker, M., & Auer, S. (2022). MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles. In Y.-H. Tseng, M. Katsurai, & H. N. Nguyen (Eds.), From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings (pp. 290-300). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13636 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.48550/arXiv.2212.05429, https://doi.org/10.1007/978-3-031-21756-2_23
Jaradeh MY, Stocker M, Auer S. MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles. In Tseng YH, Katsurai M, Nguyen HN, editors, From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. Springer Science and Business Media Deutschland GmbH. 2022. p. 290-300. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.48550/arXiv.2212.05429, 10.1007/978-3-031-21756-2_23
Jaradeh, Mohamad Yaser ; Stocker, Markus ; Auer, Sören. / MORTY : Structured Summarization for Targeted Information Extraction from Scholarly Articles. From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries - 24th International Conference on Asian Digital Libraries, ICADL 2022, Proceedings. editor / Yuen-Hsien Tseng ; Marie Katsurai ; Hoa N. Nguyen. Springer Science and Business Media Deutschland GmbH, 2022. pp. 290-300 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{e2f745b3abec4900a94f894397101a5d,
title = "MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles",
abstract = "Information extraction from scholarly articles is a challenging task due to the sizable document length and implicit information hidden in text, figures, and citations. Scholarly information extraction has various applications in exploration, archival, and curation services for digital libraries and knowledge management systems. We present MORTY, an information extraction technique that creates structured summaries of text from scholarly articles. Our approach condenses the article{\textquoteright}s full-text to property-value pairs as a segmented text snippet called structured summary. We also present a sizable scholarly dataset combining structured summaries retrieved from a scholarly knowledge graph and corresponding publicly available scientific articles, which we openly publish as a resource for the research community. Our results show that structured summarization is a suitable approach for targeted information extraction that complements other commonly used methods such as question answering and named entity recognition.",
keywords = "Information extraction, Literature review completion, Natural language processing, Scholarly knowledge, Summarization",
author = "Jaradeh, {Mohamad Yaser} and Markus Stocker and S{\"o}ren Auer",
year = "2022",
month = dec,
day = "7",
doi = "10.48550/arXiv.2212.05429",
language = "English",
isbn = "9783031217555",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "290--300",
editor = "Yuen-Hsien Tseng and Marie Katsurai and Nguyen, {Hoa N.}",
booktitle = "From Born-Physical to Born-Virtual",
address = "Germany",
note = "24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022 ; Conference date: 30-11-2022 Through 02-12-2022",

}

Download

TY - GEN

T1 - MORTY

T2 - 24th International Conference on Asia-Pacific Digital Libraries, ICADL 2022

AU - Jaradeh, Mohamad Yaser

AU - Stocker, Markus

AU - Auer, Sören

PY - 2022/12/7

Y1 - 2022/12/7

N2 - Information extraction from scholarly articles is a challenging task due to the sizable document length and implicit information hidden in text, figures, and citations. Scholarly information extraction has various applications in exploration, archival, and curation services for digital libraries and knowledge management systems. We present MORTY, an information extraction technique that creates structured summaries of text from scholarly articles. Our approach condenses the article’s full-text to property-value pairs as a segmented text snippet called structured summary. We also present a sizable scholarly dataset combining structured summaries retrieved from a scholarly knowledge graph and corresponding publicly available scientific articles, which we openly publish as a resource for the research community. Our results show that structured summarization is a suitable approach for targeted information extraction that complements other commonly used methods such as question answering and named entity recognition.

AB - Information extraction from scholarly articles is a challenging task due to the sizable document length and implicit information hidden in text, figures, and citations. Scholarly information extraction has various applications in exploration, archival, and curation services for digital libraries and knowledge management systems. We present MORTY, an information extraction technique that creates structured summaries of text from scholarly articles. Our approach condenses the article’s full-text to property-value pairs as a segmented text snippet called structured summary. We also present a sizable scholarly dataset combining structured summaries retrieved from a scholarly knowledge graph and corresponding publicly available scientific articles, which we openly publish as a resource for the research community. Our results show that structured summarization is a suitable approach for targeted information extraction that complements other commonly used methods such as question answering and named entity recognition.

KW - Information extraction

KW - Literature review completion

KW - Natural language processing

KW - Scholarly knowledge

KW - Summarization

UR - http://www.scopus.com/inward/record.url?scp=85145008843&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2212.05429

DO - 10.48550/arXiv.2212.05429

M3 - Conference contribution

AN - SCOPUS:85145008843

SN - 9783031217555

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 290

EP - 300

BT - From Born-Physical to Born-Virtual

A2 - Tseng, Yuen-Hsien

A2 - Katsurai, Marie

A2 - Nguyen, Hoa N.

PB - Springer Science and Business Media Deutschland GmbH

Y2 - 30 November 2022 through 2 December 2022

ER -

By the same author(s)