SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Marco Anteghini
  • Jennifer D'Souza
  • Vitor A.P.Martins Dos Santos
  • Sören Auer

External Research Organisations

  • LifeGlimmer GmbH
  • Wageningen University and Research
  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationPosters and Demonstrations at EKAW 2020
Subtitle of host publicationProceedings of the EKAW 2020 Posters and Demonstrations Session co-located with 22nd International Conference on Knowledge Engineering and Knowledge Management (EKAW 2020)
Pages22-30
Number of pages9
Publication statusPublished - 2020
Externally publishedYes
Event22nd International Conference on Knowledge Engineering and Knowledge Management - Posters and Demonstrations Session, EKAW-PD 2020 - Virtual, Bozen-Bolzano, Italy
Duration: 16 Sept 202018 Sept 2020

Publication series

NameCEUR Workshop Proceedings
PublisherCEUR Workshop Proceedings
Volume2751
ISSN (Print)1613-0073

Abstract

As a novel contribution to the problem of semantifying bio- logical assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions. Experimental evaluations, to this end, show promise as the neural-based semantification significantly outperforms a naive frequencybased baseline approach. Specifically, the neural method attains 72% F1 versus 47% F1 from the frequency-based method. The work in this paper aligns with the present cutting-edge trend of the scholarly knowledge digitalization impetus which aim to convert the long-standing document-based format of scholarly content into knowledge graphs (KG). To this end, our selected data domain of bioassays are a prime candidate for structuring into KGs.

Keywords

    Bioassays, Machine Learning, Open Science Graphs

ASJC Scopus subject areas

Cite this

SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph. / Anteghini, Marco; D'Souza, Jennifer; Dos Santos, Vitor A.P.Martins et al.
Posters and Demonstrations at EKAW 2020: Proceedings of the EKAW 2020 Posters and Demonstrations Session co-located with 22nd International Conference on Knowledge Engineering and Knowledge Management (EKAW 2020) . 2020. p. 22-30 (CEUR Workshop Proceedings; Vol. 2751).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Anteghini, M, D'Souza, J, Dos Santos, VAPM & Auer, S 2020, SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph. in Posters and Demonstrations at EKAW 2020: Proceedings of the EKAW 2020 Posters and Demonstrations Session co-located with 22nd International Conference on Knowledge Engineering and Knowledge Management (EKAW 2020) . CEUR Workshop Proceedings, vol. 2751, pp. 22-30, 22nd International Conference on Knowledge Engineering and Knowledge Management - Posters and Demonstrations Session, EKAW-PD 2020, Virtual, Bozen-Bolzano, Italy, 16 Sept 2020. https://doi.org/10.48550/arXiv.2009.08801
Anteghini, M., D'Souza, J., Dos Santos, V. A. P. M., & Auer, S. (2020). SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph. In Posters and Demonstrations at EKAW 2020: Proceedings of the EKAW 2020 Posters and Demonstrations Session co-located with 22nd International Conference on Knowledge Engineering and Knowledge Management (EKAW 2020) (pp. 22-30). (CEUR Workshop Proceedings; Vol. 2751). https://doi.org/10.48550/arXiv.2009.08801
Anteghini M, D'Souza J, Dos Santos VAPM, Auer S. SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph. In Posters and Demonstrations at EKAW 2020: Proceedings of the EKAW 2020 Posters and Demonstrations Session co-located with 22nd International Conference on Knowledge Engineering and Knowledge Management (EKAW 2020) . 2020. p. 22-30. (CEUR Workshop Proceedings). doi: 10.48550/arXiv.2009.08801
Anteghini, Marco ; D'Souza, Jennifer ; Dos Santos, Vitor A.P.Martins et al. / SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph. Posters and Demonstrations at EKAW 2020: Proceedings of the EKAW 2020 Posters and Demonstrations Session co-located with 22nd International Conference on Knowledge Engineering and Knowledge Management (EKAW 2020) . 2020. pp. 22-30 (CEUR Workshop Proceedings).
Download
@inproceedings{f596a81fc73c40adb308bdc0d1c52afa,
title = "SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph",
abstract = "As a novel contribution to the problem of semantifying bio- logical assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions. Experimental evaluations, to this end, show promise as the neural-based semantification significantly outperforms a naive frequencybased baseline approach. Specifically, the neural method attains 72% F1 versus 47% F1 from the frequency-based method. The work in this paper aligns with the present cutting-edge trend of the scholarly knowledge digitalization impetus which aim to convert the long-standing document-based format of scholarly content into knowledge graphs (KG). To this end, our selected data domain of bioassays are a prime candidate for structuring into KGs.",
keywords = "Bioassays, Machine Learning, Open Science Graphs",
author = "Marco Anteghini and Jennifer D'Souza and {Dos Santos}, {Vitor A.P.Martins} and S{\"o}ren Auer",
year = "2020",
doi = "10.48550/arXiv.2009.08801",
language = "English",
series = "CEUR Workshop Proceedings",
publisher = "CEUR Workshop Proceedings",
pages = "22--30",
booktitle = "Posters and Demonstrations at EKAW 2020",
note = "22nd International Conference on Knowledge Engineering and Knowledge Management - Posters and Demonstrations Session, EKAW-PD 2020 ; Conference date: 16-09-2020 Through 18-09-2020",

}

Download

TY - GEN

T1 - SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph

AU - Anteghini, Marco

AU - D'Souza, Jennifer

AU - Dos Santos, Vitor A.P.Martins

AU - Auer, Sören

PY - 2020

Y1 - 2020

N2 - As a novel contribution to the problem of semantifying bio- logical assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions. Experimental evaluations, to this end, show promise as the neural-based semantification significantly outperforms a naive frequencybased baseline approach. Specifically, the neural method attains 72% F1 versus 47% F1 from the frequency-based method. The work in this paper aligns with the present cutting-edge trend of the scholarly knowledge digitalization impetus which aim to convert the long-standing document-based format of scholarly content into knowledge graphs (KG). To this end, our selected data domain of bioassays are a prime candidate for structuring into KGs.

AB - As a novel contribution to the problem of semantifying bio- logical assays, in this paper, we propose a neural-network-based approach to automatically semantify, thereby structure, unstructured bioassay text descriptions. Experimental evaluations, to this end, show promise as the neural-based semantification significantly outperforms a naive frequencybased baseline approach. Specifically, the neural method attains 72% F1 versus 47% F1 from the frequency-based method. The work in this paper aligns with the present cutting-edge trend of the scholarly knowledge digitalization impetus which aim to convert the long-standing document-based format of scholarly content into knowledge graphs (KG). To this end, our selected data domain of bioassays are a prime candidate for structuring into KGs.

KW - Bioassays

KW - Machine Learning

KW - Open Science Graphs

UR - http://www.scopus.com/inward/record.url?scp=85097291239&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2009.08801

DO - 10.48550/arXiv.2009.08801

M3 - Conference contribution

AN - SCOPUS:85097291239

T3 - CEUR Workshop Proceedings

SP - 22

EP - 30

BT - Posters and Demonstrations at EKAW 2020

T2 - 22nd International Conference on Knowledge Engineering and Knowledge Management - Posters and Demonstrations Session, EKAW-PD 2020

Y2 - 16 September 2020 through 18 September 2020

ER -

By the same author(s)