Semantic Annotation, Representation and Linking of Survey Data

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Felix Bensmann
  • Andrea Papenmeier
  • Dagmar Kern
  • Benjamin Zapilko
  • Stefan Dietze

Research Organisations

External Research Organisations

  • GESIS - Leibniz Institute for the Social Sciences
  • University Hospital Düsseldorf
View graph of relations

Details

Original languageEnglish
Title of host publicationSemantic Systems. In the Era of Knowledge Graphs
Subtitle of host publication16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings
EditorsEva Blomqvist, Paul Groth, Victor de Boer, Tassilo Pellegrini, Mehwish Alam, Tobias Käfer, Peter Kieseberg, Sabrina Kirrane, Albert Meroño-Peñuela, Harshvardhan J. Pandit
PublisherSpringer Science and Business Media Deutschland GmbH
Pages53-69
Number of pages17
ISBN (electronic)978-3-030-59833-4
ISBN (print)9783030598327
Publication statusPublished - 2020
Event16th International Conference on Semantic Systems, SEMANTiCS 2020 - Amsterdam, Netherlands
Duration: 7 Sept 202010 Sept 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12378 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.

Keywords

    Natural language processing, Question feature extraction, Semantic data modelling, Social sciences survey data

ASJC Scopus subject areas

Cite this

Semantic Annotation, Representation and Linking of Survey Data. / Bensmann, Felix; Papenmeier, Andrea; Kern, Dagmar et al.
Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. ed. / Eva Blomqvist; Paul Groth; Victor de Boer; Tassilo Pellegrini; Mehwish Alam; Tobias Käfer; Peter Kieseberg; Sabrina Kirrane; Albert Meroño-Peñuela; Harshvardhan J. Pandit. Springer Science and Business Media Deutschland GmbH, 2020. p. 53-69 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12378 LNCS).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Bensmann, F, Papenmeier, A, Kern, D, Zapilko, B & Dietze, S 2020, Semantic Annotation, Representation and Linking of Survey Data. in E Blomqvist, P Groth, V de Boer, T Pellegrini, M Alam, T Käfer, P Kieseberg, S Kirrane, A Meroño-Peñuela & HJ Pandit (eds), Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12378 LNCS, Springer Science and Business Media Deutschland GmbH, pp. 53-69, 16th International Conference on Semantic Systems, SEMANTiCS 2020, Amsterdam, Netherlands, 7 Sept 2020. https://doi.org/10.1007/978-3-030-59833-4_4
Bensmann, F., Papenmeier, A., Kern, D., Zapilko, B., & Dietze, S. (2020). Semantic Annotation, Representation and Linking of Survey Data. In E. Blomqvist, P. Groth, V. de Boer, T. Pellegrini, M. Alam, T. Käfer, P. Kieseberg, S. Kirrane, A. Meroño-Peñuela, & H. J. Pandit (Eds.), Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings (pp. 53-69). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12378 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-59833-4_4
Bensmann F, Papenmeier A, Kern D, Zapilko B, Dietze S. Semantic Annotation, Representation and Linking of Survey Data. In Blomqvist E, Groth P, de Boer V, Pellegrini T, Alam M, Käfer T, Kieseberg P, Kirrane S, Meroño-Peñuela A, Pandit HJ, editors, Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. Springer Science and Business Media Deutschland GmbH. 2020. p. 53-69. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). Epub 2020 Oct 27. doi: 10.1007/978-3-030-59833-4_4
Bensmann, Felix ; Papenmeier, Andrea ; Kern, Dagmar et al. / Semantic Annotation, Representation and Linking of Survey Data. Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. editor / Eva Blomqvist ; Paul Groth ; Victor de Boer ; Tassilo Pellegrini ; Mehwish Alam ; Tobias Käfer ; Peter Kieseberg ; Sabrina Kirrane ; Albert Meroño-Peñuela ; Harshvardhan J. Pandit. Springer Science and Business Media Deutschland GmbH, 2020. pp. 53-69 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{7374cc1a2590460eb24ee4527805c190,
title = "Semantic Annotation, Representation and Linking of Survey Data",
abstract = "Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.",
keywords = "Natural language processing, Question feature extraction, Semantic data modelling, Social sciences survey data",
author = "Felix Bensmann and Andrea Papenmeier and Dagmar Kern and Benjamin Zapilko and Stefan Dietze",
note = "Funding Information: Funding. This work was partly funded by the DFG, grant no. 388815326; the VACOS project at GESIS.; 16th International Conference on Semantic Systems, SEMANTiCS 2020 ; Conference date: 07-09-2020 Through 10-09-2020",
year = "2020",
doi = "10.1007/978-3-030-59833-4_4",
language = "English",
isbn = "9783030598327",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "53--69",
editor = "Eva Blomqvist and Paul Groth and {de Boer}, Victor and Tassilo Pellegrini and Mehwish Alam and Tobias K{\"a}fer and Peter Kieseberg and Sabrina Kirrane and Albert Mero{\~n}o-Pe{\~n}uela and Pandit, {Harshvardhan J.}",
booktitle = "Semantic Systems. In the Era of Knowledge Graphs",
address = "Germany",

}

Download

TY - GEN

T1 - Semantic Annotation, Representation and Linking of Survey Data

AU - Bensmann, Felix

AU - Papenmeier, Andrea

AU - Kern, Dagmar

AU - Zapilko, Benjamin

AU - Dietze, Stefan

N1 - Funding Information: Funding. This work was partly funded by the DFG, grant no. 388815326; the VACOS project at GESIS.

PY - 2020

Y1 - 2020

N2 - Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.

AB - Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.

KW - Natural language processing

KW - Question feature extraction

KW - Semantic data modelling

KW - Social sciences survey data

UR - http://www.scopus.com/inward/record.url?scp=85096607361&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-59833-4_4

DO - 10.1007/978-3-030-59833-4_4

M3 - Conference contribution

AN - SCOPUS:85096607361

SN - 9783030598327

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 53

EP - 69

BT - Semantic Systems. In the Era of Knowledge Graphs

A2 - Blomqvist, Eva

A2 - Groth, Paul

A2 - de Boer, Victor

A2 - Pellegrini, Tassilo

A2 - Alam, Mehwish

A2 - Käfer, Tobias

A2 - Kieseberg, Peter

A2 - Kirrane, Sabrina

A2 - Meroño-Peñuela, Albert

A2 - Pandit, Harshvardhan J.

PB - Springer Science and Business Media Deutschland GmbH

T2 - 16th International Conference on Semantic Systems, SEMANTiCS 2020

Y2 - 7 September 2020 through 10 September 2020

ER -