Semantic Annotation, Representation and Linking of Survey Data

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Felix Bensmann
  • Andrea Papenmeier
  • Dagmar Kern
  • Benjamin Zapilko
  • Stefan Dietze

Organisationseinheiten

Externe Organisationen

  • GESIS - Leibniz-Institut für Sozialwissenschaften
  • Universitätsklinikum Düsseldorf
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksSemantic Systems. In the Era of Knowledge Graphs
Untertitel16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings
Herausgeber/-innenEva Blomqvist, Paul Groth, Victor de Boer, Tassilo Pellegrini, Mehwish Alam, Tobias Käfer, Peter Kieseberg, Sabrina Kirrane, Albert Meroño-Peñuela, Harshvardhan J. Pandit
Herausgeber (Verlag)Springer Science and Business Media Deutschland GmbH
Seiten53-69
Seitenumfang17
ISBN (elektronisch)978-3-030-59833-4
ISBN (Print)9783030598327
PublikationsstatusVeröffentlicht - 2020
Veranstaltung16th International Conference on Semantic Systems, SEMANTiCS 2020 - Amsterdam, Niederlande
Dauer: 7 Sept. 202010 Sept. 2020

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band12378 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Abstract

Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.

ASJC Scopus Sachgebiete

Zitieren

Semantic Annotation, Representation and Linking of Survey Data. / Bensmann, Felix; Papenmeier, Andrea; Kern, Dagmar et al.
Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. Hrsg. / Eva Blomqvist; Paul Groth; Victor de Boer; Tassilo Pellegrini; Mehwish Alam; Tobias Käfer; Peter Kieseberg; Sabrina Kirrane; Albert Meroño-Peñuela; Harshvardhan J. Pandit. Springer Science and Business Media Deutschland GmbH, 2020. S. 53-69 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 12378 LNCS).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Bensmann, F, Papenmeier, A, Kern, D, Zapilko, B & Dietze, S 2020, Semantic Annotation, Representation and Linking of Survey Data. in E Blomqvist, P Groth, V de Boer, T Pellegrini, M Alam, T Käfer, P Kieseberg, S Kirrane, A Meroño-Peñuela & HJ Pandit (Hrsg.), Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Bd. 12378 LNCS, Springer Science and Business Media Deutschland GmbH, S. 53-69, 16th International Conference on Semantic Systems, SEMANTiCS 2020, Amsterdam, Niederlande, 7 Sept. 2020. https://doi.org/10.1007/978-3-030-59833-4_4
Bensmann, F., Papenmeier, A., Kern, D., Zapilko, B., & Dietze, S. (2020). Semantic Annotation, Representation and Linking of Survey Data. In E. Blomqvist, P. Groth, V. de Boer, T. Pellegrini, M. Alam, T. Käfer, P. Kieseberg, S. Kirrane, A. Meroño-Peñuela, & H. J. Pandit (Hrsg.), Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings (S. 53-69). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 12378 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-59833-4_4
Bensmann F, Papenmeier A, Kern D, Zapilko B, Dietze S. Semantic Annotation, Representation and Linking of Survey Data. in Blomqvist E, Groth P, de Boer V, Pellegrini T, Alam M, Käfer T, Kieseberg P, Kirrane S, Meroño-Peñuela A, Pandit HJ, Hrsg., Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. Springer Science and Business Media Deutschland GmbH. 2020. S. 53-69. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). Epub 2020 Okt 27. doi: 10.1007/978-3-030-59833-4_4
Bensmann, Felix ; Papenmeier, Andrea ; Kern, Dagmar et al. / Semantic Annotation, Representation and Linking of Survey Data. Semantic Systems. In the Era of Knowledge Graphs: 16th International Conference on Semantic Systems, SEMANTiCS 2020, Proceedings. Hrsg. / Eva Blomqvist ; Paul Groth ; Victor de Boer ; Tassilo Pellegrini ; Mehwish Alam ; Tobias Käfer ; Peter Kieseberg ; Sabrina Kirrane ; Albert Meroño-Peñuela ; Harshvardhan J. Pandit. Springer Science and Business Media Deutschland GmbH, 2020. S. 53-69 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{7374cc1a2590460eb24ee4527805c190,
title = "Semantic Annotation, Representation and Linking of Survey Data",
abstract = "Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.",
keywords = "Natural language processing, Question feature extraction, Semantic data modelling, Social sciences survey data",
author = "Felix Bensmann and Andrea Papenmeier and Dagmar Kern and Benjamin Zapilko and Stefan Dietze",
note = "Funding Information: Funding. This work was partly funded by the DFG, grant no. 388815326; the VACOS project at GESIS.; 16th International Conference on Semantic Systems, SEMANTiCS 2020 ; Conference date: 07-09-2020 Through 10-09-2020",
year = "2020",
doi = "10.1007/978-3-030-59833-4_4",
language = "English",
isbn = "9783030598327",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Science and Business Media Deutschland GmbH",
pages = "53--69",
editor = "Eva Blomqvist and Paul Groth and {de Boer}, Victor and Tassilo Pellegrini and Mehwish Alam and Tobias K{\"a}fer and Peter Kieseberg and Sabrina Kirrane and Albert Mero{\~n}o-Pe{\~n}uela and Pandit, {Harshvardhan J.}",
booktitle = "Semantic Systems. In the Era of Knowledge Graphs",
address = "Germany",

}

Download

TY - GEN

T1 - Semantic Annotation, Representation and Linking of Survey Data

AU - Bensmann, Felix

AU - Papenmeier, Andrea

AU - Kern, Dagmar

AU - Zapilko, Benjamin

AU - Dietze, Stefan

N1 - Funding Information: Funding. This work was partly funded by the DFG, grant no. 388815326; the VACOS project at GESIS.

PY - 2020

Y1 - 2020

N2 - Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.

AB - Semantic technologies offer significant potential for improving data search applications. Ongoing work thrives to equip data catalogs with new semantic search features to supplement existing keyword search and browsing capabilities. In particular within the social sciences, searching and reusing data is essential to foster efficient research. In this paper, we introduce an approach and experimental results aimed at improving interoperability and findability of social sciences survey items. Our contributions include a conceptual model for semantically representing survey items and questions, detailing meaningful dimensions of items, as well as experimental results geared towards the automated prediction of such item features using state-of-the-art machine learning models. Dimensions of interest include, for instance, references to geolocation and time periods or the scope and style of particular questions. We define classification tasks using neural and traditional machine learning models combined with sentence structure features. Applications of our work include semantic and faceted search for questions as part of our GESIS Search. We also provide the lifted data as a knowledge graph via a SPARQL endpoint for further reuse and sharing.

KW - Natural language processing

KW - Question feature extraction

KW - Semantic data modelling

KW - Social sciences survey data

UR - http://www.scopus.com/inward/record.url?scp=85096607361&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-59833-4_4

DO - 10.1007/978-3-030-59833-4_4

M3 - Conference contribution

AN - SCOPUS:85096607361

SN - 9783030598327

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 53

EP - 69

BT - Semantic Systems. In the Era of Knowledge Graphs

A2 - Blomqvist, Eva

A2 - Groth, Paul

A2 - de Boer, Victor

A2 - Pellegrini, Tassilo

A2 - Alam, Mehwish

A2 - Käfer, Tobias

A2 - Kieseberg, Peter

A2 - Kirrane, Sabrina

A2 - Meroño-Peñuela, Albert

A2 - Pandit, Harshvardhan J.

PB - Springer Science and Business Media Deutschland GmbH

T2 - 16th International Conference on Semantic Systems, SEMANTiCS 2020

Y2 - 7 September 2020 through 10 September 2020

ER -