FCA2VEC: Embedding Techniques for Formal Concept Analysis

Research output: Chapter in book/report/conference proceedingContribution to book/anthologyResearchpeer review

Authors

  • Dominik Dürrschnabel
  • Tom Hanika
  • Maximilian Stubbemann

Research Organisations

External Research Organisations

  • University of Kassel
View graph of relations

Details

Original languageEnglish
Title of host publicationComplex Data Analytics with Formal Concept Analysis
PublisherSpringer International Publishing AG
Pages47-74
Number of pages28
ISBN (electronic)9783030932787
ISBN (print)9783030932770
Publication statusPublished - 8 Dec 2021

Abstract

Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding ‘latent semantic analysis’ recent approaches like ‘word2vec’ or ‘node2vec’ are well established tools in this realm. In the present paper we add to this line of research by introducing ‘fca2vec’, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computationally feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.

Keywords

    Closed sets, Complex data, Covering relation, Formal concept analysis, Link prediction, Low dimensional embedding, Vector space embedding, Word2Vec

ASJC Scopus subject areas

Cite this

FCA2VEC: Embedding Techniques for Formal Concept Analysis. / Dürrschnabel, Dominik; Hanika, Tom; Stubbemann, Maximilian.
Complex Data Analytics with Formal Concept Analysis. Springer International Publishing AG, 2021. p. 47-74.

Research output: Chapter in book/report/conference proceedingContribution to book/anthologyResearchpeer review

Dürrschnabel, D, Hanika, T & Stubbemann, M 2021, FCA2VEC: Embedding Techniques for Formal Concept Analysis. in Complex Data Analytics with Formal Concept Analysis. Springer International Publishing AG, pp. 47-74. https://doi.org/10.48550/arXiv.1911.11496, https://doi.org/10.1007/978-3-030-93278-7_3
Dürrschnabel, D., Hanika, T., & Stubbemann, M. (2021). FCA2VEC: Embedding Techniques for Formal Concept Analysis. In Complex Data Analytics with Formal Concept Analysis (pp. 47-74). Springer International Publishing AG. https://doi.org/10.48550/arXiv.1911.11496, https://doi.org/10.1007/978-3-030-93278-7_3
Dürrschnabel D, Hanika T, Stubbemann M. FCA2VEC: Embedding Techniques for Formal Concept Analysis. In Complex Data Analytics with Formal Concept Analysis. Springer International Publishing AG. 2021. p. 47-74 doi: 10.48550/arXiv.1911.11496, 10.1007/978-3-030-93278-7_3
Dürrschnabel, Dominik ; Hanika, Tom ; Stubbemann, Maximilian. / FCA2VEC : Embedding Techniques for Formal Concept Analysis. Complex Data Analytics with Formal Concept Analysis. Springer International Publishing AG, 2021. pp. 47-74
Download
@inbook{b425f537a91048b0a500e5ad481cd3a5,
title = "FCA2VEC: Embedding Techniques for Formal Concept Analysis",
abstract = "Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding {\textquoteleft}latent semantic analysis{\textquoteright} recent approaches like {\textquoteleft}word2vec{\textquoteright} or {\textquoteleft}node2vec{\textquoteright} are well established tools in this realm. In the present paper we add to this line of research by introducing {\textquoteleft}fca2vec{\textquoteright}, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computationally feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.",
keywords = "Closed sets, Complex data, Covering relation, Formal concept analysis, Link prediction, Low dimensional embedding, Vector space embedding, Word2Vec",
author = "Dominik D{\"u}rrschnabel and Tom Hanika and Maximilian Stubbemann",
note = "This work is partially funded by the German Federal Ministry of Education and Research (BMBF) in its program “Quantitative Wissenschaftsforschung” as part of the REGIO project under grant 01PU17012, and in its program “Forschung zu den Karrierebedingungen und Karriereentwicklungen des Wissenschaftlichen Nachwuchses (FoWiN)” under grant 16FWN016.",
year = "2021",
month = dec,
day = "8",
doi = "10.48550/arXiv.1911.11496",
language = "English",
isbn = "9783030932770",
pages = "47--74",
booktitle = "Complex Data Analytics with Formal Concept Analysis",
publisher = "Springer International Publishing AG",
address = "Switzerland",

}

Download

TY - CHAP

T1 - FCA2VEC

T2 - Embedding Techniques for Formal Concept Analysis

AU - Dürrschnabel, Dominik

AU - Hanika, Tom

AU - Stubbemann, Maximilian

N1 - This work is partially funded by the German Federal Ministry of Education and Research (BMBF) in its program “Quantitative Wissenschaftsforschung” as part of the REGIO project under grant 01PU17012, and in its program “Forschung zu den Karrierebedingungen und Karriereentwicklungen des Wissenschaftlichen Nachwuchses (FoWiN)” under grant 16FWN016.

PY - 2021/12/8

Y1 - 2021/12/8

N2 - Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding ‘latent semantic analysis’ recent approaches like ‘word2vec’ or ‘node2vec’ are well established tools in this realm. In the present paper we add to this line of research by introducing ‘fca2vec’, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computationally feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.

AB - Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding ‘latent semantic analysis’ recent approaches like ‘word2vec’ or ‘node2vec’ are well established tools in this realm. In the present paper we add to this line of research by introducing ‘fca2vec’, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computationally feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.

KW - Closed sets

KW - Complex data

KW - Covering relation

KW - Formal concept analysis

KW - Link prediction

KW - Low dimensional embedding

KW - Vector space embedding

KW - Word2Vec

UR - http://www.scopus.com/inward/record.url?scp=85160476406&partnerID=8YFLogxK

U2 - 10.48550/arXiv.1911.11496

DO - 10.48550/arXiv.1911.11496

M3 - Contribution to book/anthology

AN - SCOPUS:85160476406

SN - 9783030932770

SP - 47

EP - 74

BT - Complex Data Analytics with Formal Concept Analysis

PB - Springer International Publishing AG

ER -