Text mining in career studies: Generating insights from unstructured textual data1

Research output: Chapter in book/report/conference proceedingContribution to book/anthologyResearchpeer review

Authors

  • Vladimer B. Kobayashi
  • Stefan T. Mol
  • Jarno Vrolijk
  • Gábor Kismihók

External Research Organisations

  • University of Amsterdam
  • University of the Philippines
  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationHandbook of Research Methods in Careers
PublisherEdward Elgar Publishing Ltd.
Pages139-163
Number of pages25
ISBN (electronic)9781788976725
ISBN (print)9781788976718
Publication statusPublished - 8 Jun 2021
Externally publishedYes

Abstract

The abundance of unstructured data, primarily consisting of text, coupled with the development of modern techniques to analyze them, offer new opportunities for theory generation, theory testing, and contextualization within the field of career studies. This chapter explains the process of text mining as it may be applied to the study of careers, starting with text preprocessing and ending with the application of analytical techniques. Both conventional techniques such as latent semantic and principal components analysis, and more novel text analytical techniques such as word embeddings are discussed with the hope of inspiring careers researchers to further embrace text analysis as a means to answer novel research questions. Furthermore, this chapter elaborates on how to evaluate the validity of information extracted from text. In order to illustrate the text analytical process, we provide an example that leverages an abundant and growing source of data that may be of interest to career researchers, namely, vacancy data. In the example, vacancy data are used to identify those knowledge, skills and abilities that (co)determine the salary associated with particular occupations. We end the chapter by providing a guide on how text mining can be applied to study existing and new career concepts.

ASJC Scopus subject areas

Cite this

Text mining in career studies: Generating insights from unstructured textual data1. / Kobayashi, Vladimer B.; Mol, Stefan T.; Vrolijk, Jarno et al.
Handbook of Research Methods in Careers. Edward Elgar Publishing Ltd., 2021. p. 139-163.

Research output: Chapter in book/report/conference proceedingContribution to book/anthologyResearchpeer review

Kobayashi, VB, Mol, ST, Vrolijk, J & Kismihók, G 2021, Text mining in career studies: Generating insights from unstructured textual data1. in Handbook of Research Methods in Careers. Edward Elgar Publishing Ltd., pp. 139-163. https://doi.org/10.4337/9781788976725.00015
Kobayashi, V. B., Mol, S. T., Vrolijk, J., & Kismihók, G. (2021). Text mining in career studies: Generating insights from unstructured textual data1. In Handbook of Research Methods in Careers (pp. 139-163). Edward Elgar Publishing Ltd.. https://doi.org/10.4337/9781788976725.00015
Kobayashi VB, Mol ST, Vrolijk J, Kismihók G. Text mining in career studies: Generating insights from unstructured textual data1. In Handbook of Research Methods in Careers. Edward Elgar Publishing Ltd. 2021. p. 139-163 Epub 2021 Jan 1. doi: 10.4337/9781788976725.00015
Kobayashi, Vladimer B. ; Mol, Stefan T. ; Vrolijk, Jarno et al. / Text mining in career studies : Generating insights from unstructured textual data1. Handbook of Research Methods in Careers. Edward Elgar Publishing Ltd., 2021. pp. 139-163
Download
@inbook{22f3f5be3f884fc9a7cdc3844217e1ae,
title = "Text mining in career studies: Generating insights from unstructured textual data1",
abstract = "The abundance of unstructured data, primarily consisting of text, coupled with the development of modern techniques to analyze them, offer new opportunities for theory generation, theory testing, and contextualization within the field of career studies. This chapter explains the process of text mining as it may be applied to the study of careers, starting with text preprocessing and ending with the application of analytical techniques. Both conventional techniques such as latent semantic and principal components analysis, and more novel text analytical techniques such as word embeddings are discussed with the hope of inspiring careers researchers to further embrace text analysis as a means to answer novel research questions. Furthermore, this chapter elaborates on how to evaluate the validity of information extracted from text. In order to illustrate the text analytical process, we provide an example that leverages an abundant and growing source of data that may be of interest to career researchers, namely, vacancy data. In the example, vacancy data are used to identify those knowledge, skills and abilities that (co)determine the salary associated with particular occupations. We end the chapter by providing a guide on how text mining can be applied to study existing and new career concepts.",
author = "Kobayashi, {Vladimer B.} and Mol, {Stefan T.} and Jarno Vrolijk and G{\'a}bor Kismih{\'o}k",
year = "2021",
month = jun,
day = "8",
doi = "10.4337/9781788976725.00015",
language = "English",
isbn = "9781788976718",
pages = "139--163",
booktitle = "Handbook of Research Methods in Careers",
publisher = "Edward Elgar Publishing Ltd.",
address = "United Kingdom (UK)",

}

Download

TY - CHAP

T1 - Text mining in career studies

T2 - Generating insights from unstructured textual data1

AU - Kobayashi, Vladimer B.

AU - Mol, Stefan T.

AU - Vrolijk, Jarno

AU - Kismihók, Gábor

PY - 2021/6/8

Y1 - 2021/6/8

N2 - The abundance of unstructured data, primarily consisting of text, coupled with the development of modern techniques to analyze them, offer new opportunities for theory generation, theory testing, and contextualization within the field of career studies. This chapter explains the process of text mining as it may be applied to the study of careers, starting with text preprocessing and ending with the application of analytical techniques. Both conventional techniques such as latent semantic and principal components analysis, and more novel text analytical techniques such as word embeddings are discussed with the hope of inspiring careers researchers to further embrace text analysis as a means to answer novel research questions. Furthermore, this chapter elaborates on how to evaluate the validity of information extracted from text. In order to illustrate the text analytical process, we provide an example that leverages an abundant and growing source of data that may be of interest to career researchers, namely, vacancy data. In the example, vacancy data are used to identify those knowledge, skills and abilities that (co)determine the salary associated with particular occupations. We end the chapter by providing a guide on how text mining can be applied to study existing and new career concepts.

AB - The abundance of unstructured data, primarily consisting of text, coupled with the development of modern techniques to analyze them, offer new opportunities for theory generation, theory testing, and contextualization within the field of career studies. This chapter explains the process of text mining as it may be applied to the study of careers, starting with text preprocessing and ending with the application of analytical techniques. Both conventional techniques such as latent semantic and principal components analysis, and more novel text analytical techniques such as word embeddings are discussed with the hope of inspiring careers researchers to further embrace text analysis as a means to answer novel research questions. Furthermore, this chapter elaborates on how to evaluate the validity of information extracted from text. In order to illustrate the text analytical process, we provide an example that leverages an abundant and growing source of data that may be of interest to career researchers, namely, vacancy data. In the example, vacancy data are used to identify those knowledge, skills and abilities that (co)determine the salary associated with particular occupations. We end the chapter by providing a guide on how text mining can be applied to study existing and new career concepts.

UR - http://www.scopus.com/inward/record.url?scp=85129419305&partnerID=8YFLogxK

U2 - 10.4337/9781788976725.00015

DO - 10.4337/9781788976725.00015

M3 - Contribution to book/anthology

AN - SCOPUS:85129419305

SN - 9781788976718

SP - 139

EP - 163

BT - Handbook of Research Methods in Careers

PB - Edward Elgar Publishing Ltd.

ER -