Crowd Anatomy Beyond the Good and Bad: Behavioral Traces for Crowd Worker Modeling and Pre-selection

Research output: Contribution to journalArticleResearchpeer review

Authors

  • Ujwal Kumar Gadiraju
  • Gianluca Demartini
  • Ricardo Kawase
  • Stefan Dietze

Research Organisations

External Research Organisations

  • University of Queensland
  • eBay Inc.
View graph of relations

Details

Original languageEnglish
Pages (from-to)815-841
Number of pages27
JournalComputer Supported Cooperative Work: CSCW: An International Journal
Volume28
Issue number5
Early online date26 Jun 2018
Publication statusPublished - 1 Sept 2019

Abstract

The suitability of crowdsourcing to solve a variety of problems has been investigated widely. Yet, there is still a lack of understanding about the distinct behavior and performance of workers within microtasks. In this paper, we first introduce a fine-grained data-driven worker typology based on different dimensions and derived from behavioral traces of workers. Next, we propose and evaluate novel models of crowd worker behavior and show the benefits of behavior-based worker pre-selection using machine learning models. We also study the effect of task complexity on worker behavior. Finally, we evaluate our novel typology-based worker pre-selection method in image transcription and information finding tasks involving crowd workers completing 1,800 HITs. Our proposed method for worker pre-selection leads to a higher quality of results when compared to the standard practice of using qualification or pre-screening tests. For image transcription tasks our method resulted in an accuracy increase of nearly 7% over the baseline and of almost 10% in information finding tasks, without a significant difference in task completion time. Our findings have important implications for crowdsourcing systems where a worker’s behavioral type is unknown prior to participation in a task. We highlight the potential of leveraging worker types to identify and aid those workers who require further training to improve their performance. Having proposed a powerful automated mechanism to detect worker types, we reflect on promoting fairness, trust and transparency in microtask crowdsourcing platforms.

Keywords

    Behavioral traces, Crowdsourcing, Microtasks, Pre-screening, Pre-selection, Worker typology, Workers

ASJC Scopus subject areas

Cite this

Crowd Anatomy Beyond the Good and Bad: Behavioral Traces for Crowd Worker Modeling and Pre-selection. / Gadiraju, Ujwal Kumar; Demartini, Gianluca; Kawase, Ricardo et al.
In: Computer Supported Cooperative Work: CSCW: An International Journal, Vol. 28, No. 5, 01.09.2019, p. 815-841.

Research output: Contribution to journalArticleResearchpeer review

Gadiraju UK, Demartini G, Kawase R, Dietze S. Crowd Anatomy Beyond the Good and Bad: Behavioral Traces for Crowd Worker Modeling and Pre-selection. Computer Supported Cooperative Work: CSCW: An International Journal. 2019 Sept 1;28(5):815-841. Epub 2018 Jun 26. doi: 10.1007/s10606-018-9336-y
Gadiraju, Ujwal Kumar ; Demartini, Gianluca ; Kawase, Ricardo et al. / Crowd Anatomy Beyond the Good and Bad : Behavioral Traces for Crowd Worker Modeling and Pre-selection. In: Computer Supported Cooperative Work: CSCW: An International Journal. 2019 ; Vol. 28, No. 5. pp. 815-841.
Download
@article{e6355bdf4ea84aa284ce8f8718b06c59,
title = "Crowd Anatomy Beyond the Good and Bad: Behavioral Traces for Crowd Worker Modeling and Pre-selection",
abstract = "The suitability of crowdsourcing to solve a variety of problems has been investigated widely. Yet, there is still a lack of understanding about the distinct behavior and performance of workers within microtasks. In this paper, we first introduce a fine-grained data-driven worker typology based on different dimensions and derived from behavioral traces of workers. Next, we propose and evaluate novel models of crowd worker behavior and show the benefits of behavior-based worker pre-selection using machine learning models. We also study the effect of task complexity on worker behavior. Finally, we evaluate our novel typology-based worker pre-selection method in image transcription and information finding tasks involving crowd workers completing 1,800 HITs. Our proposed method for worker pre-selection leads to a higher quality of results when compared to the standard practice of using qualification or pre-screening tests. For image transcription tasks our method resulted in an accuracy increase of nearly 7% over the baseline and of almost 10% in information finding tasks, without a significant difference in task completion time. Our findings have important implications for crowdsourcing systems where a worker{\textquoteright}s behavioral type is unknown prior to participation in a task. We highlight the potential of leveraging worker types to identify and aid those workers who require further training to improve their performance. Having proposed a powerful automated mechanism to detect worker types, we reflect on promoting fairness, trust and transparency in microtask crowdsourcing platforms.",
keywords = "Behavioral traces, Crowdsourcing, Microtasks, Pre-screening, Pre-selection, Worker typology, Workers",
author = "Gadiraju, {Ujwal Kumar} and Gianluca Demartini and Ricardo Kawase and Stefan Dietze",
year = "2019",
month = sep,
day = "1",
doi = "10.1007/s10606-018-9336-y",
language = "English",
volume = "28",
pages = "815--841",
journal = "Computer Supported Cooperative Work: CSCW: An International Journal",
issn = "0925-9724",
publisher = "Springer Netherlands",
number = "5",

}

Download

TY - JOUR

T1 - Crowd Anatomy Beyond the Good and Bad

T2 - Behavioral Traces for Crowd Worker Modeling and Pre-selection

AU - Gadiraju, Ujwal Kumar

AU - Demartini, Gianluca

AU - Kawase, Ricardo

AU - Dietze, Stefan

PY - 2019/9/1

Y1 - 2019/9/1

N2 - The suitability of crowdsourcing to solve a variety of problems has been investigated widely. Yet, there is still a lack of understanding about the distinct behavior and performance of workers within microtasks. In this paper, we first introduce a fine-grained data-driven worker typology based on different dimensions and derived from behavioral traces of workers. Next, we propose and evaluate novel models of crowd worker behavior and show the benefits of behavior-based worker pre-selection using machine learning models. We also study the effect of task complexity on worker behavior. Finally, we evaluate our novel typology-based worker pre-selection method in image transcription and information finding tasks involving crowd workers completing 1,800 HITs. Our proposed method for worker pre-selection leads to a higher quality of results when compared to the standard practice of using qualification or pre-screening tests. For image transcription tasks our method resulted in an accuracy increase of nearly 7% over the baseline and of almost 10% in information finding tasks, without a significant difference in task completion time. Our findings have important implications for crowdsourcing systems where a worker’s behavioral type is unknown prior to participation in a task. We highlight the potential of leveraging worker types to identify and aid those workers who require further training to improve their performance. Having proposed a powerful automated mechanism to detect worker types, we reflect on promoting fairness, trust and transparency in microtask crowdsourcing platforms.

AB - The suitability of crowdsourcing to solve a variety of problems has been investigated widely. Yet, there is still a lack of understanding about the distinct behavior and performance of workers within microtasks. In this paper, we first introduce a fine-grained data-driven worker typology based on different dimensions and derived from behavioral traces of workers. Next, we propose and evaluate novel models of crowd worker behavior and show the benefits of behavior-based worker pre-selection using machine learning models. We also study the effect of task complexity on worker behavior. Finally, we evaluate our novel typology-based worker pre-selection method in image transcription and information finding tasks involving crowd workers completing 1,800 HITs. Our proposed method for worker pre-selection leads to a higher quality of results when compared to the standard practice of using qualification or pre-screening tests. For image transcription tasks our method resulted in an accuracy increase of nearly 7% over the baseline and of almost 10% in information finding tasks, without a significant difference in task completion time. Our findings have important implications for crowdsourcing systems where a worker’s behavioral type is unknown prior to participation in a task. We highlight the potential of leveraging worker types to identify and aid those workers who require further training to improve their performance. Having proposed a powerful automated mechanism to detect worker types, we reflect on promoting fairness, trust and transparency in microtask crowdsourcing platforms.

KW - Behavioral traces

KW - Crowdsourcing

KW - Microtasks

KW - Pre-screening

KW - Pre-selection

KW - Worker typology

KW - Workers

UR - http://www.scopus.com/inward/record.url?scp=85049077761&partnerID=8YFLogxK

U2 - 10.1007/s10606-018-9336-y

DO - 10.1007/s10606-018-9336-y

M3 - Article

AN - SCOPUS:85049077761

VL - 28

SP - 815

EP - 841

JO - Computer Supported Cooperative Work: CSCW: An International Journal

JF - Computer Supported Cooperative Work: CSCW: An International Journal

SN - 0925-9724

IS - 5

ER -