An evaluation of data stream clustering algorithms

Publikation: Beitrag in FachzeitschriftÜbersichtsarbeitForschungPeer-Review

Autoren

  • Stratos Mansalis
  • Eirini Ntoutsi
  • Nikos Pelekis
  • Yannis Theodoridis

Organisationseinheiten

Externe Organisationen

  • University of Piraeus
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Seiten (von - bis)167-187
Seitenumfang21
FachzeitschriftStatistical Analysis and Data Mining
Jahrgang11
Ausgabenummer4
Frühes Online-Datum25 Juni 2018
PublikationsstatusVeröffentlicht - 12 Juli 2018

Abstract

Data stream clustering is a hot research area due to the abundance of data streams collected nowadays and the need for understanding and acting upon such sort of data. Unsupervised learning (clustering) comprises one of the most popular data mining tasks for gaining insights into the data. Clustering is a challenging task, while clustering over data streams involves additional challenges such as the single pass constraint over the raw data and the need for fast response. Moreover, dealing with an infinite and fast changing data stream implies that the clustering model extracted upon such sort of data is also subject to evolution over time. Several stream clustering surveys exist already in the literature; however, they focus on a theoretical presentation of the surveyed algorithms. On the contrary, in this paper, we survey the state-of-the-art stream clustering algorithms and we evaluate their performance in different data sets and for different parameter settings.

ASJC Scopus Sachgebiete

Zitieren

An evaluation of data stream clustering algorithms. / Mansalis, Stratos; Ntoutsi, Eirini; Pelekis, Nikos et al.
in: Statistical Analysis and Data Mining, Jahrgang 11, Nr. 4, 12.07.2018, S. 167-187.

Publikation: Beitrag in FachzeitschriftÜbersichtsarbeitForschungPeer-Review

Mansalis, S, Ntoutsi, E, Pelekis, N & Theodoridis, Y 2018, 'An evaluation of data stream clustering algorithms', Statistical Analysis and Data Mining, Jg. 11, Nr. 4, S. 167-187. https://doi.org/10.1002/sam.11380
Mansalis, S., Ntoutsi, E., Pelekis, N., & Theodoridis, Y. (2018). An evaluation of data stream clustering algorithms. Statistical Analysis and Data Mining, 11(4), 167-187. https://doi.org/10.1002/sam.11380
Mansalis S, Ntoutsi E, Pelekis N, Theodoridis Y. An evaluation of data stream clustering algorithms. Statistical Analysis and Data Mining. 2018 Jul 12;11(4):167-187. Epub 2018 Jun 25. doi: 10.1002/sam.11380
Mansalis, Stratos ; Ntoutsi, Eirini ; Pelekis, Nikos et al. / An evaluation of data stream clustering algorithms. in: Statistical Analysis and Data Mining. 2018 ; Jahrgang 11, Nr. 4. S. 167-187.
Download
@article{81a8c798e85c436485ed1626770f6d1b,
title = "An evaluation of data stream clustering algorithms",
abstract = "Data stream clustering is a hot research area due to the abundance of data streams collected nowadays and the need for understanding and acting upon such sort of data. Unsupervised learning (clustering) comprises one of the most popular data mining tasks for gaining insights into the data. Clustering is a challenging task, while clustering over data streams involves additional challenges such as the single pass constraint over the raw data and the need for fast response. Moreover, dealing with an infinite and fast changing data stream implies that the clustering model extracted upon such sort of data is also subject to evolution over time. Several stream clustering surveys exist already in the literature; however, they focus on a theoretical presentation of the surveyed algorithms. On the contrary, in this paper, we survey the state-of-the-art stream clustering algorithms and we evaluate their performance in different data sets and for different parameter settings.",
keywords = "data stream clustering, data streams, evaluation, experimental, survey",
author = "Stratos Mansalis and Eirini Ntoutsi and Nikos Pelekis and Yannis Theodoridis",
note = "Publisher Copyright: {\textcopyright} 2018 Wiley Periodicals, Inc. Copyright: Copyright 2018 Elsevier B.V., All rights reserved.",
year = "2018",
month = jul,
day = "12",
doi = "10.1002/sam.11380",
language = "English",
volume = "11",
pages = "167--187",
journal = "Statistical Analysis and Data Mining",
issn = "1932-1864",
publisher = "John Wiley and Sons Inc.",
number = "4",

}

Download

TY - JOUR

T1 - An evaluation of data stream clustering algorithms

AU - Mansalis, Stratos

AU - Ntoutsi, Eirini

AU - Pelekis, Nikos

AU - Theodoridis, Yannis

N1 - Publisher Copyright: © 2018 Wiley Periodicals, Inc. Copyright: Copyright 2018 Elsevier B.V., All rights reserved.

PY - 2018/7/12

Y1 - 2018/7/12

N2 - Data stream clustering is a hot research area due to the abundance of data streams collected nowadays and the need for understanding and acting upon such sort of data. Unsupervised learning (clustering) comprises one of the most popular data mining tasks for gaining insights into the data. Clustering is a challenging task, while clustering over data streams involves additional challenges such as the single pass constraint over the raw data and the need for fast response. Moreover, dealing with an infinite and fast changing data stream implies that the clustering model extracted upon such sort of data is also subject to evolution over time. Several stream clustering surveys exist already in the literature; however, they focus on a theoretical presentation of the surveyed algorithms. On the contrary, in this paper, we survey the state-of-the-art stream clustering algorithms and we evaluate their performance in different data sets and for different parameter settings.

AB - Data stream clustering is a hot research area due to the abundance of data streams collected nowadays and the need for understanding and acting upon such sort of data. Unsupervised learning (clustering) comprises one of the most popular data mining tasks for gaining insights into the data. Clustering is a challenging task, while clustering over data streams involves additional challenges such as the single pass constraint over the raw data and the need for fast response. Moreover, dealing with an infinite and fast changing data stream implies that the clustering model extracted upon such sort of data is also subject to evolution over time. Several stream clustering surveys exist already in the literature; however, they focus on a theoretical presentation of the surveyed algorithms. On the contrary, in this paper, we survey the state-of-the-art stream clustering algorithms and we evaluate their performance in different data sets and for different parameter settings.

KW - data stream clustering

KW - data streams

KW - evaluation

KW - experimental

KW - survey

UR - http://www.scopus.com/inward/record.url?scp=85049771982&partnerID=8YFLogxK

U2 - 10.1002/sam.11380

DO - 10.1002/sam.11380

M3 - Review article

AN - SCOPUS:85049771982

VL - 11

SP - 167

EP - 187

JO - Statistical Analysis and Data Mining

JF - Statistical Analysis and Data Mining

SN - 1932-1864

IS - 4

ER -