Drift-Aware Multi-Memory Model for Imbalanced Data Streams

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Amir Abolfazli
  • Eirini Ntoutsi

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publication2020 IEEE International Conference on Big Data (Big Data)
EditorsXintao Wu, Chris Jermaine, Li Xiong, Xiaohua Tony Hu, Olivera Kotevska, Siyuan Lu, Weijia Xu, Srinivas Aluru, Chengxiang Zhai, Eyhab Al-Masri, Zhiyuan Chen, Jeff Saltz
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages878-885
Number of pages8
ISBN (electronic)9781728162515
ISBN (print)9781728162522
Publication statusPublished - 2021
Event8th IEEE International Conference on Big Data, Big Data 2020 - Virtual, Atlanta, United States
Duration: 10 Dec 202013 Dec 2020

Abstract

Online class imbalance learning deals with data streams that are affected by both concept drift and class imbalance. Online learning tries to find a trade-off between exploiting previously learned information and incorporating new information into the model. This requires both the incremental update of the model and the ability to unlearn outdated information. The improper use of unlearning, however, can lead to the retroactive interference problem, a phenomenon that occurs when newly learned information interferes with the old information and impedes the recall of previously learned information. The problem becomes more severe when the classes are not equally represented, resulting in the removal of minority information from the model. In this work, we propose the Drift-Aware Multi-Memory Model (DAM3), which addresses the class imbalance problem in online learning for memory-based models. DAM3 mitigates class imbalance by incorporating an imbalance-sensitive drift detector, preserving a balanced representation of classes in the model, and resolving retroactive interference using a working memory that prevents the forgetting of old information. We show through experiments on real-world and synthetic datasets that the proposed method mitigates class imbalance and outperforms the state-of-the-art methods.

Keywords

    class imbalance, concept drift, multi-memory model, online learning, retroactive interference

ASJC Scopus subject areas

Cite this

Drift-Aware Multi-Memory Model for Imbalanced Data Streams. / Abolfazli, Amir; Ntoutsi, Eirini.
2020 IEEE International Conference on Big Data (Big Data). ed. / Xintao Wu; Chris Jermaine; Li Xiong; Xiaohua Tony Hu; Olivera Kotevska; Siyuan Lu; Weijia Xu; Srinivas Aluru; Chengxiang Zhai; Eyhab Al-Masri; Zhiyuan Chen; Jeff Saltz. Institute of Electrical and Electronics Engineers Inc., 2021. p. 878-885.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Abolfazli, A & Ntoutsi, E 2021, Drift-Aware Multi-Memory Model for Imbalanced Data Streams. in X Wu, C Jermaine, L Xiong, XT Hu, O Kotevska, S Lu, W Xu, S Aluru, C Zhai, E Al-Masri, Z Chen & J Saltz (eds), 2020 IEEE International Conference on Big Data (Big Data). Institute of Electrical and Electronics Engineers Inc., pp. 878-885, 8th IEEE International Conference on Big Data, Big Data 2020, Virtual, Atlanta, United States, 10 Dec 2020. https://doi.org/10.1109/BigData50022.2020.9378101
Abolfazli, A., & Ntoutsi, E. (2021). Drift-Aware Multi-Memory Model for Imbalanced Data Streams. In X. Wu, C. Jermaine, L. Xiong, X. T. Hu, O. Kotevska, S. Lu, W. Xu, S. Aluru, C. Zhai, E. Al-Masri, Z. Chen, & J. Saltz (Eds.), 2020 IEEE International Conference on Big Data (Big Data) (pp. 878-885). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BigData50022.2020.9378101
Abolfazli A, Ntoutsi E. Drift-Aware Multi-Memory Model for Imbalanced Data Streams. In Wu X, Jermaine C, Xiong L, Hu XT, Kotevska O, Lu S, Xu W, Aluru S, Zhai C, Al-Masri E, Chen Z, Saltz J, editors, 2020 IEEE International Conference on Big Data (Big Data). Institute of Electrical and Electronics Engineers Inc. 2021. p. 878-885 doi: 10.1109/BigData50022.2020.9378101
Abolfazli, Amir ; Ntoutsi, Eirini. / Drift-Aware Multi-Memory Model for Imbalanced Data Streams. 2020 IEEE International Conference on Big Data (Big Data). editor / Xintao Wu ; Chris Jermaine ; Li Xiong ; Xiaohua Tony Hu ; Olivera Kotevska ; Siyuan Lu ; Weijia Xu ; Srinivas Aluru ; Chengxiang Zhai ; Eyhab Al-Masri ; Zhiyuan Chen ; Jeff Saltz. Institute of Electrical and Electronics Engineers Inc., 2021. pp. 878-885
Download
@inproceedings{f8d23e16c3d141ccb6707e5ae7f4d473,
title = "Drift-Aware Multi-Memory Model for Imbalanced Data Streams",
abstract = "Online class imbalance learning deals with data streams that are affected by both concept drift and class imbalance. Online learning tries to find a trade-off between exploiting previously learned information and incorporating new information into the model. This requires both the incremental update of the model and the ability to unlearn outdated information. The improper use of unlearning, however, can lead to the retroactive interference problem, a phenomenon that occurs when newly learned information interferes with the old information and impedes the recall of previously learned information. The problem becomes more severe when the classes are not equally represented, resulting in the removal of minority information from the model. In this work, we propose the Drift-Aware Multi-Memory Model (DAM3), which addresses the class imbalance problem in online learning for memory-based models. DAM3 mitigates class imbalance by incorporating an imbalance-sensitive drift detector, preserving a balanced representation of classes in the model, and resolving retroactive interference using a working memory that prevents the forgetting of old information. We show through experiments on real-world and synthetic datasets that the proposed method mitigates class imbalance and outperforms the state-of-the-art methods.",
keywords = "class imbalance, concept drift, multi-memory model, online learning, retroactive interference",
author = "Amir Abolfazli and Eirini Ntoutsi",
note = "Funding Information: ACKNOWLEDGMENT The work of the first author was supported by the German Research Foundation (DFG) within the project OSCAR (Opinion Stream Classification with Ensembles and Active leaRners) and HEPHAESTUS (Machine learning methods for adaptive process planning of 5-axis milling), for both of which the second author is a principal investigator.; 8th IEEE International Conference on Big Data, Big Data 2020 ; Conference date: 10-12-2020 Through 13-12-2020",
year = "2021",
doi = "10.1109/BigData50022.2020.9378101",
language = "English",
isbn = "9781728162522",
pages = "878--885",
editor = "Xintao Wu and Chris Jermaine and Li Xiong and Hu, {Xiaohua Tony} and Olivera Kotevska and Siyuan Lu and Weijia Xu and Srinivas Aluru and Chengxiang Zhai and Eyhab Al-Masri and Zhiyuan Chen and Jeff Saltz",
booktitle = "2020 IEEE International Conference on Big Data (Big Data)",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

Download

TY - GEN

T1 - Drift-Aware Multi-Memory Model for Imbalanced Data Streams

AU - Abolfazli, Amir

AU - Ntoutsi, Eirini

N1 - Funding Information: ACKNOWLEDGMENT The work of the first author was supported by the German Research Foundation (DFG) within the project OSCAR (Opinion Stream Classification with Ensembles and Active leaRners) and HEPHAESTUS (Machine learning methods for adaptive process planning of 5-axis milling), for both of which the second author is a principal investigator.

PY - 2021

Y1 - 2021

N2 - Online class imbalance learning deals with data streams that are affected by both concept drift and class imbalance. Online learning tries to find a trade-off between exploiting previously learned information and incorporating new information into the model. This requires both the incremental update of the model and the ability to unlearn outdated information. The improper use of unlearning, however, can lead to the retroactive interference problem, a phenomenon that occurs when newly learned information interferes with the old information and impedes the recall of previously learned information. The problem becomes more severe when the classes are not equally represented, resulting in the removal of minority information from the model. In this work, we propose the Drift-Aware Multi-Memory Model (DAM3), which addresses the class imbalance problem in online learning for memory-based models. DAM3 mitigates class imbalance by incorporating an imbalance-sensitive drift detector, preserving a balanced representation of classes in the model, and resolving retroactive interference using a working memory that prevents the forgetting of old information. We show through experiments on real-world and synthetic datasets that the proposed method mitigates class imbalance and outperforms the state-of-the-art methods.

AB - Online class imbalance learning deals with data streams that are affected by both concept drift and class imbalance. Online learning tries to find a trade-off between exploiting previously learned information and incorporating new information into the model. This requires both the incremental update of the model and the ability to unlearn outdated information. The improper use of unlearning, however, can lead to the retroactive interference problem, a phenomenon that occurs when newly learned information interferes with the old information and impedes the recall of previously learned information. The problem becomes more severe when the classes are not equally represented, resulting in the removal of minority information from the model. In this work, we propose the Drift-Aware Multi-Memory Model (DAM3), which addresses the class imbalance problem in online learning for memory-based models. DAM3 mitigates class imbalance by incorporating an imbalance-sensitive drift detector, preserving a balanced representation of classes in the model, and resolving retroactive interference using a working memory that prevents the forgetting of old information. We show through experiments on real-world and synthetic datasets that the proposed method mitigates class imbalance and outperforms the state-of-the-art methods.

KW - class imbalance

KW - concept drift

KW - multi-memory model

KW - online learning

KW - retroactive interference

UR - http://www.scopus.com/inward/record.url?scp=85103823262&partnerID=8YFLogxK

U2 - 10.1109/BigData50022.2020.9378101

DO - 10.1109/BigData50022.2020.9378101

M3 - Conference contribution

AN - SCOPUS:85103823262

SN - 9781728162522

SP - 878

EP - 885

BT - 2020 IEEE International Conference on Big Data (Big Data)

A2 - Wu, Xintao

A2 - Jermaine, Chris

A2 - Xiong, Li

A2 - Hu, Xiaohua Tony

A2 - Kotevska, Olivera

A2 - Lu, Siyuan

A2 - Xu, Weijia

A2 - Aluru, Srinivas

A2 - Zhai, Chengxiang

A2 - Al-Masri, Eyhab

A2 - Chen, Zhiyuan

A2 - Saltz, Jeff

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 8th IEEE International Conference on Big Data, Big Data 2020

Y2 - 10 December 2020 through 13 December 2020

ER -