SGD_Tucker: A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition

Publication: Contribution to journal › Article › Research › Peer review

Authors

  • Hao Li
  • Zixuan Li
  • Kenli Li
  • Jan Rellermeyer
  • Lydia Chen
  • Keqin Li

External organisations

  • National Supercomputing Center, Changsha
  • State University of New York (SUNY)
  • Delft University of Technology

Details

Original language: English
Article number: 9309187
Pages (from-to): 1828-1841
Number of pages: 14
Journal: IEEE Transactions on Parallel and Distributed Systems
Volume: 32
Issue number: 7
Publication status: Published - 1 Jan. 2021
Externally published: Yes

Abstract

Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the High-Order, High-Dimension, and Sparse Tensor (HOHDST). However, existing STD algorithms face the problem of intermediate variable explosion, which results from the fact that the formation of those variables, i.e., the Khatri-Rao product, the Kronecker product, and matrix-matrix multiplication, follows all the elements in the sparse tensor. This problem prevents a deep fusion of efficient computation and big data platforms. To overcome the bottleneck, a novel stochastic optimization strategy (SGD_Tucker) is proposed for STD which can automatically divide the high-dimension intermediate variables into small batches of intermediate matrices. Specifically, SGD_Tucker only follows randomly selected small samples rather than the whole set of elements, while maintaining the overall accuracy and convergence rate. In practice, SGD_Tucker features two distinct advancements over the state of the art. First, SGD_Tucker can prune the communication overhead for the core tensor in distributed settings. Second, the low data dependence of SGD_Tucker enables fine-grained parallelization, which allows SGD_Tucker to obtain lower computational overheads with the same accuracy. Experimental results show that SGD_Tucker runs at least 2X faster than the state of the art.
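For readers who want a concrete picture of the element-wise stochastic scheme the abstract alludes to, the following NumPy sketch updates the factor rows and the core tensor from one randomly sampled observed entry at a time, so no large Kronecker or Khatri-Rao intermediates are ever formed. It is a minimal illustration of the general idea only, not the authors' SGD_Tucker algorithm or its parallel implementation; the function name sgd_sparse_tucker and all parameters (lr, reg, epochs) are hypothetical choices for this sketch.

import string
import numpy as np

def sgd_sparse_tucker(coords, vals, dims, ranks,
                      lr=0.01, reg=0.01, epochs=20, seed=0):
    """Element-wise SGD for a sparse Tucker model (illustrative sketch).

    coords : (nnz, N) integer array with the indices of observed entries
    vals   : (nnz,) array with the observed values
    dims   : tensor dimensions, e.g. (I, J, K)
    ranks  : Tucker ranks,      e.g. (R1, R2, R3)
    """
    rng = np.random.default_rng(seed)
    N = len(dims)
    factors = [0.1 * rng.standard_normal((dims[n], ranks[n])) for n in range(N)]
    core = 0.1 * rng.standard_normal(ranks)
    modes = string.ascii_lowercase[:N]  # one einsum label per tensor mode

    for _ in range(epochs):
        # Each step touches one randomly selected observed entry, so the
        # intermediate quantities stay small (a few rank-sized vectors).
        for s in rng.permutation(len(vals)):
            idx = coords[s]
            rows = [factors[n][idx[n]] for n in range(N)]
            # Predicted entry: core tensor contracted with one factor row per mode.
            pred = np.einsum(modes + ',' + ','.join(modes) + '->', core, *rows)
            err = vals[s] - pred
            # Gradient step on the N factor rows this entry touches.
            for n in range(N):
                others = [rows[m] for m in range(N) if m != n]
                sub = (modes + ',' +
                       ','.join(c for m, c in enumerate(modes) if m != n) +
                       '->' + modes[n])
                grad_dir = np.einsum(sub, core, *others)
                factors[n][idx[n]] += lr * (err * grad_dir - reg * rows[n])
            # Rank-1 gradient step on the (small) core tensor.
            outer = rows[0]
            for r in rows[1:]:
                outer = np.multiply.outer(outer, r)
            core += lr * (err * outer - reg * core)
    return core, factors

if __name__ == "__main__":
    # Tiny synthetic 3-order example with 500 observed entries.
    rng = np.random.default_rng(1)
    dims, ranks, nnz = (30, 40, 50), (4, 4, 4), 500
    coords = np.stack([rng.integers(0, d, size=nnz) for d in dims], axis=1)
    vals = rng.standard_normal(nnz)
    core, factors = sgd_sparse_tucker(coords, vals, dims, ranks, epochs=10)
    print(core.shape, [f.shape for f in factors])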

ASJC Scopus subject areas

Cite this

SGD_Tucker: A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition. / Li, Hao; Li, Zixuan; Li, Kenli et al.
In: IEEE Transactions on Parallel and Distributed Systems, Vol. 32, No. 7, 9309187, 01.01.2021, p. 1828-1841.

Publication: Contribution to journal › Article › Research › Peer review

@article{71245e95977e44a4bf6d9711ec800b81,
title = "SGD_Tucker: A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition",
abstract = "Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the High-Order, High-Dimension, and Sparse Tensor (HOHDST). However, existing STD algorithms face the problem of intermediate variable explosion, which results from the fact that the formation of those variables, i.e., the Khatri-Rao product, the Kronecker product, and matrix-matrix multiplication, follows all the elements in the sparse tensor. This problem prevents a deep fusion of efficient computation and big data platforms. To overcome the bottleneck, a novel stochastic optimization strategy (SGD_Tucker) is proposed for STD which can automatically divide the high-dimension intermediate variables into small batches of intermediate matrices. Specifically, SGD_Tucker only follows randomly selected small samples rather than the whole set of elements, while maintaining the overall accuracy and convergence rate. In practice, SGD_Tucker features two distinct advancements over the state of the art. First, SGD_Tucker can prune the communication overhead for the core tensor in distributed settings. Second, the low data dependence of SGD_Tucker enables fine-grained parallelization, which allows SGD_Tucker to obtain lower computational overheads with the same accuracy. Experimental results show that SGD_Tucker runs at least 2X faster than the state of the art.",
keywords = "High-order, high-dimension and sparse tensor, low-rank representation learning, machine learning algorithm, parallel strategy, sparse tucker decomposition, stochastic optimization",
author = "Hao Li and Zixuan Li and Kenli Li and Jan Rellermeyer and Lydia Chen and Keqin Li",
note = "Publisher Copyright: {\textcopyright} 1990-2012 IEEE.",
year = "2021",
month = jan,
day = "1",
doi = "10.1109/TPDS.2020.3047460",
language = "English",
volume = "32",
pages = "1828--1841",
journal = "IEEE Transactions on Parallel and Distributed Systems",
issn = "1045-9219",
publisher = "IEEE Computer Society",
number = "7",

}


TY - JOUR

T1 - SGD_Tucker

T2 - A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition

AU - Li, Hao

AU - Li, Zixuan

AU - Li, Kenli

AU - Rellermeyer, Jan

AU - Chen, Lydia

AU - Li, Keqin

N1 - Publisher Copyright: © 1990-2012 IEEE.

PY - 2021/1/1

Y1 - 2021/1/1

N2 - Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the High-Order, High-Dimension, and Sparse Tensor (HOHDST). However, existing STD algorithms face the problem of intermediate variable explosion, which results from the fact that the formation of those variables, i.e., the Khatri-Rao product, the Kronecker product, and matrix-matrix multiplication, follows all the elements in the sparse tensor. This problem prevents a deep fusion of efficient computation and big data platforms. To overcome the bottleneck, a novel stochastic optimization strategy (SGD_Tucker) is proposed for STD which can automatically divide the high-dimension intermediate variables into small batches of intermediate matrices. Specifically, SGD_Tucker only follows randomly selected small samples rather than the whole set of elements, while maintaining the overall accuracy and convergence rate. In practice, SGD_Tucker features two distinct advancements over the state of the art. First, SGD_Tucker can prune the communication overhead for the core tensor in distributed settings. Second, the low data dependence of SGD_Tucker enables fine-grained parallelization, which allows SGD_Tucker to obtain lower computational overheads with the same accuracy. Experimental results show that SGD_Tucker runs at least 2X faster than the state of the art.

AB - Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the High-Order, High-Dimension, and Sparse Tensor (HOHDST). However, existing STD algorithms face the problem of intermediate variable explosion, which results from the fact that the formation of those variables, i.e., the Khatri-Rao product, the Kronecker product, and matrix-matrix multiplication, follows all the elements in the sparse tensor. This problem prevents a deep fusion of efficient computation and big data platforms. To overcome the bottleneck, a novel stochastic optimization strategy (SGD_Tucker) is proposed for STD which can automatically divide the high-dimension intermediate variables into small batches of intermediate matrices. Specifically, SGD_Tucker only follows randomly selected small samples rather than the whole set of elements, while maintaining the overall accuracy and convergence rate. In practice, SGD_Tucker features two distinct advancements over the state of the art. First, SGD_Tucker can prune the communication overhead for the core tensor in distributed settings. Second, the low data dependence of SGD_Tucker enables fine-grained parallelization, which allows SGD_Tucker to obtain lower computational overheads with the same accuracy. Experimental results show that SGD_Tucker runs at least 2X faster than the state of the art.

KW - High-order, high-dimension and sparse tensor

KW - low-rank representation learning

KW - machine learning algorithm

KW - parallel strategy

KW - sparse tucker decomposition

KW - stochastic optimization

UR - http://www.scopus.com/inward/record.url?scp=85098783291&partnerID=8YFLogxK

U2 - 10.1109/TPDS.2020.3047460

DO - 10.1109/TPDS.2020.3047460

M3 - Article

VL - 32

SP - 1828

EP - 1841

JO - IEEE Transactions on Parallel and Distributed Systems

JF - IEEE Transactions on Parallel and Distributed Systems

SN - 1045-9219

IS - 7

M1 - 9309187

ER -
