Non-Asymptotic Delay Bounds for Multi-Server Systems with Synchronization Constraints

Research output: Contribution to journalArticleResearchpeer review

Authors

External Research Organisations

  • Norwegian University of Science and Technology (NTNU)
View graph of relations

Details

Original languageEnglish
Pages (from-to)1545-1559
Number of pages15
JournalIEEE Transactions on Parallel and Distributed Systems
Volume29
Issue number7
Early online date1 Jan 2018
Publication statusPublished - 1 Jul 2018

Abstract

Parallel computing has become a standard tool with architectures such as Google MapReduce, Hadoop, and Spark being broadly used in applications such as data processing and machine learning. Common to these systems are a fork operation, where jobs are first divided into tasks that are processed in parallel, and a join operation where completed tasks wait for the other tasks of the job before leaving the system. The synchronization constraint of the join operation makes the analysis of fork-join systems challenging, and few explicit results are known. In this work, we formulate a max-plus server model for parallel systems which allows us to derive performance bounds for a variety of systems in the GI GI and G G cases. We contribute end-to-end delay bounds for multi-stage fork-join networks. We perform a detailed comparison of different multi-server configurations, including an analysis of single-queue fork-join systems that achieve a fundamental performance gain. We compare these results to both simulation and a live Spark system.

Keywords

    Hadoop, MapReduce, Parallel computing, performance analysis, spark, stochastic network calculus

ASJC Scopus subject areas

Cite this

Non-Asymptotic Delay Bounds for Multi-Server Systems with Synchronization Constraints. / Fidler, Markus; Walker, Brenton; Jiang, Yuming.
In: IEEE Transactions on Parallel and Distributed Systems, Vol. 29, No. 7, 01.07.2018, p. 1545-1559.

Research output: Contribution to journalArticleResearchpeer review

Fidler M, Walker B, Jiang Y. Non-Asymptotic Delay Bounds for Multi-Server Systems with Synchronization Constraints. IEEE Transactions on Parallel and Distributed Systems. 2018 Jul 1;29(7):1545-1559. Epub 2018 Jan 1. doi: 10.48550/arXiv.1610.06309, 10.1109/TPDS.2017.2779872
Download
@article{36efc69ae3a14ed69c29bf6a48c37984,
title = "Non-Asymptotic Delay Bounds for Multi-Server Systems with Synchronization Constraints",
abstract = "Parallel computing has become a standard tool with architectures such as Google MapReduce, Hadoop, and Spark being broadly used in applications such as data processing and machine learning. Common to these systems are a fork operation, where jobs are first divided into tasks that are processed in parallel, and a join operation where completed tasks wait for the other tasks of the job before leaving the system. The synchronization constraint of the join operation makes the analysis of fork-join systems challenging, and few explicit results are known. In this work, we formulate a max-plus server model for parallel systems which allows us to derive performance bounds for a variety of systems in the GI GI and G G cases. We contribute end-to-end delay bounds for multi-stage fork-join networks. We perform a detailed comparison of different multi-server configurations, including an analysis of single-queue fork-join systems that achieve a fundamental performance gain. We compare these results to both simulation and a live Spark system.",
keywords = "Hadoop, MapReduce, Parallel computing, performance analysis, spark, stochastic network calculus",
author = "Markus Fidler and Brenton Walker and Yuming Jiang",
note = "Funding information: This work was supported in part by the European Research Council (ERC) under StG 306644.",
year = "2018",
month = jul,
day = "1",
doi = "10.48550/arXiv.1610.06309",
language = "English",
volume = "29",
pages = "1545--1559",
journal = "IEEE Transactions on Parallel and Distributed Systems",
issn = "1045-9219",
publisher = "IEEE Computer Society",
number = "7",

}

Download

TY - JOUR

T1 - Non-Asymptotic Delay Bounds for Multi-Server Systems with Synchronization Constraints

AU - Fidler, Markus

AU - Walker, Brenton

AU - Jiang, Yuming

N1 - Funding information: This work was supported in part by the European Research Council (ERC) under StG 306644.

PY - 2018/7/1

Y1 - 2018/7/1

N2 - Parallel computing has become a standard tool with architectures such as Google MapReduce, Hadoop, and Spark being broadly used in applications such as data processing and machine learning. Common to these systems are a fork operation, where jobs are first divided into tasks that are processed in parallel, and a join operation where completed tasks wait for the other tasks of the job before leaving the system. The synchronization constraint of the join operation makes the analysis of fork-join systems challenging, and few explicit results are known. In this work, we formulate a max-plus server model for parallel systems which allows us to derive performance bounds for a variety of systems in the GI GI and G G cases. We contribute end-to-end delay bounds for multi-stage fork-join networks. We perform a detailed comparison of different multi-server configurations, including an analysis of single-queue fork-join systems that achieve a fundamental performance gain. We compare these results to both simulation and a live Spark system.

AB - Parallel computing has become a standard tool with architectures such as Google MapReduce, Hadoop, and Spark being broadly used in applications such as data processing and machine learning. Common to these systems are a fork operation, where jobs are first divided into tasks that are processed in parallel, and a join operation where completed tasks wait for the other tasks of the job before leaving the system. The synchronization constraint of the join operation makes the analysis of fork-join systems challenging, and few explicit results are known. In this work, we formulate a max-plus server model for parallel systems which allows us to derive performance bounds for a variety of systems in the GI GI and G G cases. We contribute end-to-end delay bounds for multi-stage fork-join networks. We perform a detailed comparison of different multi-server configurations, including an analysis of single-queue fork-join systems that achieve a fundamental performance gain. We compare these results to both simulation and a live Spark system.

KW - Hadoop

KW - MapReduce

KW - Parallel computing

KW - performance analysis

KW - spark

KW - stochastic network calculus

UR - http://www.scopus.com/inward/record.url?scp=85040042465&partnerID=8YFLogxK

U2 - 10.48550/arXiv.1610.06309

DO - 10.48550/arXiv.1610.06309

M3 - Article

AN - SCOPUS:85040042465

VL - 29

SP - 1545

EP - 1559

JO - IEEE Transactions on Parallel and Distributed Systems

JF - IEEE Transactions on Parallel and Distributed Systems

SN - 1045-9219

IS - 7

ER -

By the same author(s)