Evaluation of sparse linear algebra operations in Trilinos

Publication: Contribution to book/report/anthology/conference proceedings › Conference contribution › Research › Peer-reviewed

Authors

Mohammad Siahatgar, Gabriele Von Voigt

Details

Original language: English
Title of host publication: ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering
Editors: G. Stefanou, V. Papadopoulos, V. Plevris, M. Papadrakakis
Pages: 1381-1391
Number of pages: 11
ISBN (electronic): 9786188284401
Publication status: Published - 2016
Event: 7th European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS Congress 2016 - Crete, Greece
Duration: 5 June 2016 to 10 June 2016

Publication series

Name: ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering
Volume: 1

Abstract

The performance of numerous scientific libraries and applications depends heavily on the efficiency of sparse linear algebra operations. In this paper, we survey the performance of several parallel sparse vector and matrix kernels provided in the Trilinos framework on the supercomputer systems Cray XC30/40 and IBM Blue Gene/Q. The linear algebra operations in Trilinos are handled by one of two packages, Epetra or Tpetra. While the former is the most widely used, the latter is the target of future developments and supports larger-scale problems as well as shared-memory parallelism. We compare the results obtained from both packages, for the MPI-only as well as the hybrid solutions. The hybrid parallelism is managed by the Kokkos package, which aims for performance portability across different architectures. We report the efficiency of a single node of the system and demonstrate the scalability behavior of the benchmarks on up to 38,400 cores of the HLRN-III systems. Furthermore, for the Intel processors used in the Cray system, we present measurements of the energy consumption of the kernels and compare the Energy-to-Solution between different compilers and parallel programming paradigms. In addition, we discuss the effect on performance and energy consumption of linking the vendor-provided libraries compared to the user-compiled versions. These extensive comparisons, obtained on top-performing supercomputer systems, can serve users and developers as a starting point for determining an optimal development strategy.
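
This record does not list the individual kernels that were benchmarked. As an illustration of the kind of sparse matrix kernel the abstract refers to, the following is a minimal serial sketch of a sparse matrix-vector product in compressed sparse row (CSR) storage. It is written in plain Python for readability and is not Trilinos code; the function name `csr_spmv` and the small test matrix are assumptions made for this example only.

```python
from typing import List, Sequence


def csr_spmv(
    row_ptr: Sequence[int],
    col_idx: Sequence[int],
    values: Sequence[float],
    x: Sequence[float],
) -> List[float]:
    """Compute y = A @ x for a matrix A stored in CSR format (illustrative only)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for row in range(n_rows):
        acc = 0.0
        # The nonzeros of this row occupy values[row_ptr[row]:row_ptr[row + 1]].
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


# Hypothetical 3x3 tridiagonal matrix [[2, -1, 0], [-1, 2, -1], [0, -1, 2]].
row_ptr = [0, 2, 5, 7]
col_idx = [0, 1, 0, 1, 2, 1, 2]
values = [2.0, -1.0, -1.0, 2.0, -1.0, -1.0, 2.0]
y = csr_spmv(row_ptr, col_idx, values, [1.0, 1.0, 1.0])
```

Each row touches only its stored nonzeros and reads `x` through indirect indices, so kernels of this shape tend to be limited by memory bandwidth rather than arithmetic. A distributed-memory version would additionally exchange the off-process entries of `x` before the local loop.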

Cite this

Evaluation of sparse linear algebra operations in Trilinos. / Siahatgar, Mohammad; Von Voigt, Gabriele.
ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering. ed. / G. Stefanou; V. Papadopoulos; V. Plevris; M. Papadrakakis. 2016. p. 1381-1391 (ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering; Vol. 1).

Siahatgar, M & Von Voigt, G 2016, Evaluation of sparse linear algebra operations in Trilinos. in G Stefanou, V Papadopoulos, V Plevris & M Papadrakakis (eds), ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering. ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering, vol. 1, pp. 1381-1391, 7th European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS Congress 2016, Crete, Greece, 5 June 2016. https://doi.org/10.7712/100016.1893.11500
Siahatgar, M., & Von Voigt, G. (2016). Evaluation of sparse linear algebra operations in Trilinos. In G. Stefanou, V. Papadopoulos, V. Plevris, & M. Papadrakakis (Eds.), ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering (pp. 1381-1391). (ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering; Vol. 1). https://doi.org/10.7712/100016.1893.11500
Siahatgar M, Von Voigt G. Evaluation of sparse linear algebra operations in Trilinos. In Stefanou G, Papadopoulos V, Plevris V, Papadrakakis M, editors, ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering. 2016. p. 1381-1391. (ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering). doi: 10.7712/100016.1893.11500
Siahatgar, Mohammad ; Von Voigt, Gabriele. / Evaluation of sparse linear algebra operations in Trilinos. ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering. ed. / G. Stefanou ; V. Papadopoulos ; V. Plevris ; M. Papadrakakis. 2016. p. 1381-1391 (ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering).
@inproceedings{45aa3e9029f547d9bca4e4952fb90a0f,
title = "Evaluation of sparse linear algebra operations in Trilinos",
abstract = "The performance of numerous scientific libraries and applications depends heavily on the efficiency of sparse linear algebra operations. In this paper, we survey the performance of several parallel sparse vector and matrix kernels provided in the Trilinos framework on the supercomputer systems Cray XC30/40 and IBM Blue Gene/Q. The linear algebra operations in Trilinos are handled by one of two packages, Epetra or Tpetra. While the former is the most widely used, the latter is the target of future developments and supports larger-scale problems as well as shared-memory parallelism. We compare the results obtained from both packages, for the MPI-only as well as the hybrid solutions. The hybrid parallelism is managed by the Kokkos package, which aims for performance portability across different architectures. We report the efficiency of a single node of the system and demonstrate the scalability behavior of the benchmarks on up to 38,400 cores of the HLRN-III systems. Furthermore, for the Intel processors used in the Cray system, we present measurements of the energy consumption of the kernels and compare the Energy-to-Solution between different compilers and parallel programming paradigms. In addition, we discuss the effect on performance and energy consumption of linking the vendor-provided libraries compared to the user-compiled versions. These extensive comparisons, obtained on top-performing supercomputer systems, can serve users and developers as a starting point for determining an optimal development strategy.",
keywords = "Energy consumption, HPC, Performance evaluation, Sparse algebra, Trilinos",
author = "Mohammad Siahatgar and {Von Voigt}, Gabriele",
note = "Funding Information: This work has been funded by the European Research Council under the FP7 NUMEXAS project under grant agreement 611636. The authors gratefully acknowledge the Gauss Centre for Supercomputing (GCS) for providing computing time through the John von Neumann Institute for Computing (NIC) on the GCS share of the supercomputer JUQUEEN [16] at J{\"u}lich Supercomputing Centre (JSC). GCS is the alliance of the three national supercomputing centers HLRS (Universit{\"a}t Stuttgart), JSC (Forschungszentrum J{\"u}lich), and LRZ (Bayerische Akademie der Wissenschaften), funded by the German Federal Ministry of Education and Research (BMBF) and the German State Ministries for Research of Baden-W{\"u}rttemberg (MWK), Bayern (StMWFK) and Nordrhein-Westfalen (MIWF). The authors would like to thank the anonymous referee for the comments.; 7th European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS Congress 2016 ; Conference date: 05-06-2016 Through 10-06-2016",
year = "2016",
doi = "10.7712/100016.1893.11500",
language = "English",
series = "ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering",
pages = "1381--1391",
editor = "G. Stefanou and V. Papadopoulos and V. Plevris and M. Papadrakakis",
booktitle = "ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering",

}

TY - GEN

T1 - Evaluation of sparse linear algebra operations in Trilinos

AU - Siahatgar, Mohammad

AU - Von Voigt, Gabriele

N1 - Funding Information: This work has been funded by the European Research Council under the FP7 NUMEXAS project under grant agreement 611636. The authors gratefully acknowledge the Gauss Centre for Supercomputing (GCS) for providing computing time through the John von Neumann Institute for Computing (NIC) on the GCS share of the supercomputer JUQUEEN [16] at Jülich Supercomputing Centre (JSC). GCS is the alliance of the three national supercomputing centers HLRS (Universität Stuttgart), JSC (Forschungszentrum Jülich), and LRZ (Bayerische Akademie der Wissenschaften), funded by the German Federal Ministry of Education and Research (BMBF) and the German State Ministries for Research of Baden-Württemberg (MWK), Bayern (StMWFK) and Nordrhein-Westfalen (MIWF). The authors would like to thank the anonymous referee for the comments.

PY - 2016

Y1 - 2016

AB - The performance of numerous scientific libraries and applications depends heavily on the efficiency of sparse linear algebra operations. In this paper, we survey the performance of several parallel sparse vector and matrix kernels provided in the Trilinos framework on the supercomputer systems Cray XC30/40 and IBM Blue Gene/Q. The linear algebra operations in Trilinos are handled by one of two packages, Epetra or Tpetra. While the former is the most widely used, the latter is the target of future developments and supports larger-scale problems as well as shared-memory parallelism. We compare the results obtained from both packages, for the MPI-only as well as the hybrid solutions. The hybrid parallelism is managed by the Kokkos package, which aims for performance portability across different architectures. We report the efficiency of a single node of the system and demonstrate the scalability behavior of the benchmarks on up to 38,400 cores of the HLRN-III systems. Furthermore, for the Intel processors used in the Cray system, we present measurements of the energy consumption of the kernels and compare the Energy-to-Solution between different compilers and parallel programming paradigms. In addition, we discuss the effect on performance and energy consumption of linking the vendor-provided libraries compared to the user-compiled versions. These extensive comparisons, obtained on top-performing supercomputer systems, can serve users and developers as a starting point for determining an optimal development strategy.

KW - Energy consumption

KW - HPC

KW - Performance evaluation

KW - Sparse algebra

KW - Trilinos

UR - http://www.scopus.com/inward/record.url?scp=84995468920&partnerID=8YFLogxK

U2 - 10.7712/100016.1893.11500

DO - 10.7712/100016.1893.11500

M3 - Conference contribution

AN - SCOPUS:84995468920

T3 - ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering

SP - 1381

EP - 1391

BT - ECCOMAS Congress 2016 - Proceedings of the 7th European Congress on Computational Methods in Applied Sciences and Engineering

A2 - Stefanou, G.

A2 - Papadopoulos, V.

A2 - Plevris, V.

A2 - Papadrakakis, M.

T2 - 7th European Congress on Computational Methods in Applied Sciences and Engineering, ECCOMAS Congress 2016

Y2 - 5 June 2016 through 10 June 2016

ER -