Issue-Slot Based Predication Encoding Technique for VLIW Processors

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Lukas Gerlach
  • Fabian Stuckmann
  • Holger Blume
  • Guillermo Paya-Vaya

Externe Organisationen

  • Exzellenzcluster Hearing4all
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des Sammelwerks2020 9th International Conference on Modern Circuits and Systems Technologies
UntertitelMOCAST 2020
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
ISBN (elektronisch)9781728166872
PublikationsstatusVeröffentlicht - 2020
Veranstaltung9th International Conference on Modern Circuits and Systems Technologies, MOCAST 2020 - Bremen, Deutschland
Dauer: 7 Sept. 20209 Sept. 2020

Abstract

Predication is a well-known alternative to conditional branching. However, the implementation of predication is costly in terms of extending the instruction set of the processor architecture. In this paper, a predication encoding technique for VLIW processors is proposed. Instead of using additional bits in the instruction encoding, the assigned issue-slot of a conditionally executed instruction encodes the associated predicate register. The number of addressable predicate registers scales with the number of issue-slots. All predicate registers have only one read and write port and can be accessed in parallel. Compared to the related work, no additional instruction encoding bits for selecting a predicate register are required and the processor core area increases only by about 1% per predicate register set. With the proposed predication technique, the processing performance increases by up to 4.5% when using two instead of one predicate register for a digital filter case study with floating-point emulation operations. A second case study shows, that conditional execution with two predicate register in combination with loop unrolling and operation merging almost doubles the achieved parallel instructions per cycle for a bit-reversal permutation algorithm.

ASJC Scopus Sachgebiete

Zitieren

Issue-Slot Based Predication Encoding Technique for VLIW Processors. / Gerlach, Lukas; Stuckmann, Fabian; Blume, Holger et al.
2020 9th International Conference on Modern Circuits and Systems Technologies: MOCAST 2020. Institute of Electrical and Electronics Engineers Inc., 2020. 9200304.

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Gerlach, L, Stuckmann, F, Blume, H & Paya-Vaya, G 2020, Issue-Slot Based Predication Encoding Technique for VLIW Processors. in 2020 9th International Conference on Modern Circuits and Systems Technologies: MOCAST 2020., 9200304, Institute of Electrical and Electronics Engineers Inc., 9th International Conference on Modern Circuits and Systems Technologies, MOCAST 2020, Bremen, Deutschland, 7 Sept. 2020. https://doi.org/10.1109/mocast49295.2020.9200304
Gerlach, L., Stuckmann, F., Blume, H., & Paya-Vaya, G. (2020). Issue-Slot Based Predication Encoding Technique for VLIW Processors. In 2020 9th International Conference on Modern Circuits and Systems Technologies: MOCAST 2020 Artikel 9200304 Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/mocast49295.2020.9200304
Gerlach L, Stuckmann F, Blume H, Paya-Vaya G. Issue-Slot Based Predication Encoding Technique for VLIW Processors. in 2020 9th International Conference on Modern Circuits and Systems Technologies: MOCAST 2020. Institute of Electrical and Electronics Engineers Inc. 2020. 9200304 doi: 10.1109/mocast49295.2020.9200304
Gerlach, Lukas ; Stuckmann, Fabian ; Blume, Holger et al. / Issue-Slot Based Predication Encoding Technique for VLIW Processors. 2020 9th International Conference on Modern Circuits and Systems Technologies: MOCAST 2020. Institute of Electrical and Electronics Engineers Inc., 2020.
Download
@inproceedings{fb76b313c16041a5a609e4ca2a8316da,
title = "Issue-Slot Based Predication Encoding Technique for VLIW Processors",
abstract = "Predication is a well-known alternative to conditional branching. However, the implementation of predication is costly in terms of extending the instruction set of the processor architecture. In this paper, a predication encoding technique for VLIW processors is proposed. Instead of using additional bits in the instruction encoding, the assigned issue-slot of a conditionally executed instruction encodes the associated predicate register. The number of addressable predicate registers scales with the number of issue-slots. All predicate registers have only one read and write port and can be accessed in parallel. Compared to the related work, no additional instruction encoding bits for selecting a predicate register are required and the processor core area increases only by about 1% per predicate register set. With the proposed predication technique, the processing performance increases by up to 4.5% when using two instead of one predicate register for a digital filter case study with floating-point emulation operations. A second case study shows, that conditional execution with two predicate register in combination with loop unrolling and operation merging almost doubles the achieved parallel instructions per cycle for a bit-reversal permutation algorithm. ",
keywords = "conditional execution, predication, VLIW",
author = "Lukas Gerlach and Fabian Stuckmann and Holger Blume and Guillermo Paya-Vaya",
note = "Funding information: This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany{\textquoteright}s Excellence Strategy — EXC 2177/1 — Project ID 390895286.; 9th International Conference on Modern Circuits and Systems Technologies, MOCAST 2020 ; Conference date: 07-09-2020 Through 09-09-2020",
year = "2020",
doi = "10.1109/mocast49295.2020.9200304",
language = "English",
booktitle = "2020 9th International Conference on Modern Circuits and Systems Technologies",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

Download

TY - GEN

T1 - Issue-Slot Based Predication Encoding Technique for VLIW Processors

AU - Gerlach, Lukas

AU - Stuckmann, Fabian

AU - Blume, Holger

AU - Paya-Vaya, Guillermo

N1 - Funding information: This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy — EXC 2177/1 — Project ID 390895286.

PY - 2020

Y1 - 2020

N2 - Predication is a well-known alternative to conditional branching. However, the implementation of predication is costly in terms of extending the instruction set of the processor architecture. In this paper, a predication encoding technique for VLIW processors is proposed. Instead of using additional bits in the instruction encoding, the assigned issue-slot of a conditionally executed instruction encodes the associated predicate register. The number of addressable predicate registers scales with the number of issue-slots. All predicate registers have only one read and write port and can be accessed in parallel. Compared to the related work, no additional instruction encoding bits for selecting a predicate register are required and the processor core area increases only by about 1% per predicate register set. With the proposed predication technique, the processing performance increases by up to 4.5% when using two instead of one predicate register for a digital filter case study with floating-point emulation operations. A second case study shows, that conditional execution with two predicate register in combination with loop unrolling and operation merging almost doubles the achieved parallel instructions per cycle for a bit-reversal permutation algorithm.

AB - Predication is a well-known alternative to conditional branching. However, the implementation of predication is costly in terms of extending the instruction set of the processor architecture. In this paper, a predication encoding technique for VLIW processors is proposed. Instead of using additional bits in the instruction encoding, the assigned issue-slot of a conditionally executed instruction encodes the associated predicate register. The number of addressable predicate registers scales with the number of issue-slots. All predicate registers have only one read and write port and can be accessed in parallel. Compared to the related work, no additional instruction encoding bits for selecting a predicate register are required and the processor core area increases only by about 1% per predicate register set. With the proposed predication technique, the processing performance increases by up to 4.5% when using two instead of one predicate register for a digital filter case study with floating-point emulation operations. A second case study shows, that conditional execution with two predicate register in combination with loop unrolling and operation merging almost doubles the achieved parallel instructions per cycle for a bit-reversal permutation algorithm.

KW - conditional execution

KW - predication

KW - VLIW

UR - http://www.scopus.com/inward/record.url?scp=85093859311&partnerID=8YFLogxK

U2 - 10.1109/mocast49295.2020.9200304

DO - 10.1109/mocast49295.2020.9200304

M3 - Conference contribution

AN - SCOPUS:85093859311

BT - 2020 9th International Conference on Modern Circuits and Systems Technologies

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 9th International Conference on Modern Circuits and Systems Technologies, MOCAST 2020

Y2 - 7 September 2020 through 9 September 2020

ER -

Von denselben Autoren