A scalable, clustered SMT processor for digital signal processing

Mladen Berekovic; Sören Moch; Peter Pirsch

doi:10.1145/1152923.1024304

Details

Originalsprache	Englisch
Seiten	62-69
Seitenumfang	8
Publikationsstatus	Veröffentlicht - 27 Sept. 2003
Veranstaltung	2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03 - Antibes Juan-les-Pins, Frankreich Dauer: 29 Sept. 2004 → 3 Okt. 2004

Konferenz

Konferenz	2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03
Land/Gebiet	Frankreich
Ort	Antibes Juan-les-Pins
Zeitraum	29 Sept. 2004 → 3 Okt. 2004

Abstract

A scalable, distributed, processor architecture is presented that emphasizes on high performance computing for digital signal processing applications by combining high frequency design techniques with a very high degree of parallel processing on a chip. The architecture is based on a superscalar processor model with a modified Tomasulo scheme [1], that was extended to eliminate all central control structures for the data flow and to support simultaneous instruction issue from multiple independent threads (SMT). Consequent application of fine clustering reduces the cycle-time for wire-sensitive building blocks of the processor like the register file or the instruction scheduler and leads to a distributed architecture model, where independent thread processing units, ALUs, registers files and memories are distributed across the chip and communicate with each other by special networks. The performance of the architecture is scalable with both the number of function units and the number of thread units without having any impact on the processors cycle-time.

ASJC Scopus Sachgebiete

Informatik (insg.)
Angewandte Informatik
Informatik (insg.)
Hardware und Architektur
Ingenieurwesen (insg.)
Elektrotechnik und Elektronik

Zitieren

A scalable, clustered SMT processor for digital signal processing. / Berekovic, Mladen; Moch, Sören; Pirsch, Peter.
2003. 62-69 Beitrag in 2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03, Antibes Juan-les-Pins, Frankreich.

Publikation: Konferenzbeitrag › Paper › Forschung › Peer-Review

Berekovic, M, Moch, S & Pirsch, P 2003, 'A scalable, clustered SMT processor for digital signal processing', Beitrag in 2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03, Antibes Juan-les-Pins, Frankreich, 29 Sept. 2004 - 3 Okt. 2004 S. 62-69. https://doi.org/10.1145/1152923.1024304

Berekovic, M., Moch, S., & Pirsch, P. (2003). A scalable, clustered SMT processor for digital signal processing. 62-69. Beitrag in 2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03, Antibes Juan-les-Pins, Frankreich. https://doi.org/10.1145/1152923.1024304

Berekovic M, Moch S, Pirsch P. A scalable, clustered SMT processor for digital signal processing. 2003. Beitrag in 2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03, Antibes Juan-les-Pins, Frankreich. doi: 10.1145/1152923.1024304

Berekovic, Mladen ; Moch, Sören ; Pirsch, Peter. / A scalable, clustered SMT processor for digital signal processing. Beitrag in 2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03, Antibes Juan-les-Pins, Frankreich.8 S.

Download

@conference{a9610254351940b5af85263f61e7571f,

title = "A scalable, clustered SMT processor for digital signal processing",

abstract = "A scalable, distributed, processor architecture is presented that emphasizes on high performance computing for digital signal processing applications by combining high frequency design techniques with a very high degree of parallel processing on a chip. The architecture is based on a superscalar processor model with a modified Tomasulo scheme [1], that was extended to eliminate all central control structures for the data flow and to support simultaneous instruction issue from multiple independent threads (SMT). Consequent application of fine clustering reduces the cycle-time for wire-sensitive building blocks of the processor like the register file or the instruction scheduler and leads to a distributed architecture model, where independent thread processing units, ALUs, registers files and memories are distributed across the chip and communicate with each other by special networks. The performance of the architecture is scalable with both the number of function units and the number of thread units without having any impact on the processors cycle-time.",

author = "Mladen Berekovic and S{\"o}ren Moch and Peter Pirsch",

year = "2003",

month = sep,

day = "27",

doi = "10.1145/1152923.1024304",

language = "English",

pages = "62--69",

note = "2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03 ; Conference date: 29-09-2004 Through 03-10-2004",

}

Download

TY - CONF

T1 - A scalable, clustered SMT processor for digital signal processing

AU - Berekovic, Mladen

AU - Moch, Sören

AU - Pirsch, Peter

PY - 2003/9/27

Y1 - 2003/9/27

N2 - A scalable, distributed, processor architecture is presented that emphasizes on high performance computing for digital signal processing applications by combining high frequency design techniques with a very high degree of parallel processing on a chip. The architecture is based on a superscalar processor model with a modified Tomasulo scheme [1], that was extended to eliminate all central control structures for the data flow and to support simultaneous instruction issue from multiple independent threads (SMT). Consequent application of fine clustering reduces the cycle-time for wire-sensitive building blocks of the processor like the register file or the instruction scheduler and leads to a distributed architecture model, where independent thread processing units, ALUs, registers files and memories are distributed across the chip and communicate with each other by special networks. The performance of the architecture is scalable with both the number of function units and the number of thread units without having any impact on the processors cycle-time.

AB - A scalable, distributed, processor architecture is presented that emphasizes on high performance computing for digital signal processing applications by combining high frequency design techniques with a very high degree of parallel processing on a chip. The architecture is based on a superscalar processor model with a modified Tomasulo scheme [1], that was extended to eliminate all central control structures for the data flow and to support simultaneous instruction issue from multiple independent threads (SMT). Consequent application of fine clustering reduces the cycle-time for wire-sensitive building blocks of the processor like the register file or the instruction scheduler and leads to a distributed architecture model, where independent thread processing units, ALUs, registers files and memories are distributed across the chip and communicate with each other by special networks. The performance of the architecture is scalable with both the number of function units and the number of thread units without having any impact on the processors cycle-time.

UR - http://www.scopus.com/inward/record.url?scp=77953574256&partnerID=8YFLogxK

U2 - 10.1145/1152923.1024304

DO - 10.1145/1152923.1024304

M3 - Paper

AN - SCOPUS:77953574256

SP - 62

EP - 69

T2 - 2003 Workshop on Memory Performance: Dealing with Applications, Systems and Architecture, MEDEA '03

Y2 - 29 September 2004 through 3 October 2004

ER -

Research@Leibniz University

A scalable, clustered SMT processor for digital signal processing

Autoren

Organisationseinheiten

Details

Konferenz

Abstract

ASJC Scopus Sachgebiete

Zitieren