Details
Original language | English |
---|---|
Title of host publication | 1997 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997 |
Editors | Wanlei Zhou, Andrzej Goscinski, Michael Hobbs |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 155-162 |
Number of pages | 8 |
ISBN (electronic) | 0780342291, 9780780342293 |
Publication status | Published - 1997 |
Event | 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997 - Melbourne, Australia Duration: 10 Dec 1997 → 12 Dec 1997 |
Publication series
Name | 1997 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997 |
---|
Abstract
Derived from a thorough analysis of a wide class of image processing algorithms' properties, a parallel RISC architecture has been developed. The architecture gains performance from data level parallelism as well as from instruction level parallelism. From the beginning of the concept phase, high-level programming capabilities have been one of the major design goals. Thus, there has been a steady interaction between the design of the software development toolkit-optimizing assembler and C++ compiler-and the architecture itself. The RISC-typical register files are one of the most critical elements as well concerning die size and clock frequency as the assembler's ability in VLIW scheduling. Running at 100 MHz (200 mm 2 , 0.35 μm CMOS) the processor reaches a sustained performance of more than 2 GOPS for a wide range of image processing algorithms.
ASJC Scopus subject areas
- Computer Science(all)
- Computer Networks and Communications
- Computer Science(all)
- Hardware and Architecture
- Computer Science(all)
- Signal Processing
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
1997 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997. ed. / Wanlei Zhou; Andrzej Goscinski; Michael Hobbs. Institute of Electrical and Electronics Engineers Inc., 1997. p. 155-162 651487 (1997 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - HiPAR-DSP
T2 - 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997
AU - Wittenburg, J. P.
AU - Ohmacht, M.
AU - Kneip, J.
AU - Hinrichs, W.
AU - Pirsch, P.
PY - 1997
Y1 - 1997
N2 - Derived from a thorough analysis of a wide class of image processing algorithms' properties, a parallel RISC architecture has been developed. The architecture gains performance from data level parallelism as well as from instruction level parallelism. From the beginning of the concept phase, high-level programming capabilities have been one of the major design goals. Thus, there has been a steady interaction between the design of the software development toolkit-optimizing assembler and C++ compiler-and the architecture itself. The RISC-typical register files are one of the most critical elements as well concerning die size and clock frequency as the assembler's ability in VLIW scheduling. Running at 100 MHz (200 mm 2 , 0.35 μm CMOS) the processor reaches a sustained performance of more than 2 GOPS for a wide range of image processing algorithms.
AB - Derived from a thorough analysis of a wide class of image processing algorithms' properties, a parallel RISC architecture has been developed. The architecture gains performance from data level parallelism as well as from instruction level parallelism. From the beginning of the concept phase, high-level programming capabilities have been one of the major design goals. Thus, there has been a steady interaction between the design of the software development toolkit-optimizing assembler and C++ compiler-and the architecture itself. The RISC-typical register files are one of the most critical elements as well concerning die size and clock frequency as the assembler's ability in VLIW scheduling. Running at 100 MHz (200 mm 2 , 0.35 μm CMOS) the processor reaches a sustained performance of more than 2 GOPS for a wide range of image processing algorithms.
UR - http://www.scopus.com/inward/record.url?scp=33645150188&partnerID=8YFLogxK
U2 - 10.1109/ICAPP.1997.651487
DO - 10.1109/ICAPP.1997.651487
M3 - Conference contribution
AN - SCOPUS:33645150188
T3 - 1997 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997
SP - 155
EP - 162
BT - 1997 3rd International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 1997
A2 - Zhou, Wanlei
A2 - Goscinski, Andrzej
A2 - Hobbs, Michael
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 10 December 1997 through 12 December 1997
ER -