Details
Originalsprache | Englisch |
---|---|
Titel des Sammelwerks | 2009 International Symposium on Systems, Architectures, Modeling, and Simulation |
Herausgeber (Verlag) | Institute of Electrical and Electronics Engineers Inc. |
Seiten | 125-132 |
Seitenumfang | 8 |
ISBN (Print) | 9781424445011 |
Publikationsstatus | Veröffentlicht - 16 Okt. 2009 |
Veranstaltung | 2009 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation, IC-SAMOS 2009 - Samos, Griechenland Dauer: 20 Juli 2009 → 23 Juli 2009 |
Abstract
This paper presents a synchronization framework for parallel computing heterogeneous processing elements, which are controlled by a RISC processor. The communication delay between RISC and processing elements is a key issue if the RISC is not closely attached to the processing elements. Recent synchronization approaches neglect communication delays or require low communication delays. This results in a low synchronization rate between RISC and PEs. In order to overcome this delay, a special hardware-based synchronization approach is proposed that reduces the communication overhead and increases the number of executable tasks per time unit. Further, it supports parallel execution of independent hardware tasks. The approach was evaluated for a modular coprocessor architecture containing several processing elements for image processing tasks. The coarse-grained parallel execution of independent tasks significantly improves the speed of an exemplary application for aerial image based vehicle detection on straight highways.
ASJC Scopus Sachgebiete
- Informatik (insg.)
- Theoretische Informatik und Mathematik
- Informatik (insg.)
- Angewandte Informatik
- Informatik (insg.)
- Hardware und Architektur
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
2009 International Symposium on Systems, Architectures, Modeling, and Simulation. Institute of Electrical and Electronics Engineers Inc., 2009. S. 125-132.
Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review
}
TY - GEN
T1 - Hardware-based synchronization framework for heterogeneous RISC/Coprocessor architectures
AU - Flatt, Holger
AU - Schmädecke, Ingo
AU - Kärgel, Michael
AU - Blume, Holger
AU - Pirsch, Peter
PY - 2009/10/16
Y1 - 2009/10/16
N2 - This paper presents a synchronization framework for parallel computing heterogeneous processing elements, which are controlled by a RISC processor. The communication delay between RISC and processing elements is a key issue if the RISC is not closely attached to the processing elements. Recent synchronization approaches neglect communication delays or require low communication delays. This results in a low synchronization rate between RISC and PEs. In order to overcome this delay, a special hardware-based synchronization approach is proposed that reduces the communication overhead and increases the number of executable tasks per time unit. Further, it supports parallel execution of independent hardware tasks. The approach was evaluated for a modular coprocessor architecture containing several processing elements for image processing tasks. The coarse-grained parallel execution of independent tasks significantly improves the speed of an exemplary application for aerial image based vehicle detection on straight highways.
AB - This paper presents a synchronization framework for parallel computing heterogeneous processing elements, which are controlled by a RISC processor. The communication delay between RISC and processing elements is a key issue if the RISC is not closely attached to the processing elements. Recent synchronization approaches neglect communication delays or require low communication delays. This results in a low synchronization rate between RISC and PEs. In order to overcome this delay, a special hardware-based synchronization approach is proposed that reduces the communication overhead and increases the number of executable tasks per time unit. Further, it supports parallel execution of independent hardware tasks. The approach was evaluated for a modular coprocessor architecture containing several processing elements for image processing tasks. The coarse-grained parallel execution of independent tasks significantly improves the speed of an exemplary application for aerial image based vehicle detection on straight highways.
UR - http://www.scopus.com/inward/record.url?scp=71949130682&partnerID=8YFLogxK
U2 - 10.1109/ICSAMOS.2009.5289223
DO - 10.1109/ICSAMOS.2009.5289223
M3 - Conference contribution
AN - SCOPUS:71949130682
SN - 9781424445011
SP - 125
EP - 132
BT - 2009 International Symposium on Systems, Architectures, Modeling, and Simulation
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2009 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation, IC-SAMOS 2009
Y2 - 20 July 2009 through 23 July 2009
ER -