Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures

Oliver Renke; Christoph Riggers; Jens Karrenbauer; Holger Blume

doi:10.1109/asap61560.2024.00016

Details

Original language	English
Title of host publication	Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024
Pages	28-29
Number of pages	2
ISBN (electronic)	979-8-3503-4963-4
Publication status	Published - 2024

Publication series

Name	IEEE International Conference on Application-Specific Systems, Architectures, and Processors
ISSN (Print)	2160-0511
ISSN (electronic)	2160-052X

Abstract

The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mIoU score of 43.2%. A mIoU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mIoU score of 55.2% in 67.85 ms.

ASJC Scopus subject areas

Computer Science (all)
Hardware and Architecture
Computer Science (all)
Computer Networks and Communications

Sustainable Development Goals

SDG 7 – Affordable and Clean Energy

Cite this

Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. / Renke, Oliver; Riggers, Christoph; Karrenbauer, Jens et al.
Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. 2024. pp. 28-29 (IEEE International Conference on Application-Specific Systems, Architectures, and Processors).

Publication: Chapter in book/report/anthology/conference proceedings › Conference contribution › Research › Peer-reviewed

Renke, O, Riggers, C, Karrenbauer, J & Blume, H 2024, Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. in Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. IEEE International Conference on Application-Specific Systems, Architectures, and Processors, pp. 28-29. https://doi.org/10.1109/asap61560.2024.00016

Renke, O., Riggers, C., Karrenbauer, J., & Blume, H. (2024). Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. In Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024 (pp. 28-29). (IEEE International Conference on Application-Specific Systems, Architectures, and Processors). https://doi.org/10.1109/asap61560.2024.00016

Renke O, Riggers C, Karrenbauer J, Blume H. Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. in Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. 2024. p. 28-29. (IEEE International Conference on Application-Specific Systems, Architectures, and Processors). doi: 10.1109/asap61560.2024.00016

Renke, Oliver; Riggers, Christoph; Karrenbauer, Jens et al. / Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures. Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024. 2024. pp. 28-29 (IEEE International Conference on Application-Specific Systems, Architectures, and Processors).


@inproceedings{bd329bb3c7b643b3aa62454a4bec48ca,
title = "Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures",
abstract = "The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mIoU score of 43.2%. A mIoU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mIoU score of 55.2% in 67.85 ms.",
keywords = "CNN Optimization, CNN Quantization, Design Space Exploration, SalsaNext, Semantic Segmentation",
author = "Oliver Renke and Christoph Riggers and Jens Karrenbauer and Holger Blume",
note = "Publisher Copyright: {\textcopyright} 2024 IEEE.",
year = "2024",
doi = "10.1109/asap61560.2024.00016",
language = "English",
isbn = "979-8-3503-4964-1",
series = "IEEE International Conference on Application-Specific Systems, Architectures, and Processors",
pages = "28--29",
booktitle = "Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024",
}


TY - GEN
T1 - Design Space Exploration of Semantic Segmentation CNN SalsaNext for Constrained Architectures
AU - Renke, Oliver
AU - Riggers, Christoph
AU - Karrenbauer, Jens
AU - Blume, Holger
PY - 2024
Y1 - 2024
N2 - The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mIoU score of 43.2%. A mIoU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mIoU score of 55.2% in 67.85 ms.
AB - The growing use of LiDAR systems and constrained computing resources in the automotive sector require efficient LiDAR processing. SalsaNext, a convolutional neural network for semantic segmentation, is a promising candidate for deployment in that area. To extend the research regarding its quantization and investigate its adaptability to constrained resources, a design space exploration is performed. The design space, defined by model size, topology, and compute precision, is evaluated on a Jetson AGX Orin regarding classification accuracy, latency, and energy efficiency. The results display a trade-off between classification accuracy and runtime. The smallest model evaluated in INT8 on the GPU provides the smallest latency of 14.48 ms with a mIoU score of 43.2%. A mIoU score of 47.7% at a latency of 26.92 ms can be achieved with the medium-sized model and modified topology evaluated in INT8 on the DLA. The medium-sized model with modified topology provides good classification accuracy evaluated in FP32 on the GPU with a mIoU score of 55.2% in 67.85 ms.
KW - CNN Optimization
KW - CNN Quantization
KW - Design Space Exploration
KW - SalsaNext
KW - Semantic Segmentation
UR - http://www.scopus.com/inward/record.url?scp=85203107748&partnerID=8YFLogxK
U2 - 10.1109/asap61560.2024.00016
DO - 10.1109/asap61560.2024.00016
M3 - Conference contribution
SN - 979-8-3503-4964-1
T3 - IEEE International Conference on Application-Specific Systems, Architectures, and Processors
SP - 28
EP - 29
BT - Proceedings - 2024 IEEE 35th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2024
ER -

