Details
Original language | English |
---|---|
Title of host publication | Proceedings - 35th IEEE Intelligent Vehicles Symposium, IV 2024 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1236-1241 |
Number of pages | 6 |
ISBN (electronic) | 9798350348811 |
ISBN (print) | 979-8-3503-4882-8 |
Publication status | Published - 2 June 2024 |
Event | 35th IEEE Intelligent Vehicles Symposium, IV 2024 - Jeju Island, South Korea. Duration: 2 June 2024 → 5 June 2024 |
Publication series
Name | IEEE Intelligent Vehicles Symposium, Proceedings |
---|---|
ISSN (print) | 1931-0587 |
ISSN (electronic) | 2642-7214 |
Abstract
Collective perception has attracted significant attention in recent years for its ability to mitigate occlusion and expand the field of view, thereby enhancing reliability, efficiency, and, most crucially, decision-making safety. However, developing collective perception models is highly resource-demanding because input data must be processed for many agents, usually dozens of images and point clouds per frame. This not only slows down model development for collective perception but also impedes the use of larger models. In this paper, we propose an agent-based training framework that handles the deep learning modules and the agent data separately, yielding a cleaner data-flow structure. The framework provides both an API for flexibly prototyping the data processing pipeline and defining the gradient calculation for each agent, and a user interface for interactive training, testing, and data visualization. Training experiments with four collective object detection models on the prominent collective perception benchmark OPV2V show that agent-based training can significantly reduce GPU memory consumption and training time while retaining inference performance. The framework and model implementations are available at https://github.com/YuanYunshuang/CoSense3D
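The memory saving described in the abstract can be illustrated with a toy sketch. This is a hypothetical, dependency-free example (it is not the CoSense3D API): gradients are computed one agent at a time and accumulated, so only one agent's data needs to be resident at once, instead of stacking all agents' inputs into a single large batch. The linear model and the `grad_mse_linear` / `agent_based_step` names are invented for illustration.

```python
# Hypothetical sketch of per-agent gradient accumulation (not the CoSense3D API).
# Each agent's data is processed on its own and the gradients are summed,
# so peak memory scales with one agent's data rather than all agents'.

def grad_mse_linear(w, xs, ys):
    """Gradient of the mean squared error of y = w*x over one agent's samples."""
    n = len(xs)
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / n

def agent_based_step(w, agents, lr=0.01):
    """One update: accumulate per-agent gradients, then apply a single step."""
    total_grad = 0.0
    for xs, ys in agents:  # only this agent's samples are "in memory" here
        total_grad += grad_mse_linear(w, xs, ys)
    return w - lr * total_grad / len(agents)

# Two "agents" observing the same underlying relation y = 3x
agents = [([1.0, 2.0], [3.0, 6.0]), ([4.0], [12.0])]
w = 0.0
for _ in range(200):
    w = agent_based_step(w, agents, lr=0.02)
print(round(w, 3))  # converges toward 3.0
```

Because gradients are additive across agents, this sequential accumulation produces the same update as processing all agents jointly, which is the intuition behind trading batch width for lower peak GPU memory.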
ASJC Scopus subject areas
- Computer Science (all)
- Computer Science Applications
- Engineering (all)
- Automotive Engineering
- Mathematics (all)
- Modelling and Simulation
Cite this
Proceedings - 35th IEEE Intelligent Vehicles Symposium, IV 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 1236-1241 (IEEE Intelligent Vehicles Symposium, Proceedings).
Publication: Contribution to book/report/collected edition/conference proceedings › Conference contribution › Research › Peer review
TY - GEN
T1 - CoSense3D
T2 - 35th IEEE Intelligent Vehicles Symposium, IV 2024
AU - Yuan, Yunshuang
AU - Sester, Monika
N1 - Publisher Copyright: © 2024 IEEE.
PY - 2024/6/2
Y1 - 2024/6/2
N2 - Collective Perception has attracted significant attention in recent years due to its advantage for mitigating occlusion and expanding the field-of-view, thereby enhancing reliability, efficiency, and, most crucially, decision-making safety. However, developing collective perception models is highly resource demanding due to extensive requirements of processing input data for many agents, usually dozens of images and point clouds for a single frame. This not only slows down the model development process for collective perception but also impedes the utilization of larger models. In this paper, we propose an agent-based training framework that handles the deep learning modules and agent data separately to have a cleaner data flow structure. This framework not only provides an API for flexibly prototyping the data processing pipeline and defining the gradient calculation for each agent, but also provides the user interface for interactive training, testing and data visualization. Training experiment results of four collective object detection models on the prominent collective perception benchmark OPV2V show that the agent-based training can significantly reduce the GPU memory consumption and training time while retaining inference performance. The framework and model implementations are available at https://github.com/YuanYunshuang/CoSense3D
AB - Collective Perception has attracted significant attention in recent years due to its advantage for mitigating occlusion and expanding the field-of-view, thereby enhancing reliability, efficiency, and, most crucially, decision-making safety. However, developing collective perception models is highly resource demanding due to extensive requirements of processing input data for many agents, usually dozens of images and point clouds for a single frame. This not only slows down the model development process for collective perception but also impedes the utilization of larger models. In this paper, we propose an agent-based training framework that handles the deep learning modules and agent data separately to have a cleaner data flow structure. This framework not only provides an API for flexibly prototyping the data processing pipeline and defining the gradient calculation for each agent, but also provides the user interface for interactive training, testing and data visualization. Training experiment results of four collective object detection models on the prominent collective perception benchmark OPV2V show that the agent-based training can significantly reduce the GPU memory consumption and training time while retaining inference performance. The framework and model implementations are available at https://github.com/YuanYunshuang/CoSense3D
KW - Collective Perception
KW - Efficient Training
KW - Object Detection
UR - http://www.scopus.com/inward/record.url?scp=85199788477&partnerID=8YFLogxK
U2 - 10.48550/arXiv.2404.18617
DO - 10.48550/arXiv.2404.18617
M3 - Conference contribution
SN - 979-8-3503-4882-8
T3 - IEEE Intelligent Vehicles Symposium, Proceedings
SP - 1236
EP - 1241
BT - Proceedings - 35th IEEE Intelligent Vehicles Symposium, IV 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 2 June 2024 through 5 June 2024
ER -