Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking

Publikation: Beitrag in FachzeitschriftKonferenzaufsatz in FachzeitschriftForschungPeer-Review

Autoren

Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Seiten (von - bis)1619-1626
Seitenumfang8
FachzeitschriftInternational Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
Jahrgang48
Ausgabenummer1
PublikationsstatusVeröffentlicht - 14 Dez. 2023
VeranstaltungISPRS Geospatial Week 2023 - Kairo, Ägypten
Dauer: 2 Sept. 20237 Sept. 2023

Abstract

Recent advancements in multi-object tracking (MOT) have heavily relied on object detection models, with attention-based models like DEtection TRansformer (DETR) demonstrating state-of-the-art capabilities. However, the utilization of attention-based detection models in tracking poses a limitation due to their large parameter count, necessitating substantial training data and powerful hardware for parameter estimation. Ignoring this limitation can lead to a loss of valuable temporal information, resulting in decreased tracking performance and increased identity (ID) switches. To address this challenge, we propose a novel framework that directly incorporates motion priors into the tracking attention layer, enabling an end-to-end solution. Our contributions include: I) a novel approach for integrating motion priors into attention-based multi-object tracking models, and II) a specific realisation of this approach using a Kalman filter with a constant velocity assumption as motion prior. Our method was evaluated on the Multi-Object Tracking dataset MOT17, initial results are reported in the paper. Compared to a baseline model without motion prior, we achieve a reduction in the number of ID switches with the new method.

ASJC Scopus Sachgebiete

Zitieren

Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking. / Ali, R.; Mehltretter, M.; Heipke, C.
in: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Jahrgang 48, Nr. 1, 14.12.2023, S. 1619-1626.

Publikation: Beitrag in FachzeitschriftKonferenzaufsatz in FachzeitschriftForschungPeer-Review

Ali, R, Mehltretter, M & Heipke, C 2023, 'Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking', International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Jg. 48, Nr. 1, S. 1619-1626. https://doi.org/10.5194/isprs-archives-XLVIII-1-W2-2023-1619-2023
Ali, R., Mehltretter, M., & Heipke, C. (2023). Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, 48(1), 1619-1626. https://doi.org/10.5194/isprs-archives-XLVIII-1-W2-2023-1619-2023
Ali R, Mehltretter M, Heipke C. Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2023 Dez 14;48(1):1619-1626. doi: 10.5194/isprs-archives-XLVIII-1-W2-2023-1619-2023
Ali, R. ; Mehltretter, M. ; Heipke, C. / Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking. in: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2023 ; Jahrgang 48, Nr. 1. S. 1619-1626.
Download
@article{5a221602279a4d189327b540840f764c,
title = "Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking",
abstract = "Recent advancements in multi-object tracking (MOT) have heavily relied on object detection models, with attention-based models like DEtection TRansformer (DETR) demonstrating state-of-the-art capabilities. However, the utilization of attention-based detection models in tracking poses a limitation due to their large parameter count, necessitating substantial training data and powerful hardware for parameter estimation. Ignoring this limitation can lead to a loss of valuable temporal information, resulting in decreased tracking performance and increased identity (ID) switches. To address this challenge, we propose a novel framework that directly incorporates motion priors into the tracking attention layer, enabling an end-to-end solution. Our contributions include: I) a novel approach for integrating motion priors into attention-based multi-object tracking models, and II) a specific realisation of this approach using a Kalman filter with a constant velocity assumption as motion prior. Our method was evaluated on the Multi-Object Tracking dataset MOT17, initial results are reported in the paper. Compared to a baseline model without motion prior, we achieve a reduction in the number of ID switches with the new method.",
keywords = "Attention, Image Sequence Analysis, Motion Modelling, Pedestrian Tracking, Transformer",
author = "R. Ali and M. Mehltretter and C. Heipke",
note = "Funding Information: This work was supported by the German Research Foundation (DFG) as a part of the Research Training Group i.c.sens [GRK2159]. ; ISPRS Geospatial Week 2023 ; Conference date: 02-09-2023 Through 07-09-2023",
year = "2023",
month = dec,
day = "14",
doi = "10.5194/isprs-archives-XLVIII-1-W2-2023-1619-2023",
language = "English",
volume = "48",
pages = "1619--1626",
number = "1",

}

Download

TY - JOUR

T1 - Integrating Motion Priors For End-To-End Attention-Based Multi-Object Tracking

AU - Ali, R.

AU - Mehltretter, M.

AU - Heipke, C.

N1 - Funding Information: This work was supported by the German Research Foundation (DFG) as a part of the Research Training Group i.c.sens [GRK2159].

PY - 2023/12/14

Y1 - 2023/12/14

N2 - Recent advancements in multi-object tracking (MOT) have heavily relied on object detection models, with attention-based models like DEtection TRansformer (DETR) demonstrating state-of-the-art capabilities. However, the utilization of attention-based detection models in tracking poses a limitation due to their large parameter count, necessitating substantial training data and powerful hardware for parameter estimation. Ignoring this limitation can lead to a loss of valuable temporal information, resulting in decreased tracking performance and increased identity (ID) switches. To address this challenge, we propose a novel framework that directly incorporates motion priors into the tracking attention layer, enabling an end-to-end solution. Our contributions include: I) a novel approach for integrating motion priors into attention-based multi-object tracking models, and II) a specific realisation of this approach using a Kalman filter with a constant velocity assumption as motion prior. Our method was evaluated on the Multi-Object Tracking dataset MOT17, initial results are reported in the paper. Compared to a baseline model without motion prior, we achieve a reduction in the number of ID switches with the new method.

AB - Recent advancements in multi-object tracking (MOT) have heavily relied on object detection models, with attention-based models like DEtection TRansformer (DETR) demonstrating state-of-the-art capabilities. However, the utilization of attention-based detection models in tracking poses a limitation due to their large parameter count, necessitating substantial training data and powerful hardware for parameter estimation. Ignoring this limitation can lead to a loss of valuable temporal information, resulting in decreased tracking performance and increased identity (ID) switches. To address this challenge, we propose a novel framework that directly incorporates motion priors into the tracking attention layer, enabling an end-to-end solution. Our contributions include: I) a novel approach for integrating motion priors into attention-based multi-object tracking models, and II) a specific realisation of this approach using a Kalman filter with a constant velocity assumption as motion prior. Our method was evaluated on the Multi-Object Tracking dataset MOT17, initial results are reported in the paper. Compared to a baseline model without motion prior, we achieve a reduction in the number of ID switches with the new method.

KW - Attention

KW - Image Sequence Analysis

KW - Motion Modelling

KW - Pedestrian Tracking

KW - Transformer

UR - http://www.scopus.com/inward/record.url?scp=85183301229&partnerID=8YFLogxK

U2 - 10.5194/isprs-archives-XLVIII-1-W2-2023-1619-2023

DO - 10.5194/isprs-archives-XLVIII-1-W2-2023-1619-2023

M3 - Conference article

AN - SCOPUS:85183301229

VL - 48

SP - 1619

EP - 1626

JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

SN - 1682-1750

IS - 1

T2 - ISPRS Geospatial Week 2023

Y2 - 2 September 2023 through 7 September 2023

ER -

Von denselben Autoren