LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Duy M.H. Nguyen
  • Roberto Henschel
  • Bodo Rosenhahn
  • Daniel Sonntag
  • Paul Swoboda

Research Organisations

External Research Organisations

  • Max-Planck Institute for Informatics
  • German Research Centre for Artificial Intelligence (DFKI)
  • Carl von Ossietzky University of Oldenburg
View graph of relations

Details

Original languageEnglish
Title of host publication2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Pages8856-8865
Number of pages10
ISBN (electronic)978-1-6654-6946-3
Publication statusPublished - 2022
Event2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 - New Orleans, United States
Duration: 19 Jun 202224 Jun 2022

Publication series

NameProceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition
PublisherIEEE Computer Society
ISSN (Print)1063-6919

Abstract

Multi-Camera Multi-Object Tracking is currently drawing attention in the computer vision field due to its superior performance in real-world applications such as video surveillance with crowded scenes or in wide spaces. In this work, we propose a mathematically elegant multi-camera multiple object tracking approach based on a spatial-temporal lifted multicut formulation. Our model utilizes state-of-the-art tracklets produced by single-camera trackers as proposals. As these tracklets may contain ID-Switch errors, we refine them through a novel pre-clustering obtained from 3D geometry projections. As a result, we derive a better tracking graph without ID switches and more precise affinity costs for the data association phase. Tracklets are then matched to multi-camera trajectories by solving a global lifted multicut formulation that incorporates short and long-range temporal interactions on tracklets located in the same camera as well as inter-camera ones. Experimental results on the WildTrack dataset yield near-perfect performance, outperforming state-of-the-art trackers on Campus while being on par on the PETS-09 dataset. We will release our implementations at this link https://github.com/nhmduy/LMGP.

Keywords

    Motion and tracking, Optimization methods, Scene analysis and understanding, Video analysis and understanding, Vision applications and systems

ASJC Scopus subject areas

Cite this

LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking. / Nguyen, Duy M.H.; Henschel, Roberto; Rosenhahn, Bodo et al.
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022. p. 8856-8865 (Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Nguyen, DMH, Henschel, R, Rosenhahn, B, Sonntag, D & Swoboda, P 2022, LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking. in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 8856-8865, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, United States, 19 Jun 2022. https://doi.org/10.48550/arXiv.2111.11892, https://doi.org/10.1109/CVPR52688.2022.00866
Nguyen, D. M. H., Henschel, R., Rosenhahn, B., Sonntag, D., & Swoboda, P. (2022). LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 8856-8865). (Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition). https://doi.org/10.48550/arXiv.2111.11892, https://doi.org/10.1109/CVPR52688.2022.00866
Nguyen DMH, Henschel R, Rosenhahn B, Sonntag D, Swoboda P. LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022. p. 8856-8865. (Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.48550/arXiv.2111.11892, 10.1109/CVPR52688.2022.00866
Nguyen, Duy M.H. ; Henschel, Roberto ; Rosenhahn, Bodo et al. / LMGP : Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022. pp. 8856-8865 (Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition).
Download
@inproceedings{18ddc6baec2c4384977dc3ab70eefa5d,
title = "LMGP: Lifted Multicut Meets Geometry Projections for Multi-Camera Multi-Object Tracking",
abstract = "Multi-Camera Multi-Object Tracking is currently drawing attention in the computer vision field due to its superior performance in real-world applications such as video surveillance with crowded scenes or in wide spaces. In this work, we propose a mathematically elegant multi-camera multiple object tracking approach based on a spatial-temporal lifted multicut formulation. Our model utilizes state-of-the-art tracklets produced by single-camera trackers as proposals. As these tracklets may contain ID-Switch errors, we refine them through a novel pre-clustering obtained from 3D geometry projections. As a result, we derive a better tracking graph without ID switches and more precise affinity costs for the data association phase. Tracklets are then matched to multi-camera trajectories by solving a global lifted multicut formulation that incorporates short and long-range temporal interactions on tracklets located in the same camera as well as inter-camera ones. Experimental results on the WildTrack dataset yield near-perfect performance, outperforming state-of-the-art trackers on Campus while being on par on the PETS-09 dataset. We will release our implementations at this link https://github.com/nhmduy/LMGP.",
keywords = "Motion and tracking, Optimization methods, Scene analysis and understanding, Video analysis and understanding, Vision applications and systems",
author = "Nguyen, {Duy M.H.} and Roberto Henschel and Bodo Rosenhahn and Daniel Sonntag and Paul Swoboda",
note = "Funding Information: This research is sponsored by the XAINES and pAItient projects (BMBF grant no. 01IW20005 and BMG grant no. 2520DAT0P2 respectively). ; 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 ; Conference date: 19-06-2022 Through 24-06-2022",
year = "2022",
doi = "10.48550/arXiv.2111.11892",
language = "English",
isbn = "978-1-6654-6947-0",
series = "Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition",
publisher = "IEEE Computer Society",
pages = "8856--8865",
booktitle = "2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",

}

Download

TY - GEN

T1 - LMGP

T2 - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022

AU - Nguyen, Duy M.H.

AU - Henschel, Roberto

AU - Rosenhahn, Bodo

AU - Sonntag, Daniel

AU - Swoboda, Paul

N1 - Funding Information: This research is sponsored by the XAINES and pAItient projects (BMBF grant no. 01IW20005 and BMG grant no. 2520DAT0P2 respectively).

PY - 2022

Y1 - 2022

N2 - Multi-Camera Multi-Object Tracking is currently drawing attention in the computer vision field due to its superior performance in real-world applications such as video surveillance with crowded scenes or in wide spaces. In this work, we propose a mathematically elegant multi-camera multiple object tracking approach based on a spatial-temporal lifted multicut formulation. Our model utilizes state-of-the-art tracklets produced by single-camera trackers as proposals. As these tracklets may contain ID-Switch errors, we refine them through a novel pre-clustering obtained from 3D geometry projections. As a result, we derive a better tracking graph without ID switches and more precise affinity costs for the data association phase. Tracklets are then matched to multi-camera trajectories by solving a global lifted multicut formulation that incorporates short and long-range temporal interactions on tracklets located in the same camera as well as inter-camera ones. Experimental results on the WildTrack dataset yield near-perfect performance, outperforming state-of-the-art trackers on Campus while being on par on the PETS-09 dataset. We will release our implementations at this link https://github.com/nhmduy/LMGP.

AB - Multi-Camera Multi-Object Tracking is currently drawing attention in the computer vision field due to its superior performance in real-world applications such as video surveillance with crowded scenes or in wide spaces. In this work, we propose a mathematically elegant multi-camera multiple object tracking approach based on a spatial-temporal lifted multicut formulation. Our model utilizes state-of-the-art tracklets produced by single-camera trackers as proposals. As these tracklets may contain ID-Switch errors, we refine them through a novel pre-clustering obtained from 3D geometry projections. As a result, we derive a better tracking graph without ID switches and more precise affinity costs for the data association phase. Tracklets are then matched to multi-camera trajectories by solving a global lifted multicut formulation that incorporates short and long-range temporal interactions on tracklets located in the same camera as well as inter-camera ones. Experimental results on the WildTrack dataset yield near-perfect performance, outperforming state-of-the-art trackers on Campus while being on par on the PETS-09 dataset. We will release our implementations at this link https://github.com/nhmduy/LMGP.

KW - Motion and tracking

KW - Optimization methods

KW - Scene analysis and understanding

KW - Video analysis and understanding

KW - Vision applications and systems

UR - http://www.scopus.com/inward/record.url?scp=85138289826&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2111.11892

DO - 10.48550/arXiv.2111.11892

M3 - Conference contribution

AN - SCOPUS:85138289826

SN - 978-1-6654-6947-0

T3 - Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 8856

EP - 8865

BT - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Y2 - 19 June 2022 through 24 June 2022

ER -

By the same author(s)