LR-CNN: Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery

Liao Liao; Xiang Chen; Jingfeng Yang; Stefan Roth; Michael Goesele; Michael Ying Yang; Bodo Rosenhahn

doi:10.5194/isprs-annals-V-2-2020-381-2020

Details

Originalsprache	Englisch
Seiten (von - bis)	381-388
Seitenumfang	8
Fachzeitschrift	ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Jahrgang	5
Ausgabenummer	2
Publikationsstatus	Veröffentlicht - 3 Aug. 2020
Veranstaltung	2020 24th ISPRS Congress on Technical Commission II - Nice, Virtual, Frankreich Dauer: 31 Aug. 2020 → 2 Sept. 2020

Abstract

State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery. We enhance translation invariance to detect dense vehicles and address the boundary quantization issue amongst dense vehicles by aggregating the high-precision RoIs' features. Moreover, we resample high-level semantic pooled features, making them regain location information from the features of a shallower convolutional block. This strengthens the local feature invariance for the resampled features and enables detecting vehicles in an arbitrary orientation. The local feature invariance enhances the learning ability of the focal loss function, and the focal loss further helps to focus on the hard examples. Taken together, our method better addresses the challenges of aerial imagery. We evaluate our approach on several challenging datasets (VEDAI, DOTA), demonstrating a significant improvement over state-of-the-art methods. We demonstrate the good generalization ability of our approach on the DLR 3K dataset.

ASJC Scopus Sachgebiete

Erdkunde und Planetologie (insg.)
Erdkunde und Planetologie (sonstige)
Umweltwissenschaften (insg.)
Umweltwissenschaften (sonstige)
Physik und Astronomie (insg.)
Instrumentierung

Zitieren

LR-CNN: Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery. / Liao, Liao; Chen, Xiang; Yang, Jingfeng et al.
in: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Jahrgang 5, Nr. 2, 03.08.2020, S. 381-388.

Publikation: Beitrag in Fachzeitschrift › Konferenzaufsatz in Fachzeitschrift › Forschung › Peer-Review

Liao, L, Chen, X, Yang, J, Roth, S, Goesele, M, Yang, MY & Rosenhahn, B 2020, 'LR-CNN: Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery', ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Jg. 5, Nr. 2, S. 381-388. https://doi.org/10.5194/isprs-annals-V-2-2020-381-2020, https://doi.org/10.15488/10879

Liao, L., Chen, X., Yang, J., Roth, S., Goesele, M., Yang, M. Y., & Rosenhahn, B. (2020). LR-CNN: Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 5(2), 381-388. https://doi.org/10.5194/isprs-annals-V-2-2020-381-2020, https://doi.org/10.15488/10879

Liao L, Chen X, Yang J, Roth S, Goesele M, Yang MY et al. LR-CNN: Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2020 Aug 3;5(2):381-388. doi: 10.5194/isprs-annals-V-2-2020-381-2020, 10.15488/10879

Liao, Liao ; Chen, Xiang ; Yang, Jingfeng et al. / LR-CNN : Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery. in: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2020 ; Jahrgang 5, Nr. 2. S. 381-388.

Download

@article{f6ff3ddeaae447e3ba0fd4556fd25d6e,

title = "LR-CNN: Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery",

abstract = "State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery. We enhance translation invariance to detect dense vehicles and address the boundary quantization issue amongst dense vehicles by aggregating the high-precision RoIs' features. Moreover, we resample high-level semantic pooled features, making them regain location information from the features of a shallower convolutional block. This strengthens the local feature invariance for the resampled features and enables detecting vehicles in an arbitrary orientation. The local feature invariance enhances the learning ability of the focal loss function, and the focal loss further helps to focus on the hard examples. Taken together, our method better addresses the challenges of aerial imagery. We evaluate our approach on several challenging datasets (VEDAI, DOTA), demonstrating a significant improvement over state-of-the-art methods. We demonstrate the good generalization ability of our approach on the DLR 3K dataset.",

keywords = "Deep Learning, Feature Enhancement, Object Detection, Twin Region Proposal, Vehicle Detection",

author = "Liao Liao and Xiang Chen and Jingfeng Yang and Stefan Roth and Michael Goesele and Yang, {Michael Ying} and Bodo Rosenhahn",

note = "Funding information: This work was supported by German Research Foundation (DFG) grants COVMAP (RO 2497/12-2) and PhoenixD (EXC 2122, Project ID 390833453).; 2020 24th ISPRS Congress on Technical Commission II ; Conference date: 31-08-2020 Through 02-09-2020",

year = "2020",

month = aug,

day = "3",

doi = "10.5194/isprs-annals-V-2-2020-381-2020",

language = "English",

volume = "5",

pages = "381--388",

number = "2",

}

Download

TY - JOUR

T1 - LR-CNN

T2 - 2020 24th ISPRS Congress on Technical Commission II

AU - Liao, Liao

AU - Chen, Xiang

AU - Yang, Jingfeng

AU - Roth, Stefan

AU - Goesele, Michael

AU - Yang, Michael Ying

AU - Rosenhahn, Bodo

N1 - Funding information: This work was supported by German Research Foundation (DFG) grants COVMAP (RO 2497/12-2) and PhoenixD (EXC 2122, Project ID 390833453).

PY - 2020/8/3

Y1 - 2020/8/3

N2 - State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery. We enhance translation invariance to detect dense vehicles and address the boundary quantization issue amongst dense vehicles by aggregating the high-precision RoIs' features. Moreover, we resample high-level semantic pooled features, making them regain location information from the features of a shallower convolutional block. This strengthens the local feature invariance for the resampled features and enables detecting vehicles in an arbitrary orientation. The local feature invariance enhances the learning ability of the focal loss function, and the focal loss further helps to focus on the hard examples. Taken together, our method better addresses the challenges of aerial imagery. We evaluate our approach on several challenging datasets (VEDAI, DOTA), demonstrating a significant improvement over state-of-the-art methods. We demonstrate the good generalization ability of our approach on the DLR 3K dataset.

AB - State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery. We enhance translation invariance to detect dense vehicles and address the boundary quantization issue amongst dense vehicles by aggregating the high-precision RoIs' features. Moreover, we resample high-level semantic pooled features, making them regain location information from the features of a shallower convolutional block. This strengthens the local feature invariance for the resampled features and enables detecting vehicles in an arbitrary orientation. The local feature invariance enhances the learning ability of the focal loss function, and the focal loss further helps to focus on the hard examples. Taken together, our method better addresses the challenges of aerial imagery. We evaluate our approach on several challenging datasets (VEDAI, DOTA), demonstrating a significant improvement over state-of-the-art methods. We demonstrate the good generalization ability of our approach on the DLR 3K dataset.

KW - Deep Learning

KW - Feature Enhancement

KW - Object Detection

KW - Twin Region Proposal

KW - Vehicle Detection

UR - http://www.scopus.com/inward/record.url?scp=85091068652&partnerID=8YFLogxK

U2 - 10.5194/isprs-annals-V-2-2020-381-2020

DO - 10.5194/isprs-annals-V-2-2020-381-2020

M3 - Conference article

AN - SCOPUS:85091068652

VL - 5

SP - 381

EP - 388

JO - ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

JF - ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

SN - 2194-9042

IS - 2

Y2 - 31 August 2020 through 2 September 2020

ER -

Research@Leibniz University

LR-CNN: Local-Aware Region Cnn for Vehicle Detection in Aerial Imagery

Autorschaft

Organisationseinheiten

Externe Organisationen

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Automl for Multi-Class Anomaly Compensation of Sensor Drift

Segment Any Object Model (SAOM): Real-To-Simulation Fine-Tuning Strategy For Multi-Class Multi-Instance Segmentation

Indoor Scene Change Understanding (SCU): Segment, Describe, and Revert Any Change

Robust Shape Fitting for 3D Scene Abstraction

Quantum normalizing flows for anomaly detection