On the spatial extents of SIFT descriptors for visual concept detection

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Markus Mühling
  • Ralph Ewerth
  • Bernd Freisleben

Externe Organisationen

  • Philipps-Universität Marburg
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksComputer Vision Systems
Untertitel 8th International Conference, ICVS 2011, Proceedings
Seiten71-80
Seitenumfang10
PublikationsstatusVeröffentlicht - 2011
Extern publiziertJa
Veranstaltung8th International Conference on Computer Vision Systems, ICVS 2011 - Sophia Antipolis, Frankreich
Dauer: 20 Sept. 201122 Sept. 2011

Publikationsreihe

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Band6962 LNCS
ISSN (Print)0302-9743
ISSN (elektronisch)1611-3349

Abstract

State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.

ASJC Scopus Sachgebiete

Zitieren

On the spatial extents of SIFT descriptors for visual concept detection. / Mühling, Markus; Ewerth, Ralph; Freisleben, Bernd.
Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. 2011. S. 71-80 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 6962 LNCS).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Mühling, M, Ewerth, R & Freisleben, B 2011, On the spatial extents of SIFT descriptors for visual concept detection. in Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Bd. 6962 LNCS, S. 71-80, 8th International Conference on Computer Vision Systems, ICVS 2011, Sophia Antipolis, Frankreich, 20 Sept. 2011. https://doi.org/10.1007/978-3-642-23968-7_8
Mühling, M., Ewerth, R., & Freisleben, B. (2011). On the spatial extents of SIFT descriptors for visual concept detection. In Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings (S. 71-80). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 6962 LNCS). https://doi.org/10.1007/978-3-642-23968-7_8
Mühling M, Ewerth R, Freisleben B. On the spatial extents of SIFT descriptors for visual concept detection. in Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. 2011. S. 71-80. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-23968-7_8
Mühling, Markus ; Ewerth, Ralph ; Freisleben, Bernd. / On the spatial extents of SIFT descriptors for visual concept detection. Computer Vision Systems : 8th International Conference, ICVS 2011, Proceedings. 2011. S. 71-80 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{f3b08112aaf24f89bd869e1b6e143bec,
title = "On the spatial extents of SIFT descriptors for visual concept detection",
abstract = "State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.",
keywords = "Bag-of-Words, Magnification Factor, SIFT, Spatial Bin Size, Video Retrieval, Visual Concept Detection",
author = "Markus M{\"u}hling and Ralph Ewerth and Bernd Freisleben",
year = "2011",
doi = "10.1007/978-3-642-23968-7_8",
language = "English",
isbn = "9783642239670",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "71--80",
booktitle = "Computer Vision Systems",
note = "8th International Conference on Computer Vision Systems, ICVS 2011 ; Conference date: 20-09-2011 Through 22-09-2011",

}

Download

TY - GEN

T1 - On the spatial extents of SIFT descriptors for visual concept detection

AU - Mühling, Markus

AU - Ewerth, Ralph

AU - Freisleben, Bernd

PY - 2011

Y1 - 2011

N2 - State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.

AB - State-of-the-art systems for visual concept detection typically rely on the Bag-of-Visual-Words representation. While several aspects of this representation have , such as keypoint sampling strategy, vocabulary size, projection method, weighting scheme or the integration of color, the impact of the spatial extents of local SIFT descriptors has not been studied in previous work. In this paper, the effect of different spatial extents in a state-of-the-art system for visual concept detection is investigated. Based on the observation that SIFT descriptors with different spatial extents yield large performance differences, we propose a concept detection system that combines feature representations for different spatial extents using multiple kernel learning. It is shown experimentally on a large set of 101 concepts from the Mediamill Challenge and on the PASCAL Visual Object Classes Challenge that these feature representations are complementary: Superior performance can be achieved on both test sets using the proposed system.

KW - Bag-of-Words

KW - Magnification Factor

KW - SIFT

KW - Spatial Bin Size

KW - Video Retrieval

KW - Visual Concept Detection

UR - http://www.scopus.com/inward/record.url?scp=80053457242&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-23968-7_8

DO - 10.1007/978-3-642-23968-7_8

M3 - Conference contribution

AN - SCOPUS:80053457242

SN - 9783642239670

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 71

EP - 80

BT - Computer Vision Systems

T2 - 8th International Conference on Computer Vision Systems, ICVS 2011

Y2 - 20 September 2011 through 22 September 2011

ER -