University of Marburg at TRECVID 2011: Semantic indexing task

Markus Mühling; Khalid Ballafkir; Ralph Ewerth; Bernd Freisleben

Details

Original language	English
Publication status	Published - 2011
Externally published	Yes
Event	TREC Video Retrieval Evaluation, TRECVID 2011 - Gaithersburg, MD, United States. Duration: 5 Dec 2011 → 7 Dec 2011

Conference

Conference	TREC Video Retrieval Evaluation, TRECVID 2011
Country/Territory	United States
City	Gaithersburg, MD
Period	5 Dec 2011 → 7 Dec 2011

Abstract

In this paper, we summarize our results for the semantic indexing task at TRECVID 2011. Last year, we showed that using object detection results as additional mid-level features improved the overall performance of a bag-of-visual-words (BoVW) approach. This year, we repeated the experiment on a large concept vocabulary of 346 classes. In addition, we investigated whether feature descriptions of object regions can also improve concept detection performance. Because of the large number of face-related concepts, such as "adult", "female", "male", "dark skinned person", "first lady", "glasses", or "arafat", BoVW features were extracted from face regions and used as an additional feature representation. Furthermore, we introduced a new post-processing scheme that rescores shots based on concept relations. The experiments showed that the additional object-based features significantly improved concept detection performance. Further improvements were attained using region-based BoVW features and relation-based rescoring. Altogether, our best run achieved a mean inferred average precision of 12.3%, and we submitted the best results for the concepts "overlaid text" and "two persons".

ASJC Scopus subject areas

Computer Science (all) › Computer Graphics and Computer-Aided Design
Computer Science (all) › Computer Vision and Pattern Recognition
Computer Science (all) › Human-Computer Interaction
Computer Science (all) › Software

Cite

University of Marburg at TRECVID 2011: Semantic indexing task. / Mühling, Markus; Ballafkir, Khalid; Ewerth, Ralph et al.
2011. Paper presented at TREC Video Retrieval Evaluation, TRECVID 2011, Gaithersburg, MD, United States.

Publication: Conference contribution › Paper › Research › Peer-reviewed

Mühling, M, Ballafkir, K, Ewerth, R & Freisleben, B 2011, 'University of Marburg at TRECVID 2011: Semantic indexing task', Paper presented at TREC Video Retrieval Evaluation, TRECVID 2011, Gaithersburg, MD, United States, 5 Dec 2011 - 7 Dec 2011.

Mühling, M., Ballafkir, K., Ewerth, R., & Freisleben, B. (2011). University of Marburg at TRECVID 2011: Semantic indexing task. Paper presented at TREC Video Retrieval Evaluation, TRECVID 2011, Gaithersburg, MD, United States.

Mühling M, Ballafkir K, Ewerth R, Freisleben B. University of Marburg at TRECVID 2011: Semantic indexing task. 2011. Paper presented at TREC Video Retrieval Evaluation, TRECVID 2011, Gaithersburg, MD, United States.

Mühling, Markus; Ballafkir, Khalid; Ewerth, Ralph et al. / University of Marburg at TRECVID 2011: Semantic indexing task. Paper presented at TREC Video Retrieval Evaluation, TRECVID 2011, Gaithersburg, MD, United States.

BibTeX

@conference{feb031461d394a1a873db87923a1d4d9,
title = "University of Marburg at TRECVID 2011: Semantic indexing task",
abstract = "In this paper, we summarize our results for the semantic indexing task at TRECVID 2011. Last year, we showed that using object detection results as additional mid-level features improved the overall performance of a bag-of-visual-words (BoVW) approach. This year, we repeated the experiment on a large concept vocabulary of 346 classes. In addition, we investigated whether feature descriptions of object regions can also improve concept detection performance. Because of the large number of face-related concepts, such as {"}adult{"}, {"}female{"}, {"}male{"}, {"}dark skinned person{"}, {"}first lady{"}, {"}glasses{"}, or {"}arafat{"}, BoVW features were extracted from face regions and used as an additional feature representation. Furthermore, we introduced a new post-processing scheme that rescores shots based on concept relations. The experiments showed that the additional object-based features significantly improved concept detection performance. Further improvements were attained using region-based BoVW features and relation-based rescoring. Altogether, our best run achieved a mean inferred average precision of 12.3\%, and we submitted the best results for the concepts {"}overlaid text{"} and {"}two persons{"}.",
author = "Markus M{\"u}hling and Khalid Ballafkir and Ralph Ewerth and Bernd Freisleben",
year = "2011",
language = "English",
note = "TREC Video Retrieval Evaluation, TRECVID 2011 ; Conference date: 05-12-2011 Through 07-12-2011",
}

RIS

TY  - CONF
T1  - University of Marburg at TRECVID 2011: Semantic indexing task
T2  - TREC Video Retrieval Evaluation, TRECVID 2011
AU  - Mühling, Markus
AU  - Ballafkir, Khalid
AU  - Ewerth, Ralph
AU  - Freisleben, Bernd
PY  - 2011
Y1  - 2011
AB  - In this paper, we summarize our results for the semantic indexing task at TRECVID 2011. Last year, we showed that using object detection results as additional mid-level features improved the overall performance of a bag-of-visual-words (BoVW) approach. This year, we repeated the experiment on a large concept vocabulary of 346 classes. In addition, we investigated whether feature descriptions of object regions can also improve concept detection performance. Because of the large number of face-related concepts, such as "adult", "female", "male", "dark skinned person", "first lady", "glasses", or "arafat", BoVW features were extracted from face regions and used as an additional feature representation. Furthermore, we introduced a new post-processing scheme that rescores shots based on concept relations. The experiments showed that the additional object-based features significantly improved concept detection performance. Further improvements were attained using region-based BoVW features and relation-based rescoring. Altogether, our best run achieved a mean inferred average precision of 12.3%, and we submitted the best results for the concepts "overlaid text" and "two persons".
UR  - http://www.scopus.com/inward/record.url?scp=84905234447&partnerID=8YFLogxK
M3  - Paper
Y2  - 5 December 2011 through 7 December 2011
ER  - 
