Semi-supervised learning for semantic video retrieval

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Ralph Ewerth
  • Bernd Freisleben

Externe Organisationen

  • Philipps-Universität Marburg
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksProceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007
Seiten154-161
Seitenumfang8
PublikationsstatusVeröffentlicht - 9 Juli 2007
Extern publiziertJa
Veranstaltung6th ACM International Conference on Image and Video Retrieval, CIVR 2007 - Amsterdam, Niederlande
Dauer: 9 Juli 200711 Juli 2007

Publikationsreihe

NameProceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007

Abstract

The automatic understanding of audiovisual content for multimedia retrieval is a difficult task, since the meaning respectively the appearance of a certain event or concept is strongly determined by contextual information. For example, the appearance of a high-level concept, such as e.g. maps or news anchors, is determined by the used editing layout which usually is typical for a certain broadcasting station. In this paper, we show that it is possible to adaptively learn the appearance of certain objects or events for a particular test video utilizing unlabeled data in order to improve a subsequent retrieval process. First, an initial model is obtained via supervised learning using a set of appropriate training videos. Then, this initial model is used to rank shots for each test video v separately. This ranking is used to label the most relevant and most irrelevant shots in a video v for subsequent use as training data in a semi-supervised learning process. Based on these automatically labeled training data, relevant features are selected for the concept under consideration for video v. Then, two additional classifiers are trained on the automatically labeled data of this video. Adaboost and Support Vector Machines (SVM) are incorporated for feature selection and ensemble classification. Finally, the newly trained classifiers and the initial model form an ensemble. Experimental results on TRECVID 2005 video data demonstrate the feasibility of the proposed learning scheme for certain high-level concepts.

ASJC Scopus Sachgebiete

Zitieren

Semi-supervised learning for semantic video retrieval. / Ewerth, Ralph; Freisleben, Bernd.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007. 2007. S. 154-161 (Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Ewerth, R & Freisleben, B 2007, Semi-supervised learning for semantic video retrieval. in Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007. Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007, S. 154-161, 6th ACM International Conference on Image and Video Retrieval, CIVR 2007, Amsterdam, Niederlande, 9 Juli 2007. https://doi.org/10.1145/1282280.1282308
Ewerth, R., & Freisleben, B. (2007). Semi-supervised learning for semantic video retrieval. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007 (S. 154-161). (Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007). https://doi.org/10.1145/1282280.1282308
Ewerth R, Freisleben B. Semi-supervised learning for semantic video retrieval. in Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007. 2007. S. 154-161. (Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007). doi: 10.1145/1282280.1282308
Ewerth, Ralph ; Freisleben, Bernd. / Semi-supervised learning for semantic video retrieval. Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007. 2007. S. 154-161 (Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007).
Download
@inproceedings{4ff474964e95483aa70130fd4c2c5138,
title = "Semi-supervised learning for semantic video retrieval",
abstract = "The automatic understanding of audiovisual content for multimedia retrieval is a difficult task, since the meaning respectively the appearance of a certain event or concept is strongly determined by contextual information. For example, the appearance of a high-level concept, such as e.g. maps or news anchors, is determined by the used editing layout which usually is typical for a certain broadcasting station. In this paper, we show that it is possible to adaptively learn the appearance of certain objects or events for a particular test video utilizing unlabeled data in order to improve a subsequent retrieval process. First, an initial model is obtained via supervised learning using a set of appropriate training videos. Then, this initial model is used to rank shots for each test video v separately. This ranking is used to label the most relevant and most irrelevant shots in a video v for subsequent use as training data in a semi-supervised learning process. Based on these automatically labeled training data, relevant features are selected for the concept under consideration for video v. Then, two additional classifiers are trained on the automatically labeled data of this video. Adaboost and Support Vector Machines (SVM) are incorporated for feature selection and ensemble classification. Finally, the newly trained classifiers and the initial model form an ensemble. Experimental results on TRECVID 2005 video data demonstrate the feasibility of the proposed learning scheme for certain high-level concepts.",
keywords = "Semantic video retrieval, Semi-supervised learning",
author = "Ralph Ewerth and Bernd Freisleben",
year = "2007",
month = jul,
day = "9",
doi = "10.1145/1282280.1282308",
language = "English",
isbn = "1595937331",
series = "Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007",
pages = "154--161",
booktitle = "Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007",
note = "6th ACM International Conference on Image and Video Retrieval, CIVR 2007 ; Conference date: 09-07-2007 Through 11-07-2007",

}

Download

TY - GEN

T1 - Semi-supervised learning for semantic video retrieval

AU - Ewerth, Ralph

AU - Freisleben, Bernd

PY - 2007/7/9

Y1 - 2007/7/9

N2 - The automatic understanding of audiovisual content for multimedia retrieval is a difficult task, since the meaning respectively the appearance of a certain event or concept is strongly determined by contextual information. For example, the appearance of a high-level concept, such as e.g. maps or news anchors, is determined by the used editing layout which usually is typical for a certain broadcasting station. In this paper, we show that it is possible to adaptively learn the appearance of certain objects or events for a particular test video utilizing unlabeled data in order to improve a subsequent retrieval process. First, an initial model is obtained via supervised learning using a set of appropriate training videos. Then, this initial model is used to rank shots for each test video v separately. This ranking is used to label the most relevant and most irrelevant shots in a video v for subsequent use as training data in a semi-supervised learning process. Based on these automatically labeled training data, relevant features are selected for the concept under consideration for video v. Then, two additional classifiers are trained on the automatically labeled data of this video. Adaboost and Support Vector Machines (SVM) are incorporated for feature selection and ensemble classification. Finally, the newly trained classifiers and the initial model form an ensemble. Experimental results on TRECVID 2005 video data demonstrate the feasibility of the proposed learning scheme for certain high-level concepts.

AB - The automatic understanding of audiovisual content for multimedia retrieval is a difficult task, since the meaning respectively the appearance of a certain event or concept is strongly determined by contextual information. For example, the appearance of a high-level concept, such as e.g. maps or news anchors, is determined by the used editing layout which usually is typical for a certain broadcasting station. In this paper, we show that it is possible to adaptively learn the appearance of certain objects or events for a particular test video utilizing unlabeled data in order to improve a subsequent retrieval process. First, an initial model is obtained via supervised learning using a set of appropriate training videos. Then, this initial model is used to rank shots for each test video v separately. This ranking is used to label the most relevant and most irrelevant shots in a video v for subsequent use as training data in a semi-supervised learning process. Based on these automatically labeled training data, relevant features are selected for the concept under consideration for video v. Then, two additional classifiers are trained on the automatically labeled data of this video. Adaboost and Support Vector Machines (SVM) are incorporated for feature selection and ensemble classification. Finally, the newly trained classifiers and the initial model form an ensemble. Experimental results on TRECVID 2005 video data demonstrate the feasibility of the proposed learning scheme for certain high-level concepts.

KW - Semantic video retrieval

KW - Semi-supervised learning

UR - http://www.scopus.com/inward/record.url?scp=36849060178&partnerID=8YFLogxK

U2 - 10.1145/1282280.1282308

DO - 10.1145/1282280.1282308

M3 - Conference contribution

AN - SCOPUS:36849060178

SN - 1595937331

SN - 9781595937339

T3 - Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007

SP - 154

EP - 161

BT - Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007

T2 - 6th ACM International Conference on Image and Video Retrieval, CIVR 2007

Y2 - 9 July 2007 through 11 July 2007

ER -