Semi-supervised identification of rarely appearing persons in video by correcting weak labels

Eric Müller; Christian Otto; Ralph Ewerth

doi:10.1145/2911996.2912073

Details

Originalsprache	Englisch
Titel des Sammelwerks	ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval
Seiten	381-384
Seitenumfang	4
ISBN (elektronisch)	9781450343596
Publikationsstatus	Veröffentlicht - 6 Juni 2016
Veranstaltung	6th ACM International Conference on Multimedia Retrieval, ICMR 2016 - New York, USA / Vereinigte Staaten Dauer: 6 Juni 2016 → 9 Juni 2016

Publikationsreihe

Name	ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval

Abstract

Some recent approaches for character identification in movies and TV broadcasts are realized in a semi-supervised manner by assigning transcripts and/or subtitles to the speakers. However, the labels obtained in this way achieve only an accuracy of 80%-90% and the number of training examples for the different actors is unevenly distributed. In this paper, we propose a novel approach for person identification in video by correcting and extending the training data with reliable predictions to reduce the number of annotation errors. Furthermore, the intra-class diversity of rarely speaking characters is enhanced. To address the imbalance of training data per person, we suggest two complementary prediction scores. These scores are also used to recognize whether or not a face track belongs to a (supporting) character whose identity does not appear in the transcript etc. Experimental results demonstrate the feasibility of the proposed approach, outperforming the current state of the art.

ASJC Scopus Sachgebiete

Informatik (insg.)
Computergrafik und computergestütztes Design
Informatik (insg.)
Mensch-Maschine-Interaktion
Informatik (insg.)
Software

Zitieren

Semi-supervised identification of rarely appearing persons in video by correcting weak labels. / Müller, Eric; Otto, Christian; Ewerth, Ralph.
ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval. 2016. S. 381-384 (ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/Konferenzband › Aufsatz in Konferenzband › Forschung › Peer-Review

Müller, E, Otto, C & Ewerth, R 2016, Semi-supervised identification of rarely appearing persons in video by correcting weak labels. in ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval. ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval, S. 381-384, 6th ACM International Conference on Multimedia Retrieval, ICMR 2016, New York, USA / Vereinigte Staaten, 6 Juni 2016. https://doi.org/10.1145/2911996.2912073

Müller, E., Otto, C., & Ewerth, R. (2016). Semi-supervised identification of rarely appearing persons in video by correcting weak labels. In ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval (S. 381-384). (ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval). https://doi.org/10.1145/2911996.2912073

Müller E, Otto C, Ewerth R. Semi-supervised identification of rarely appearing persons in video by correcting weak labels. in ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval. 2016. S. 381-384. (ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval). doi: 10.1145/2911996.2912073

Müller, Eric ; Otto, Christian ; Ewerth, Ralph. / Semi-supervised identification of rarely appearing persons in video by correcting weak labels. ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval. 2016. S. 381-384 (ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval).

Download

@inproceedings{09ff775b20d64535b26dc83c41b0d6ba,

title = "Semi-supervised identification of rarely appearing persons in video by correcting weak labels",

abstract = "Some recent approaches for character identification in movies and TV broadcasts are realized in a semi-supervised manner by assigning transcripts and/or subtitles to the speakers. However, the labels obtained in this way achieve only an accuracy of 80%-90% and the number of training examples for the different actors is unevenly distributed. In this paper, we propose a novel approach for person identification in video by correcting and extending the training data with reliable predictions to reduce the number of annotation errors. Furthermore, the intra-class diversity of rarely speaking characters is enhanced. To address the imbalance of training data per person, we suggest two complementary prediction scores. These scores are also used to recognize whether or not a face track belongs to a (supporting) character whose identity does not appear in the transcript etc. Experimental results demonstrate the feasibility of the proposed approach, outperforming the current state of the art.",

keywords = "Face identification in video, Semi-supervised learning",

author = "Eric M{\"u}ller and Christian Otto and Ralph Ewerth",

year = "2016",

month = jun,

day = "6",

doi = "10.1145/2911996.2912073",

language = "English",

series = "ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval",

pages = "381--384",

booktitle = "ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval",

note = "6th ACM International Conference on Multimedia Retrieval, ICMR 2016 ; Conference date: 06-06-2016 Through 09-06-2016",

}

Download

TY - GEN

T1 - Semi-supervised identification of rarely appearing persons in video by correcting weak labels

AU - Müller, Eric

AU - Otto, Christian

AU - Ewerth, Ralph

PY - 2016/6/6

Y1 - 2016/6/6

N2 - Some recent approaches for character identification in movies and TV broadcasts are realized in a semi-supervised manner by assigning transcripts and/or subtitles to the speakers. However, the labels obtained in this way achieve only an accuracy of 80%-90% and the number of training examples for the different actors is unevenly distributed. In this paper, we propose a novel approach for person identification in video by correcting and extending the training data with reliable predictions to reduce the number of annotation errors. Furthermore, the intra-class diversity of rarely speaking characters is enhanced. To address the imbalance of training data per person, we suggest two complementary prediction scores. These scores are also used to recognize whether or not a face track belongs to a (supporting) character whose identity does not appear in the transcript etc. Experimental results demonstrate the feasibility of the proposed approach, outperforming the current state of the art.

AB - Some recent approaches for character identification in movies and TV broadcasts are realized in a semi-supervised manner by assigning transcripts and/or subtitles to the speakers. However, the labels obtained in this way achieve only an accuracy of 80%-90% and the number of training examples for the different actors is unevenly distributed. In this paper, we propose a novel approach for person identification in video by correcting and extending the training data with reliable predictions to reduce the number of annotation errors. Furthermore, the intra-class diversity of rarely speaking characters is enhanced. To address the imbalance of training data per person, we suggest two complementary prediction scores. These scores are also used to recognize whether or not a face track belongs to a (supporting) character whose identity does not appear in the transcript etc. Experimental results demonstrate the feasibility of the proposed approach, outperforming the current state of the art.

KW - Face identification in video

KW - Semi-supervised learning

UR - http://www.scopus.com/inward/record.url?scp=84978682976&partnerID=8YFLogxK

U2 - 10.1145/2911996.2912073

DO - 10.1145/2911996.2912073

M3 - Conference contribution

AN - SCOPUS:84978682976

T3 - ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval

SP - 381

EP - 384

BT - ICMR 2016 - Proceedings of the 2016 ACM International Conference on Multimedia Retrieval

T2 - 6th ACM International Conference on Multimedia Retrieval, ICMR 2016

Y2 - 6 June 2016 through 9 June 2016

ER -

Research@Leibniz University

Semi-supervised identification of rarely appearing persons in video by correcting weak labels

Autorschaft

Organisationseinheiten

Externe Organisationen

Details

Publikationsreihe

Abstract

ASJC Scopus Sachgebiete

Zitieren