Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Evelyn Navarrete
  • Ralph Ewerth
  • Anett Hoppe

Research Organisations

External Research Organisations

  • German National Library of Science and Technology (TIB)
View graph of relations

Details

Original languageEnglish
Title of host publicationCIKM 2024
Subtitle of host publicationProceedings of the 33rd ACM International Conference on Information and Knowledge Management
Pages1743-1751
Number of pages9
ISBN (electronic)9798400704369
Publication statusPublished - 21 Oct 2024
Event33rd ACM International Conference on Information and Knowledge Management, CIKM 2024 - Boise, United States
Duration: 21 Oct 202425 Oct 2024

Abstract

Identifying the regions of a learning resource that a learner pays attention to is crucial for assessing the material's impact and improving its design and related support systems. Saliency detection in videos addresses the automatic recognition of attention-drawing regions in single frames. In educational settings, the recognition of pertinent regions in a video's visual stream can enhance content accessibility and information retrieval tasks such as video segmentation, navigation, and summarization. Such advancements can pave the way for the development of advanced AI-assisted technologies that support learning with greater efficacy. However, this task becomes particularly challenging for educational videos due to the combination of unique characteristics such as text, voice, illustrations, animations, and more. To the best of our knowledge, there is currently no study that evaluates saliency detection approaches in educational videos. In this paper, we address this gap by evaluating four state-of-the-art saliency detection approaches for educational videos. We reproduce the original studies and explore the replication capabilities for general-purpose (non-educational) datasets. Then, we investigate the generalization capabilities of the models and evaluate their performance on educational videos. We conduct a comprehensive analysis to identify common failure scenarios and possible areas of improvement. Our experimental results show that educational videos remain a challenging context for generic video saliency detection models.

Keywords

    educational videos, video saliency detection, video-based learning

ASJC Scopus subject areas

Cite this

Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions. / Navarrete, Evelyn; Ewerth, Ralph; Hoppe, Anett.
CIKM 2024 : Proceedings of the 33rd ACM International Conference on Information and Knowledge Management. 2024. p. 1743-1751.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Navarrete, E, Ewerth, R & Hoppe, A 2024, Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions. in CIKM 2024 : Proceedings of the 33rd ACM International Conference on Information and Knowledge Management. pp. 1743-1751, 33rd ACM International Conference on Information and Knowledge Management, CIKM 2024, Boise, United States, 21 Oct 2024. https://doi.org/10.48550/arXiv.2408.04515, https://doi.org/10.1145/3627673.3679825
Navarrete, E., Ewerth, R., & Hoppe, A. (2024). Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions. In CIKM 2024 : Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (pp. 1743-1751) https://doi.org/10.48550/arXiv.2408.04515, https://doi.org/10.1145/3627673.3679825
Navarrete E, Ewerth R, Hoppe A. Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions. In CIKM 2024 : Proceedings of the 33rd ACM International Conference on Information and Knowledge Management. 2024. p. 1743-1751 doi: 10.48550/arXiv.2408.04515, 10.1145/3627673.3679825
Navarrete, Evelyn ; Ewerth, Ralph ; Hoppe, Anett. / Saliency Detection in Educational Videos : Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions. CIKM 2024 : Proceedings of the 33rd ACM International Conference on Information and Knowledge Management. 2024. pp. 1743-1751
Download
@inproceedings{ded265f16c11422cacd7d801d3af3171,
title = "Saliency Detection in Educational Videos: Analyzing the Performance of Current Models, Identifying Limitations and Advancement Directions",
abstract = "Identifying the regions of a learning resource that a learner pays attention to is crucial for assessing the material's impact and improving its design and related support systems. Saliency detection in videos addresses the automatic recognition of attention-drawing regions in single frames. In educational settings, the recognition of pertinent regions in a video's visual stream can enhance content accessibility and information retrieval tasks such as video segmentation, navigation, and summarization. Such advancements can pave the way for the development of advanced AI-assisted technologies that support learning with greater efficacy. However, this task becomes particularly challenging for educational videos due to the combination of unique characteristics such as text, voice, illustrations, animations, and more. To the best of our knowledge, there is currently no study that evaluates saliency detection approaches in educational videos. In this paper, we address this gap by evaluating four state-of-the-art saliency detection approaches for educational videos. We reproduce the original studies and explore the replication capabilities for general-purpose (non-educational) datasets. Then, we investigate the generalization capabilities of the models and evaluate their performance on educational videos. We conduct a comprehensive analysis to identify common failure scenarios and possible areas of improvement. Our experimental results show that educational videos remain a challenging context for generic video saliency detection models.",
keywords = "educational videos, video saliency detection, video-based learning",
author = "Evelyn Navarrete and Ralph Ewerth and Anett Hoppe",
note = "Publisher Copyright: {\textcopyright} 2024 Owner/Author.; 33rd ACM International Conference on Information and Knowledge Management, CIKM 2024 ; Conference date: 21-10-2024 Through 25-10-2024",
year = "2024",
month = oct,
day = "21",
doi = "10.48550/arXiv.2408.04515",
language = "English",
pages = "1743--1751",
booktitle = "CIKM 2024",

}

Download

TY - GEN

T1 - Saliency Detection in Educational Videos

T2 - 33rd ACM International Conference on Information and Knowledge Management, CIKM 2024

AU - Navarrete, Evelyn

AU - Ewerth, Ralph

AU - Hoppe, Anett

N1 - Publisher Copyright: © 2024 Owner/Author.

PY - 2024/10/21

Y1 - 2024/10/21

N2 - Identifying the regions of a learning resource that a learner pays attention to is crucial for assessing the material's impact and improving its design and related support systems. Saliency detection in videos addresses the automatic recognition of attention-drawing regions in single frames. In educational settings, the recognition of pertinent regions in a video's visual stream can enhance content accessibility and information retrieval tasks such as video segmentation, navigation, and summarization. Such advancements can pave the way for the development of advanced AI-assisted technologies that support learning with greater efficacy. However, this task becomes particularly challenging for educational videos due to the combination of unique characteristics such as text, voice, illustrations, animations, and more. To the best of our knowledge, there is currently no study that evaluates saliency detection approaches in educational videos. In this paper, we address this gap by evaluating four state-of-the-art saliency detection approaches for educational videos. We reproduce the original studies and explore the replication capabilities for general-purpose (non-educational) datasets. Then, we investigate the generalization capabilities of the models and evaluate their performance on educational videos. We conduct a comprehensive analysis to identify common failure scenarios and possible areas of improvement. Our experimental results show that educational videos remain a challenging context for generic video saliency detection models.

AB - Identifying the regions of a learning resource that a learner pays attention to is crucial for assessing the material's impact and improving its design and related support systems. Saliency detection in videos addresses the automatic recognition of attention-drawing regions in single frames. In educational settings, the recognition of pertinent regions in a video's visual stream can enhance content accessibility and information retrieval tasks such as video segmentation, navigation, and summarization. Such advancements can pave the way for the development of advanced AI-assisted technologies that support learning with greater efficacy. However, this task becomes particularly challenging for educational videos due to the combination of unique characteristics such as text, voice, illustrations, animations, and more. To the best of our knowledge, there is currently no study that evaluates saliency detection approaches in educational videos. In this paper, we address this gap by evaluating four state-of-the-art saliency detection approaches for educational videos. We reproduce the original studies and explore the replication capabilities for general-purpose (non-educational) datasets. Then, we investigate the generalization capabilities of the models and evaluate their performance on educational videos. We conduct a comprehensive analysis to identify common failure scenarios and possible areas of improvement. Our experimental results show that educational videos remain a challenging context for generic video saliency detection models.

KW - educational videos

KW - video saliency detection

KW - video-based learning

UR - http://www.scopus.com/inward/record.url?scp=85210038718&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2408.04515

DO - 10.48550/arXiv.2408.04515

M3 - Conference contribution

AN - SCOPUS:85210038718

SP - 1743

EP - 1751

BT - CIKM 2024

Y2 - 21 October 2024 through 25 October 2024

ER -