A review of deep learning for video captioning

M Abdar, M Kollati, S Kuraparthi… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Video captioning (VC) is a fast-moving, cross-disciplinary area of research that comprises
contributions from domains such as computer vision, natural language processing …

Combinatorial Analysis of Deep Learning and Machine Learning Video Captioning Studies: A Systematic Literature Review

T Kehkashan, A Alsaeedi, WMS Yafooz… - IEEE …, 2024 - ieeexplore.ieee.org
Recent improvements formulated in the area of video captioning have brought rapid
revolutions in its methods and the performance of its models. Machine learning and deep …

Machine generation of audio description for blind and visually impaired people

VP Campos, LMG Gonçalves, WL Ribeiro… - ACM Transactions on …, 2023 - dl.acm.org
Automating the generation of audio descriptions (AD) for blind and visually impaired (BVI)
people is a difficult task, since it has several challenges involved, such as: identifying gaps …

Augmenting video lectures: Identifying off-topic concepts and linking to relevant video lecture segments

K Ghosh, SR Nangi, Y Kanchugantla… - International Journal of …, 2022 - Springer
Video lectures are considered as one of the primary media to cater good-quality educational
content to the learners. The video lectures illustrate the course-relevant concepts with …

Recommending Personalized Video Lecture Augmentations with Tagged Community Question Answers

K Ghosh - International Journal of Artificial Intelligence in …, 2025 - Springer
Regardless of their domains, level, or expertise, students consider video lectures one of the
most popular learning media while engaged in self-study sessions on any e-learning …

The role of the input in natural language video description

S Cascianelli, G Costante, A Devo… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Natural language video description (NLVD) has recently received strong interest in the
computer vision, natural language processing (NLP), multimedia, and autonomous robotics …

Towards automatic textual summarization of movies

C Liu, M Last, A Shmilovici - Recent Developments and the New Direction …, 2020 - Springer
With the rapidly increasing number of online video resources, the ability of automatically
understanding those videos becomes more and more important, since it is almost …

Affective question answering on video

N Ruwa, Q Mao, L Wang, J Gou - Neurocomputing, 2019 - Elsevier
Abstract Visual Question Answering (VQA) is an increasingly popular research area in
machine learning. Most of the existing VQA tasks only focus on static images, and only a few …

Application of Deep Learning in Video Question Answering System

M Pandya, A Parekhji, A Shahane… - Design of Intelligent …, 2021 - taylorfrancis.com
Recently, a research area of machine learning that has grown in popularity is Video
Question Answering (VQA). A lot of models based on this concept are used for static images …

Sistema de geração automática de audiodescrição a partir de análise de conteúdo de vídeo

VP Campos - 2019 - bdtd.ibict.br
A audiodescrição é um recurso de acessibilidade projetado para tornar a informação visual
acessível a pessoas cegas ou com baixa visão. Para aumentar a oferta de faixas de …