Combinatorial analysis of deep learning and machine learning video captioning studies: a systematic literature review

T Kehkashan, A Alsaeedi, WMS Yafooz… - IEEE …, 2024 - ieeexplore.ieee.org
Recent improvements formulated in the area of video captioning have brought rapid
revolutions in its methods and the performance of its models. Machine learning and deep …

CLIP-based Semantic Enhancement and Vocabulary Expansion for Video Captioning Using Reinforcement Learning

L Zheng, P Guo, Z Miao, W Xu - 2024 International Joint …, 2024 - ieeexplore.ieee.org
Video captioning aims to comprehend the content of videos and automatically generate
sentences. It necessitates a network with a robust knowledge background to understand …

Deep learning and multimodal artificial neural network architectures for disease diagnosis and clinical applications

J Thomas, ED Raj - Machine Learning and Deep Learning in …, 2022 - taylorfrancis.com
Machine learning is an important utility of artificial intelligence that provides systems with the
capacity to automatically examine and enhance action without being specially programmed …

[PDF][PDF] Video captioning using neural networks

P Padmawar, R Borade, A Hol - … for Research in Applied Science and …, 2022 - academia.edu
Researchers in the fields of computer vision and natural language processing have been
concentrating their efforts in recent years on automatically develo** natural language …

Multimodal image classifier using textual and visual embeddings

A Fuxman, A Timofeev, Z Li, CT Lu, M Shah… - US Patent …, 2024 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for realizing a multimodal image classifier. In an aspect, a method includes …

[PDF][PDF] Real-time Audio Video Summarization

NS Patil, SS Patil, AM Pawar, C Goel… - Journal ofMobile …, 2022 - researchgate.net
Video processing has grown in importance for several reasons in today's advanced
technological environment where everything is changing at a much faster rate. It is crucial …

Method and system for annotating video scenes in video data

D Kutylov - US Patent App. 18/454,853, 2023 - Google Patents
000SAee scqucncc of sccncs analyzing whcthcr (i) thc changcs bctwccn adjacent vidco
frames exceed a predefined threshold basing on comparison of video frame histograms (ii) …