A survey on generative AI and LLM for video generation, understanding, and streaming

P Zhou, L Wang, Z Liu, Y Hao, P Hui, S Tarkoma… - arXiv preprint …
… events,
object-object interactions, domain specificity, and other semantics that are worth highlighting …

V2T: video to text framework using a novel automatic shot boundary detection algorithm

A Singh, TD Singh, S Bandyopadhyay - Multimedia Tools and Applications, 2022 - Springer
The generation of natural language descriptions for a video has been reported by many
researchers till now. But, it is still the most interesting research topic among the researchers …

A variational selection mechanism for article comment generation

J Liu, P Cheng, J Dai, J Liu - Expert Systems with Applications, 2024 - Elsevier
Article comment generation is a new and challenging task in natural language processing,
which has recently received much attention from researchers. In the article comment …

An efficient keyframes selection based framework for video captioning

A Singh, LS Meetei, SM Singh, TD Singh… - Proceedings of the …, 2021 - aclanthology.org
Describing a video is a challenging yet attractive task since it falls into the intersection of
computer vision and natural language generation. The attention-based models have …

VATEX2020: pLSTM framework for video captioning

A Singh, SM Singh, LS Meetei, R Das, TD Singh… - Procedia Computer …, 2023 - Elsevier
Captioning a video involves condensing the video's information into text, which can be
useful in video sentiment analysis, video-guided machine translation (VMT), visual question …

Enriching Knowledge Management Through Generative AI Using Sora's Capabilities.

S Bhattacharjee, S Pahari - IUP Journal of Knowledge …, 2024 - search.ebscohost.com
Deep learning algorithms have enabled generative artificial intelligence (GenAI), which has
been increasingly popular in recent years because of its capacity to imitate and produce …

Enhanced visual multi-modal fusion framework for dense video captioning

R Zhong, Q Zhang, M Zuo - 2023 - researchsquare.com
Dense video captioning is a machine translation task that aims to localize events from full
video and describe them separately. Human observers who parse the video are also …