Transfer learning enhanced vision-based human activity recognition: a decade-long analysis
The discovery of several machine learning and deep learning techniques has paved the
way to extend the reach of humans in various real-world applications. Classical machine …
way to extend the reach of humans in various real-world applications. Classical machine …
Egovlpv2: Egocentric video-language pre-training with fusion in the backbone
Video-language pre-training (VLP) has become increasingly important due to its ability to
generalize to various vision and language tasks. However, existing egocentric VLP …
generalize to various vision and language tasks. However, existing egocentric VLP …
A review on video summarization techniques
The exponential growth of technology has resulted in a profusion of advanced imaging
devices and eases internet accessibility, leading to an increase in the creation and use of …
devices and eases internet accessibility, leading to an increase in the creation and use of …
Video summarization using deep neural networks: A survey
Video summarization technologies aim to create a concise and complete synopsis by
selecting the most informative parts of the video content. Several approaches have been …
selecting the most informative parts of the video content. Several approaches have been …
Clip-it! language-guided video summarization
A generic video summary is an abridged version of a video that conveys the whole story and
features the most important scenes. Yet the importance of scenes in a video is often …
features the most important scenes. Yet the importance of scenes in a video is often …
Deep reinforcement learning for unsupervised video summarization with diversity-representativeness reward
Video summarization aims to facilitate large-scale video browsing by producing short,
concise summaries that are diverse and representative of original videos. In this paper, we …
concise summaries that are diverse and representative of original videos. In this paper, we …
Unsupervised video summarization with adversarial lstm networks
This paper addresses the problem of unsupervised video summarization, formulated as
selecting a sparse subset of video frames that optimally represent the input video. Our key …
selecting a sparse subset of video frames that optimally represent the input video. Our key …
Video summarization with long short-term memory
We propose a novel supervised learning technique for summarizing videos by automatically
selecting keyframes or key subshots. Casting the task as a structured prediction problem …
selecting keyframes or key subshots. Casting the task as a structured prediction problem …
Dsnet: A flexible detect-to-summarize network for video summarization
In this paper, we propose a Detect-to-Summarize network (DSNet) framework for supervised
video summarization. Our DSNet contains anchor-based and anchor-free counterparts. The …
video summarization. Our DSNet contains anchor-based and anchor-free counterparts. The …
Video summarization with attention-based encoder–decoder networks
This paper addresses the problem of supervised video summarization by formulating it as a
sequence-to-sequence learning problem, where the input is a sequence of original video …
sequence-to-sequence learning problem, where the input is a sequence of original video …