Deep neural network concepts for background subtraction: A systematic review and comparative evaluation
Conventional neural networks have been demonstrated to be a powerful framework for
background subtraction in video acquired by static cameras. Indeed, the well-known Self …
background subtraction in video acquired by static cameras. Indeed, the well-known Self …
Video description: A survey of methods, datasets, and evaluation metrics
Video description is the automatic generation of natural language sentences that describe
the contents of a given video. It has applications in human-robot interaction, hel** the …
the contents of a given video. It has applications in human-robot interaction, hel** the …
Bevt: Bert pretraining of video transformers
This paper studies the BERT pretraining of video transformers. It is a straightforward but
worth-studying extension given the recent success from BERT pretraining of image …
worth-studying extension given the recent success from BERT pretraining of image …
Wilddeepfake: A challenging real-world dataset for deepfake detection
In recent years, the abuse of a face swap technique called deepfake has raised enormous
public concerns. So far, a large number of deepfake videos (known as" deepfakes") have …
public concerns. So far, a large number of deepfake videos (known as" deepfakes") have …
Masked video distillation: Rethinking masked feature modeling for self-supervised video representation learning
Benefiting from masked visual modeling, self-supervised video representation learning has
achieved remarkable progress. However, existing methods focus on learning …
achieved remarkable progress. However, existing methods focus on learning …
Tokenlearner: Adaptive space-time tokenization for videos
In this paper, we introduce a novel visual representation learning which relies on a handful
of adaptively learned tokens, and which is applicable to both image and video …
of adaptively learned tokens, and which is applicable to both image and video …
Rethinking spatiotemporal feature learning: Speed-accuracy trade-offs in video classification
Despite the steady progress in video analysis led by the adoption of convolutional neural
networks (CNNs), the relative improvement has been less drastic as that in 2D static image …
networks (CNNs), the relative improvement has been less drastic as that in 2D static image …
Eco: Efficient convolutional network for online video understanding
The state of the art in video understanding suffers from two problems:(1) The major part of
reasoning is performed locally in the video, thus missing important relationships within …
reasoning is performed locally in the video, thus missing important relationships within …
ISTVT: interpretable spatial-temporal video transformer for deepfake detection
With the rapid development of Deepfake synthesis technology, our information security and
personal privacy have been severely threatened in recent years. To achieve a robust …
personal privacy have been severely threatened in recent years. To achieve a robust …
Youtube-8m: A large-scale video classification benchmark
Many recent advancements in Computer Vision are attributed to large datasets. Open-
source software packages for Machine Learning and inexpensive commodity hardware …
source software packages for Machine Learning and inexpensive commodity hardware …