Google Scholar

Object detection using deep learning methods in traffic scenarios

A Boukerche, Z Hou - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

The recent boom of autonomous driving nowadays has made object detection in traffic
scenes a hot topic of research. Designed to classify and locate instances in the image, this is …

Cited by 108

Action recognition based on RGB and skeleton data sets: A survey

R Yue, Z Tian, S Du - Neurocomputing, 2022 - Elsevier

Action recognition is a major branch of computer vision research. As a widely used
technology, action recognition has been applied to human–computer interaction, intelligent …

Cited by 57 · All 2 versions

VideoComposer: Compositional video synthesis with motion controllability

X Wang, H Yuan, S Zhang, D Chen… - Advances in …, 2024 - proceedings.neurips.cc

The pursuit of controllability as a higher standard of visual content creation has yielded
remarkable progress in customizable image synthesis. However, achieving controllable …

Cited by 285 · All 6 versions

MeMViT: Memory-augmented multiscale vision transformer for efficient long-term video recognition

CY Wu, Y Li, K Mangalam, H Fan… - Proceedings of the …, 2022 - openaccess.thecvf.com

While today's video recognition systems parse snapshots or short clips accurately, they
cannot connect the dots and reason across a longer range of time yet. Most existing video …

Cited by 237 · All 5 versions

X3D: Expanding architectures for efficient video recognition

C Feichtenhofer - Proceedings of the IEEE/CVF conference …, 2020 - openaccess.thecvf.com

This paper presents X3D, a family of efficient video networks that progressively expand a
tiny 2D image classification architecture along multiple network axes, in space, time, width …

Cited by 1268 · All 7 versions

SlowFast networks for video recognition

C Feichtenhofer, H Fan, J Malik… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

We present SlowFast networks for video recognition. Our model involves (i) a Slow pathway,
operating at low frame rate, to capture spatial semantics, and (ii) a Fast pathway, operating …

Cited by 4175 · All 11 versions

TCGL: Temporal contrastive graph for self-supervised video representation learning

Y Liu, K Wang, L Liu, H Lan, L Lin - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …

Cited by 140 · All 6 versions

Learning in the frequency domain

K Xu, M Qin, F Sun, Y Wang… - Proceedings of the …, 2020 - openaccess.thecvf.com

Deep neural networks have achieved remarkable success in computer vision tasks. Existing
neural networks mainly operate in the spatial domain with fixed input sizes. For practical …

Cited by 497 · All 12 versions

DVC: An end-to-end deep video compression framework

G Lu, W Ouyang, D Xu, X Zhang… - Proceedings of the …, 2019 - openaccess.thecvf.com

Conventional video compression approaches use the predictive coding architecture and
encode the corresponding motion information and residual information. In this paper, taking …

Cited by 754 · All 11 versions

Long-term feature banks for detailed video understanding

CY Wu, C Feichtenhofer, H Fan, K He… - Proceedings of the …, 2019 - openaccess.thecvf.com

To understand the world, we humans constantly need to relate the present to the past, and
put events in context. In this paper, we enable existing video models to do the same. We …

Cited by 602 · All 10 versions

Compressed video action recognition