Google Scholar

RubiksNet: Learnable 3D-shift for efficient video action recognition

Transformer for skeleton-based action recognition: A review of recent advances

W Xin, R Liu, Y Liu, Y Chen, W Yu, Q Miao - Neurocomputing, 2023 - Elsevier

Skeleton-based action recognition has rapidly become one of the most popular and
essential research topics in computer vision. The task is to analyze the characteristics of …

Cited by 61 - All 2 versions

Action recognition based on RGB and skeleton data sets: A survey

R Yue, Z Tian, S Du - Neurocomputing, 2022 - Elsevier

Action recognition is a major branch of computer vision research. As a widely used
technology, action recognition has been applied to human–computer interaction, intelligent …

Cited by 57 - All 3 versions

[PDF] arxiv.org

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org

AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

Cited by 4844 - All 2 versions

[PDF] thecvf.com

Assembly101: A large-scale multi-view video dataset for understanding procedural activities

F Sener, D Chatterjee, D Shelepov… - Proceedings of the …, 2022 - openaccess.thecvf.com

Assembly101 is a new procedural activity dataset featuring 4321 videos of people
assembling and disassembling 101 "take-apart" toy vehicles. Participants work without fixed …

Cited by 212 - All 9 versions

[PDF] thecvf.com

Revisiting the "video" in video-language understanding

S Buch, C Eyzaguirre, A Gaidon, J Wu… - Proceedings of the …, 2022 - openaccess.thecvf.com

What makes a video task uniquely suited for videos, beyond what can be understood from a
single image? Building on recent progress in self-supervised image-language models, we …

Cited by 182 - All 7 versions

[PDF] arxiv.org

Dynamic neural networks: A survey

Y Han, G Huang, S Song, L Yang… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …

Cited by 795 - All 5 versions

[PDF] thecvf.com

Video transformer network

D Neimark, O Bar, M Zohar… - Proceedings of the …, 2021 - openaccess.thecvf.com

This paper presents VTN, a transformer-based framework for video recognition. Inspired by
recent developments in vision transformers, we ditch the standard approach in video action …

Cited by 592 - All 9 versions

[PDF] thecvf.com

MoViNets: Mobile video networks for efficient video recognition

D Kondratyuk, L Yuan, Y Li, L Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present Mobile Video Networks (MoViNets), a family of computation and
memory efficient video networks that can operate on streaming video for online inference …

Cited by 308 - All 7 versions

[PDF] thecvf.com

MoLo: Motion-augmented long-short contrastive learning for few-shot action recognition

X Wang, S Zhang, Z Qing, C Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Current state-of-the-art approaches for few-shot action recognition achieve promising
performance by conducting frame-level matching on learned visual features. However, they …

Cited by 69 - All 8 versions

PIT: Progressive interaction transformer for pedestrian crossing intention prediction

Y Zhou, G Tan, R Zhong, Y Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

For autonomous driving, one of the major challenges is to predict pedestrian crossing
intention in ego-view. Pedestrian intention depends not only on their intrinsic goals but also …

Cited by 44 - All 3 versions
