- Academic Search

Y Weng, Z Pan, M Han, X Chang, B Zhuang - European Conference on …, 2022 - Springer

The task of action detection aims at deducing both the action category and localization of the
start and end moment for each action instance in a long, untrimmed video. While vision …

Speichern Zitieren Zitiert von: 38 Ähnliche Artikel Alle 8 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Decomposed cross-modal distillation for rgb-based temporal action detection

P Lee, T Kim, M Shim, D Wee… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Temporal action detection aims to predict the time intervals and the classes of action
instances in the video. Despite the promising performance, existing two-stream models …

Speichern Zitieren Zitiert von: 19 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Localizing moments in long video via multimodal guidance

W Barrios, M Soldan… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recent introduction of the large-scale, long-form MAD and Ego4D datasets has enabled
researchers to investigate the performance of current state-of-the-art methods for video …

Speichern Zitieren Zitiert von: 21 Ähnliche Artikel Alle 7 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Distilling vision-language pre-training to collaborate with weakly-supervised temporal action localization

C Ju, K Zheng, J Liu, P Zhao, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Weakly-supervised temporal action localization (WTAL) learns to detect and classify action
instances with only category labels. Most methods widely adopt the off-the-shelf …

Speichern Zitieren Zitiert von: 28 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Nsnet: Non-saliency suppression sampler for efficient video recognition

B **a, W Wu, H Wang, R Su, D He, H Yang… - … on Computer Vision, 2022 - Springer

It is challenging for artificial intelligence systems to achieve accurate video recognition
under the scenario of low computation costs. Adaptive inference based efficient video …

Speichern Zitieren Zitiert von: 24 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Temporal saliency query network for efficient video recognition

B **a, Z Wang, W Wu, H Wang, J Han - European Conference on …, 2022 - Springer

Efficient video recognition is a hot-spot research topic with the explosive growth of
multimedia data on the Internet and mobile devices. Most existing methods select the salient …

Speichern Zitieren Zitiert von: 24 Ähnliche Artikel Alle 7 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Temporal action localization in the deep learning era: A survey

B Wang, Y Zhao, L Yang, T Long… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

The temporal action localization research aims to discover action instances from untrimmed
videos, representing a fundamental step in the field of intelligent video understanding. With …

Speichern Zitieren Zitiert von: 27 Ähnliche Artikel Alle 6 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Egotv: Egocentric task verification from natural language task descriptions

R Hazra, B Chen, A Rai, N Kamra… - Proceedings of the …, 2023 - openaccess.thecvf.com

To enable progress towards egocentric agents capable of understanding everyday tasks
specified in natural language, we propose a benchmark and a synthetic dataset called …

Speichern Zitieren Zitiert von: 7 Ähnliche Artikel Alle 6 Versionen HTML-Version

Multi-level Content-aware Boundary Detection for Temporal Action Proposal Generation

T Su, H Wang, L Wang - IEEE Transactions on Image …, 2023 - ieeexplore.ieee.org

It is challenging to generate temporal action proposals from untrimmed videos. In general,
boundary-based temporal action proposal generators are based on detecting temporal …

Speichern Zitieren Zitiert von: 7 Ähnliche Artikel Alle 5 Versionen

MIFNet: Multiple instances focused temporal action proposal generation

L Wang, H Yao, H Yang, S Wang - Neurocomputing, 2023 - Elsevier

Temporal action proposal generation (TAPG) serves as a promising solution for video
analysis. However, the performance of existing methods is still far from satisfactory for real …

Speichern Zitieren Zitiert von: 4 Ähnliche Artikel Alle 4 Versionen

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Temporal action proposal generation with background constraint

An efficient spatio-temporal pyramid transformer for action detection

Decomposed cross-modal distillation for rgb-based temporal action detection

Localizing moments in long video via multimodal guidance

Distilling vision-language pre-training to collaborate with weakly-supervised temporal action localization

Nsnet: Non-saliency suppression sampler for efficient video recognition

Temporal saliency query network for efficient video recognition

Temporal action localization in the deep learning era: A survey

Egotv: Egocentric task verification from natural language task descriptions

Multi-level Content-aware Boundary Detection for Temporal Action Proposal Generation

MIFNet: Multiple instances focused temporal action proposal generation