Google Scholar

C Wei, H Fan, S Xie, CY Wu, A Yuille… - Proceedings of the …, 2022 - openaccess.thecvf.com

We present Masked Feature Prediction (MaskFeat) for self-supervised pre-training
of video models. Our approach first randomly masks out a portion of the input sequence and …

Cited by 741

MViTv2: Improved multiscale vision transformers for classification and detection

Y Li, CY Wu, H Fan, K Mangalam… - Proceedings of the …, 2022 - openaccess.thecvf.com

In this paper, we study Multiscale Vision Transformers (MViTv2) as a unified architecture for
image and video classification, as well as object detection. We present an improved version …

Cited by 865

Multiscale vision transformers

H Fan, B Xiong, K Mangalam, Y Li… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present Multiscale Vision Transformers (MViT) for video and image recognition,
by connecting the seminal idea of multiscale feature hierarchies with transformer models …

Cited by 1565

Recurring the transformer for video action recognition

J Yang, X Dong, L Liu, C Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com

Existing video understanding approaches, such as 3D convolutional neural networks and
Transformer-Based methods, usually process the videos in a clip-wise manner. Hence huge …

Cited by 112

Transformer-based deep learning model and video dataset for unsafe action identification in construction projects

M Yang, C Wu, Y Guo, R Jiang, F Zhou, J Zhang… - Automation in …, 2023 - Elsevier

A large proportion of construction accidents are caused by unintentional and unsafe actions
and behaviors. It is of significant difficulties and ineffectiveness to monitor unsafe behaviors …

Cited by 52

A content-driven micro-video recommendation dataset at scale

Y Ni, Y Cheng, X Liu, J Fu, Y Li, X He, Y Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Micro-videos have recently gained immense popularity, sparking critical research in
micro-video recommendation with significant implications for the entertainment, advertising, and e …

Cited by 33

TorchGeo: deep learning with geospatial data

AJ Stewart, C Robinson, IA Corley, A Ortiz… - Proceedings of the 30th …, 2022 - dl.acm.org

Remotely sensed geospatial data are critical for applications including precision agriculture,
urban planning, disaster monitoring and response, and climate change research, among …

Cited by 80

Spotting temporally precise, fine-grained events in video

J Hong, H Zhang, M Gharbi, M Fisher… - European Conference on …, 2022 - Springer

We introduce the task of spotting temporally precise, fine-grained events in video (detecting
the precise moment in time events occur). Precise spotting requires models to reason …

Cited by 36

AugLy: Data augmentations for robustness

Z Papakipos, J Bitton - arXiv preprint arXiv:2201.06494, 2022 - arxiv.org

We introduce AugLy, a data augmentation library with a focus on adversarial robustness.
AugLy provides a wide array of augmentations for multiple modalities (audio, image, text, & …

Cited by 54

WOODS: Benchmarks for out-of-distribution generalization in time series

JC Gagnon-Audet, K Ahuja, MJ Darvishi-Bayazi… - arXiv preprint arXiv …, 2022 - arxiv.org

Machine learning models often fail to generalize well under distributional shifts.
Understanding and overcoming these failures have led to a research field of Out-of …

Cited by 44

PyTorchVideo: A deep learning library for video understanding

Masked feature prediction for self-supervised visual pre-training
