An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Rethinking clip-based video learners in cross-domain open-vocabulary action recognition

KY Lin, H Ding, J Zhou, YM Tang, YX Peng… - arXiv preprint arXiv …, 2024 - arxiv.org
Building upon the impressive success of CLIP (Contrastive Language-Image Pretraining),
recent pioneering works have proposed to adapt the powerful CLIP to video data, leading to …

Human-centric transformer for domain adaptive action recognition

KY Lin, J Zhou, WS Zheng - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
We study the domain adaptation task for action recognition, namely domain adaptive action
recognition, which aims to effectively transfer action recognition power from a label-sufficient …

AFF-ttention! Affordances and Attention Models for Short-Term Object Interaction Anticipation

L Mur-Labadia, R Martinez-Cantin, JJ Guerrero… - … on Computer Vision, 2024 - Springer
Short-Term object-interaction Anticipation (STA) consists of detecting the location of
the next-active objects, the noun and verb categories of the interaction, and the time to …

Multimodal cross-domain few-shot learning for egocentric action recognition

M Hatano, R Hachiuma, R Fujii, H Saito - European Conference on …, 2024 - Springer
We address a novel cross-domain few-shot learning (CD-FSL) task with multimodal input
and unlabeled target data for egocentric action recognition. This paper simultaneously …

Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?

R Leonardi, A Furnari, F Ragusa… - European Conference on …, 2024 - Springer
In this study, we investigate the effectiveness of synthetic data in enhancing egocentric hand-
object interaction detection. Via extensive experiments and comparative analyses on three …

EgoNCE++: Do egocentric video-language models really understand hand-object interactions?

B Xu, Z Wang, Y Du, Z Song, S Zheng, Q Jin - arXiv preprint arXiv …, 2024 - arxiv.org
Egocentric video-language pretraining is a crucial paradigm to advance the learning of
egocentric hand-object interactions (EgoHOI). Despite the great success on existing …

A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives

SA Peirone, F Pistilli, A Alliegro… - Proceedings of the …, 2024 - openaccess.thecvf.com
Human comprehension of a video stream is naturally broad: in a few instants we are able to
understand what is happening, the relevance and relationship of objects, and forecast what …

A survey on deep learning techniques for action anticipation

Z Zhong, M Martin, M Voit, J Gall, J Beyerer - arXiv preprint arXiv …, 2023 - arxiv.org
The ability to anticipate possible future human actions is essential for a wide range of
applications, including autonomous driving and human-robot interaction. Consequently …

What does CLIP know about peeling a banana?

C Cuttano, G Rosi, G Trivigno… - Proceedings of the …, 2024 - openaccess.thecvf.com
Humans show an innate capability to identify tools to support specific actions. The
association between object parts and the actions they facilitate is usually named …