Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
An outlook into the future of egocentric vision
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …
research in egocentric vision and the ever-anticipated future, where wearable computing …
Hel** hands: An object-aware ego-centric video recognition model
We introduce an object-aware decoder for improving the performance of spatio-temporal
representations on ego-centric videos. The key idea is to enhance object-awareness during …
representations on ego-centric videos. The key idea is to enhance object-awareness during …
Object-centric video representation for long-term action anticipation
This paper focuses on building object-centric representations for long-term action
anticipation in videos. Our key motivation is that objects provide important cues to recognize …
anticipation in videos. Our key motivation is that objects provide important cues to recognize …
Appearance-agnostic representation learning for compositional action recognition
The discussion of compositional generalization in action recognition, ie., Compositional
Action Recognition (CAR), has recently received increasing attention. CAR challenges …
Action Recognition (CAR), has recently received increasing attention. CAR challenges …
Bi-causal: group activity recognition via bidirectional causality
Abstract Current approaches in Group Activity Recognition (GAR) predominantly emphasize
Human Relations (HRs) while often neglecting the impact of Human-Object Interactions …
Human Relations (HRs) while often neglecting the impact of Human-Object Interactions …
Simultaneous detection and interaction reasoning for object-centric action recognition
The interactions between human and objects are important for recognizing object-centric
actions. Existing methods usually adopt a two-stage pipeline, where object proposals are …
actions. Existing methods usually adopt a two-stage pipeline, where object proposals are …
Learning causal domain-invariant temporal dynamics for few-shot action recognition
Few-shot action recognition aims at quickly adapting a pre-trained model to the novel data
with a distribution shift using only a limited number of samples. Key challenges include how …
with a distribution shift using only a limited number of samples. Key challenges include how …
Extending Video Masked Autoencoders to 128 frames
Video understanding has witnessed significant progress with recent video foundation
models demonstrating strong performance owing to self-supervised pre-training objectives; …
models demonstrating strong performance owing to self-supervised pre-training objectives; …
Semantic-Aware Late-Stage Supervised Contrastive Learning for Fine-Grained Action Recognition
Y Pan, Q Zhao, Y Zhang, Z Wang… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
Fine-grained action recognition typically faces challenges with lower inter-class variances
and higher intra-class variances. Supervised contrastive learning is inherently suitable for …
and higher intra-class variances. Supervised contrastive learning is inherently suitable for …
Principles of Visual Tokens for Efficient Video Understanding
Video understanding has made huge strides in recent years, relying largely on the power of
the transformer architecture. As this architecture is notoriously expensive and video is highly …
the transformer architecture. As this architecture is notoriously expensive and video is highly …