A survey on video moment localization
Video moment localization, also known as video moment retrieval, aims to search for a target
segment within a video described by a given natural language query. Beyond the task of …
Temporal sentence grounding in videos: A survey and future directions
Temporal sentence grounding in videos (TSGV), also known as natural language video localization
(NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment that …
Deconfounded video moment retrieval with causal intervention
We tackle the task of video moment retrieval (VMR), which aims to localize a specific
moment in a video according to a textual query. Existing methods primarily model the …
Context-aware biaffine localizing network for temporal sentence grounding
This paper addresses the problem of temporal sentence grounding (TSG), which aims to
identify the temporal boundary of a specific segment from an untrimmed video by a sentence …
MAD: A scalable dataset for language grounding in videos from movie audio descriptions
The recent and increasing interest in video-language research has driven the development
of large-scale datasets that enable data-intensive machine learning techniques. In …
Fast video moment retrieval
This paper targets fast video moment retrieval (fast VMR), aiming to localize the target
moment efficiently and accurately as queried by a given natural language sentence. We …
Knowing where to focus: Event-aware transformer for video grounding
Recent DETR-based video grounding models directly predict moment
timestamps without any hand-crafted components, such as a pre-defined proposal or non …
You can ground earlier than see: An effective and efficient pipeline for temporal sentence grounding in compressed videos
Given an untrimmed video, temporal sentence grounding (TSG) aims to locate a target
moment semantically according to a sentence query. Although previous respectable works …
G2L: Semantically aligned and uniform video grounding via geodesic and game theory
Recent video grounding works attempt to introduce vanilla contrastive learning into video
grounding. However, we claim that this naive solution is suboptimal. Contrastive learning …
LocVTP: Video-text pre-training for temporal localization
Video-Text Pre-training (VTP) aims to learn transferable representations for various
downstream tasks from large-scale web videos. To date, almost all existing VTP methods …