- Academic Search

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

Lagre Referanse Sitert av 184 Beslektede artikler Alle 8 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

A review of convolutional neural network architectures and their optimizations

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer

The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

Lagre Referanse Sitert av 201 Beslektede artikler Alle 5 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Emergent correspondence from image diffusion

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023 - proceedings.neurips.cc

Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …

Lagre Referanse Sitert av 313 Beslektede artikler Alle 12 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

MOSE: A new dataset for video object segmentation in complex scenes

H Ding, C Liu, S He, X Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …

Lagre Referanse Sitert av 125 Beslektede artikler Alle 7 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Emerging properties in self-supervised vision transformers

M Caron, H Touvron, I Misra, H Jégou… - Proceedings of the …, 2021 - openaccess.thecvf.com

In this paper, we question if self-supervised learning provides new properties to Vision
Transformer (ViT) that stand out compared to convolutional networks (convnets). Beyond the …

Lagre Referanse Sitert av 6324 Beslektede artikler Alle 14 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Tap-vid: A benchmark for tracking any point in a video

C Doersch, A Gupta, L Markeeva… - Advances in …, 2022 - proceedings.neurips.cc

Generic motion understanding from video involves not only tracking objects, but also
perceiving how their surfaces deform and move. This information is useful to make …

Lagre Referanse Sitert av 136 Beslektede artikler Alle 7 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Kee** your eye on the ball: Trajectory attention in video transformers

M Patrick, D Campbell, Y Asano… - Advances in neural …, 2021 - proceedings.neurips.cc

In video transformers, the time dimension is often treated in the same way as the two spatial
dimensions. However, in a scene where objects or the camera may move, a physical point …

Lagre Referanse Sitert av 287 Beslektede artikler Alle 17 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

A generalist framework for panoptic segmentation of images and videos

T Chen, L Li, S Saxena, G Hinton… - Proceedings of the …, 2023 - openaccess.thecvf.com

Panoptic segmentation assigns semantic and instance ID labels to every pixel of an image.
As permutations of instance IDs are also valid solutions, the task requires learning of high …

Lagre Referanse Sitert av 107 Beslektede artikler Alle 6 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Particle video revisited: Tracking through occlusions using point trajectories

AW Harley, Z Fang, K Fragkiadaki - European Conference on Computer …, 2022 - Springer

Tracking pixels in videos is typically studied as an optical flow estimation problem, where
every pixel is described with a displacement vector that locates it in the next frame. Even …

Lagre Referanse Sitert av 153 Beslektede artikler Alle 4 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Self-supervised co-training for video representation learning

T Han, W **e, A Zisserman - Advances in neural information …, 2020 - proceedings.neurips.cc

The objective of this paper is visual-only self-supervised video representation learning. We
make the following contributions:(i) we investigate the benefit of adding semantic-class …

Lagre Referanse Sitert av 472 Beslektede artikler Alle 11 versjoner HTML-versjon

Opprett varsel

Referanse

Avansert søk

Lagret i Mitt bibliotek

Mast: A memory-augmented self-supervised tracker

A survey on self-supervised learning: Algorithms, applications, and future trends

A review of convolutional neural network architectures and their optimizations

Emergent correspondence from image diffusion

MOSE: A new dataset for video object segmentation in complex scenes

Emerging properties in self-supervised vision transformers

Tap-vid: A benchmark for tracking any point in a video

Kee** your eye on the ball: Trajectory attention in video transformers

A generalist framework for panoptic segmentation of images and videos

Particle video revisited: Tracking through occlusions using point trajectories

Self-supervised co-training for video representation learning