Attention mechanisms in computer vision: A survey
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …
SegNeXt: Rethinking convolutional attention design for semantic segmentation
We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of semantic …
Visual attention network
While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …
SRFormer: Permuted self-attention for single image super-resolution
Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (e.g., SwinIR) can significantly improve the model performance but …
FlowFormer: A transformer architecture for optical flow
We introduce optical Flow transFormer, dubbed FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …
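The FlowFormer abstract above refers to a 4D cost volume. As a rough illustration only (assumed details, not FlowFormer's actual construction or tokenization), the sketch below builds the standard all-pairs correlation volume used in optical flow, where entry (i, j, u, v) is the similarity between the feature vector at position (i, j) in the first frame and position (u, v) in the second.

```python
import numpy as np

def all_pairs_cost_volume(feat1: np.ndarray, feat2: np.ndarray) -> np.ndarray:
    """Build a 4D cost volume from two feature maps of shape (C, H, W).

    cost[i, j, u, v] = <feat1[:, i, j], feat2[:, u, v]> / sqrt(C)

    Generic all-pairs correlation as a sketch; FlowFormer's own pipeline
    differs in how the volume is built and tokenized.
    """
    c, h, w = feat1.shape
    f1 = feat1.reshape(c, h * w)          # (C, H*W)
    f2 = feat2.reshape(c, h * w)          # (C, H*W)
    cost = f1.T @ f2 / np.sqrt(c)         # (H*W, H*W) pairwise dot products
    return cost.reshape(h, w, h, w)       # (H, W, H, W)

# Toy usage: two random 64-channel feature maps on a 32x32 grid.
feat1 = np.random.randn(64, 32, 32).astype(np.float32)
feat2 = np.random.randn(64, 32, 32).astype(np.float32)
print(all_pairs_cost_volume(feat1, feat2).shape)  # (32, 32, 32, 32)
```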
FlowFormer++: Masked cost volume autoencoding for pretraining optical flow estimation
FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …
Multimodal token fusion for vision transformers
Many adaptations of transformers have emerged to address the single-modal vision tasks,
where self-attention modules are stacked to handle input sources like images. Intuitively …
ProPainter: Improving propagation and transformer for video inpainting
Flow-based propagation and spatiotemporal Transformer are two mainstream mechanisms
in video inpainting (VI). Despite the effectiveness of these components, they still suffer from …
TransFlow: Transformer as flow learner
Optical flow is an indispensable building block for various important computer vision tasks,
including motion estimation, object tracking, and disparity measurement. In this work, we …
Towards an end-to-end framework for flow-guided video inpainting
Optical flow, which captures motion information across frames, is exploited in recent video
inpainting methods through propagating pixels along its trajectories. However, the hand …
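The last abstract describes exploiting optical flow by propagating pixels along its trajectories. As a minimal sketch of that basic operation (assumed details, not the paper's method), the snippet below backward-warps a reference frame into the current frame's coordinates using a dense flow field; flow-guided propagation in video inpainting builds on this kind of warping.

```python
import numpy as np
from scipy.ndimage import map_coordinates

def backward_warp(reference: np.ndarray, flow: np.ndarray) -> np.ndarray:
    """Warp `reference` (H, W, C) into the current frame using `flow` (H, W, 2).

    flow[y, x] = (dx, dy) points from the current frame to the reference frame,
    so the warped pixel at (y, x) is sampled from reference[y + dy, x + dx].
    """
    h, w, _ = reference.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    coords = np.stack([ys + flow[..., 1], xs + flow[..., 0]])  # (2, H, W)
    # Bilinear sampling (order=1), per channel; out-of-range samples clamp to the edge.
    return np.stack(
        [map_coordinates(reference[..., c], coords, order=1, mode="nearest")
         for c in range(reference.shape[-1])],
        axis=-1,
    )

# Toy usage: a constant flow of 2 px to the right shifts the frame accordingly.
frame = np.random.rand(48, 64, 3)
flow = np.zeros((48, 64, 2))
flow[..., 0] = 2.0
warped = backward_warp(frame, flow)
print(warped.shape)  # (48, 64, 3)
```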