- Academic Search

S Izquierdo, J Civera - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Abstract The task of Visual Place Recognition (VPR) aims to match a query image against
references from an extensive database of images from different places relying solely on …

Enregistrer Citer Cité 46 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Matte anything: Interactive natural image matting with segment anything model

J Yao, X Wang, L Ye, W Liu - Image and Vision Computing, 2024 - Elsevier

Natural image matting algorithms aim to predict the transparency map (alpha-matte) with the
trimap guidance. However, the production of trimap often requires significant labor, which …

Enregistrer Citer Cité 34 fois Autres articles Les 3 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey

H Yunusa, S Qin, AHA Chukkol, AA Yusuf… - arxiv preprint arxiv …, 2024 - arxiv.org

The hybrid of Convolutional Neural Network (CNN) and Vision Transformers (ViT)
architectures has emerged as a groundbreaking approach, pushing the boundaries of …

Enregistrer Citer Cité 12 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] pkwyx.com

Diffusion for natural image matting

Y Hu, Y Lin, W Wang, Y Zhao, Y Wei, H Shi - European Conference on …, 2024 - Springer

Existing natural image matting algorithms inevitably have flaws in their predictions on
difficult cases, and their one-step prediction manner cannot further correct these errors. In …

Enregistrer Citer Cité 8 fois Autres articles Les 2 versions Free GPT-4

Exploring the synergies of hybrid convolutional neural network and Vision Transformer architectures for computer vision: A survey

Y Haruna, S Qin, AHA Chukkol, AA Yusuf, I Bello… - … Applications of Artificial …, 2025 - Elsevier

Abstract The hybrid of Convolutional Neural Network (CNN) and Vision Transformer (ViT)
architecture has emerged as a groundbreaking approach, pushing the boundaries of …

Enregistrer Citer Autres articles

[Free GPT-4]

[PDF] arxiv.org

Endodac: Efficient adapting foundation model for self-supervised depth estimation from any endoscopic camera

B Cui, M Islam, L Bai, A Wang, H Ren - International Conference on …, 2024 - Springer

Depth estimation plays a crucial role in various tasks within endoscopic surgery, including
navigation, surface reconstruction, and augmented reality visualization. Despite the …

Enregistrer Citer Cité 11 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Transparent image layer diffusion using latent transparency

L Zhang, M Agrawala - arxiv preprint arxiv:2402.17113, 2024 - arxiv.org

We present LayerDiffusion, an approach enabling large-scale pretrained latent diffusion
models to generate transparent images. The method allows generation of single transparent …

Enregistrer Citer Cité 25 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

SparseDC: Depth Completion from sparse and non-uniform inputs

C Long, W Zhang, Z Chen, H Wang, Y Liu, P Tong… - Information …, 2024 - Elsevier

We propose SparseDC, a model for Depth Completion from Sparse and non-uniform inputs.
Unlike previous methods focusing on completing fixed distributions on benchmark datasets …

Enregistrer Citer Cité 7 fois Autres articles Les 3 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

PD Tudosiu, Y Yang, S Zhang, F Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-to-image generation has achieved astonishing results yet precise spatial controllability
and prompt fidelity remain highly challenging. This limitation is typically addressed through …

Enregistrer Citer Cité 5 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Unifying Automatic and Interactive Matting with Pretrained ViTs

Z Ye, W Liu, H Guo, Y Liang, C Hong… - Proceedings of the …, 2024 - openaccess.thecvf.com

Automatic and interactive matting largely improve image matting by respectively alleviating
the need for auxiliary input and enabling object selection. Due to different settings on …

Enregistrer Citer Cité 3 fois Autres articles Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Vitmatte: Boosting image matting with pre-trained plain vision transformers

Optimal transport aggregation for visual place recognition

Matte anything: Interactive natural image matting with segment anything model

Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey

Diffusion for natural image matting

Exploring the synergies of hybrid convolutional neural network and Vision Transformer architectures for computer vision: A survey

Endodac: Efficient adapting foundation model for self-supervised depth estimation from any endoscopic camera

Transparent image layer diffusion using latent transparency

SparseDC: Depth Completion from sparse and non-uniform inputs

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

Unifying Automatic and Interactive Matting with Pretrained ViTs