Google Scholar

Remote sensing object detection in the deep learning era—a review

S Gui, S Song, R Qin, Y Tang - Remote Sensing, 2024 - mdpi.com

Given the large volume of remote sensing images collected daily, automatic object detection
and segmentation have been a consistent need in Earth observation (EO). However, objects …

Cited by 90 · All 6 versions

[HTML] sciencedirect.com

[HTML] A review on deep learning in UAV remote sensing

LP Osco, JM Junior, APM Ramos… - International Journal of …, 2021 - Elsevier

Deep Neural Networks (DNNs) learn representation from data with an impressive
capability, and brought important breakthroughs for processing images, time-series, natural …

Cited by 418 · All 5 versions

[PDF] neurips.cc

Vmamba: Visual state space model

Y Liu, Y Tian, Y Zhao, H Yu, L Xie… - Advances in neural …, 2025 - proceedings.neurips.cc

Designing computationally efficient network architectures remains an ongoing necessity in
computer vision. In this paper, we adapt Mamba, a state-space language model, into …

Cited by 1100 · All 12 versions

[PDF] thecvf.com

Unireplknet: A universal perception large-kernel convnet for audio video point cloud time-series and image recognition

X Ding, Y Zhang, Y Ge, S Zhao… - Proceedings of the …, 2024 - openaccess.thecvf.com

Large-kernel convolutional neural networks (ConvNets) have recently received extensive
research attention but two unresolved and critical issues demand further investigation. 1) …

Cited by 143 · All 6 versions

[PDF] thecvf.com

Eva: Exploring the limits of masked visual representation learning at scale

Y Fang, W Wang, B Xie, Q Sun, L Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com

We launch EVA, a vision-centric foundation model to explore the limits of visual
representation at scale using only publicly accessible data. EVA is a vanilla ViT pre-trained …

Cited by 711 · All 6 versions

[PDF] thecvf.com

Diffusiondet: Diffusion model for object detection

S Chen, P Sun, Y Song, P Luo - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We propose DiffusionDet, a new framework that formulates object detection as a denoising
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …

Cited by 494 · All 5 versions

[PDF] thecvf.com

Internimage: Exploring large-scale vision foundation models with deformable convolutions

W Wang, J Dai, Z Chen, Z Huang, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

Compared to the great progress of large-scale vision transformers (ViTs) in recent years,
large-scale models based on convolutional neural networks (CNNs) are still in an early …

Cited by 847 · All 10 versions

[PDF] arxiv.org

Eva-02: A visual representation for neon genesis

Y Fang, Q Sun, X Wang, T Huang, X Wang… - Image and Vision …, 2024 - Elsevier

We launch EVA-02, a next-generation Transformer-based visual representation pre-trained
to reconstruct strong and robust language-aligned vision features via masked image …

Cited by 241 · All 6 versions

[PDF] thecvf.com

Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com

All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

Cited by 172 · All 6 versions

[PDF] thecvf.com

Image as a foreign language: Beit pretraining for vision and vision-language tasks

W Wang, H Bao, L Dong, J Bjorck… - Proceedings of the …, 2023 - openaccess.thecvf.com

A big convergence of language, vision, and multimodal pretraining is emerging. In this work,
we introduce a general-purpose multimodal foundation model BEiT-3, which achieves …

Cited by 570 · All 8 versions

Cascade R-CNN: High quality object detection and instance segmentation

Remote sensing object detection in the deep learning era—a review
