Google Scholar

AB Amjoud, M Amrouch - IEEE Access, 2023 - ieeexplore.ieee.org

Detecting objects remains one of computer vision and image understanding applications'
most fundamental and challenging aspects. Significant advances in object detection have …

Cited by 176 - All 4 versions

[HTML] mdpi.com

Object detection and image segmentation with deep learning on earth observation data: A review-part i: Evolution and recent trends

T Hoeser, C Kuenzer - Remote Sensing, 2020 - mdpi.com

Deep learning (DL) has great influence on large parts of science and increasingly
established itself as an adaptive method for new challenges in the field of Earth observation …

Cited by 383 - All 14 versions

[PDF] ulsan.ac.kr

YOLOv9: Learning what you want to learn using programmable gradient information

CY Wang, IH Yeh, HY Mark Liao - European conference on computer …, 2024 - Springer

Today's deep learning methods focus on how to design the objective functions to make the
prediction as close as possible to the target. Meanwhile, an appropriate neural network …

Cited by 1644 - All 12 versions

[PDF] thecvf.com

InternImage: Exploring large-scale vision foundation models with deformable convolutions

W Wang, J Dai, Z Chen, Z Huang, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

Compared to the great progress of large-scale vision transformers (ViTs) in recent years,
large-scale models based on convolutional neural networks (CNNs) are still in an early …

Cited by 856 - All 10 versions

[PDF] arxiv.org

EVA-02: A visual representation for neon genesis

Y Fang, Q Sun, X Wang, T Huang, X Wang… - Image and Vision …, 2024 - Elsevier

We launch EVA-02, a next-generation Transformer-based visual representation pre-trained
to reconstruct strong and robust language-aligned vision features via masked image …

Cited by 244 - All 6 versions

[PDF] thecvf.com

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

CY Wang, A Bochkovskiy… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Real-time object detection is one of the most important research topics in computer vision.
As new approaches regarding architecture optimization and training optimization are …

Cited by 10091 - All 15 versions

[PDF] neurips.cc

BEVFusion: A simple and robust lidar-camera fusion framework

T Liang, H Xie, K Yu, Z Xia, Z Lin… - Advances in …, 2022 - proceedings.neurips.cc

Fusing the camera and LiDAR information has become a de-facto standard for 3D object
detection tasks. Current methods rely on point clouds from the LiDAR sensor as queries to …

Cited by 406 - All 7 versions

[PDF] arxiv.org

Exploring plain vision transformer backbones for object detection

Y Li, H Mao, R Girshick, K He - European conference on computer vision, 2022 - Springer

We explore the plain, non-hierarchical Vision Transformer (ViT) as a backbone network for
object detection. This design enables the original ViT architecture to be fine-tuned for object …

Cited by 940 - All 7 versions

[PDF] mlr.press

mPLUG-2: A modularized multi-modal foundation model across text, image and video

H Xu, Q Ye, M Yan, Y Shi, J Ye, Y Xu… - International …, 2023 - proceedings.mlr.press

Recent years have witnessed a big convergence of language, vision, and multi-modal
pretraining. In this work, we present mPLUG-2, a new unified paradigm with modularized …

Cited by 135 - All 6 versions

[PDF] thecvf.com

DETRs with hybrid matching

D Jia, Y Yuan, H He, X Wu, H Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com

One-to-one set matching is a key design for DETR to establish its end-to-end capability, so
that object detection does not require a hand-crafted NMS (non-maximum suppression) to …

Cited by 229 - All 8 versions

CBNet: A composite backbone network architecture for object detection

Object detection using deep learning, CNNs and vision transformers: A review
