Google Scholar

Robustness-aware 3D object detection in autonomous driving: A review and outlook

Z Song, L Liu, F Jia, Y Luo, C Jia… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …

Cited by 34

[PDF] arxiv.org

YOLOv9: Learning what you want to learn using programmable gradient information

CY Wang, IH Yeh, HY Mark Liao - European conference on computer …, 2024 - Springer

Today's deep learning methods focus on how to design the objective functions to make the
prediction as close as possible to the target. Meanwhile, an appropriate neural network …

Cited by 1418

[PDF] arxiv.org

Tip-Adapter: Training-free adaption of CLIP for few-shot classification

R Zhang, W Zhang, R Fang, P Gao, K Li, J Dai… - European conference on …, 2022 - Springer

Abstract Contrastive Vision-Language Pre-training, known as CLIP, has provided a new
paradigm for learning visual representations using large-scale image-text pairs. It shows …

Cited by 346

[PDF] neurips.cc

Point-M2AE: Multi-scale masked autoencoders for hierarchical point cloud pre-training

R Zhang, Z Guo, P Gao, R Fang… - Advances in neural …, 2022 - proceedings.neurips.cc

Masked Autoencoders (MAE) have shown great potentials in self-supervised pre-training for
language and 2D image transformers. However, it still remains an open question on how to …

Cited by 265

[PDF] mdpi.com

A survey of visual transformers

Y Liu, Y Zhang, Y Wang, F Hou, J Yuan… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …

Cited by 452

[PDF] thecvf.com

PiMAE: Point cloud and image interactive masked autoencoders for 3D object detection

A Chen, K Zhang, R Zhang, Z Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Masked Autoencoders learn strong visual representations and achieve state-of-the-art
results in several independent modalities, yet very few works have addressed their …

Cited by 75

[PDF] mdpi.com

Recent advances and perspectives in deep learning techniques for 3D point cloud data processing

Z Ding, Y Sun, S Xu, Y Pan, Y Peng, Z Mao - Robotics, 2023 - mdpi.com

In recent years, deep learning techniques for processing 3D point cloud data have seen
significant advancements, given their unique ability to extract relevant features and handle …

Cited by 22

[PDF] arxiv.org

Vision-centric BEV perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

Cited by 134

[PDF] thecvf.com

Query-dependent video representation for moment retrieval and highlight detection

WJ Moon, S Hyun, SU Park, D Park… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recently, video moment retrieval and highlight detection (MR/HD) are being spotlighted as
the demand for video understanding is drastically increased. The key objective of MR/HD is …

Cited by 119

[PDF] aaai.org

CALIP: Zero-shot enhancement of CLIP with parameter-free attention

Z Guo, R Zhang, L Qiu, X Ma, X Miao, X He… - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Abstract Contrastive Language-Image Pre-training (CLIP) has been shown to learn visual
representations with promising zero-shot performance. To further improve its downstream …

Cited by 113

MonoDETR: Depth-guided transformer for monocular 3D object detection

Robustness-aware 3d object detection in autonomous driving: A review and outlook
