- Academic Search

[PDF] Crema: Multimodal compositional video reasoning via efficient modular adaptation and fusion

S Yu, J Yoon, M Bansal - arXiv preprint arXiv:2402.05889, 2024 - southnlp.github.io

Despite impressive advancements in multimodal compositional reasoning approaches, they
are still limited in their flexibility and efficiency by processing fixed modality inputs while …

Cited by 12, 2 versions

[HTML] sciencedirect.com

[HTML] Comprehensive review on 3D point cloud segmentation in plants

H Song, W Wen, S Wu, X Guo - Artificial Intelligence in Agriculture, 2025 - Elsevier

Segmentation of three-dimensional (3D) point clouds is fundamental in comprehending
unstructured structural and morphological data. It plays a critical role in research related to …

Point-to-pixel prompting for point cloud analysis with pre-trained image models

Z Wang, Y Rao, X Yu, J Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Nowadays, pre-training big models on large-scale datasets has achieved great success and
dominated many downstream tasks in natural language processing and 2D vision, while pre …

Cited by 7, 6 versions

[PDF] thecvf.com

RGB-D Cube R-CNN: 3D Object Detection with Selective Modality Dropout

J Piekenbrinck, A Hermans… - Proceedings of the …, 2024 - openaccess.thecvf.com

In this paper we create an RGB-D 3D object detector targeted at indoor robotics use cases
where one modality may be unavailable due to a specific sensor setup or a sensor failure …

Cited by 5

[PDF] thecvf.com

Weakly Supervised Point Cloud Semantic Segmentation via Artificial Oracle

H Kweon, J Kim, KJ Yoon - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Manual annotation of every point in a point cloud is a costly and labor-intensive process.
While weakly supervised point cloud semantic segmentation (WSPCSS) with sparse …

Cited by 5

[PDF] arxiv.org

Masked Image Modeling: A Survey

V Hondru, FA Croitoru, S Minaee, RT Ionescu… - arXiv preprint arXiv …, 2024 - arxiv.org

In this work, we survey recent studies on masked image modeling (MIM), an approach that
emerged as a powerful self-supervised learning technique in computer vision. The MIM task …

Cited by 3, 3 versions

[PDF] ieee.org

Infrastructure 3D Target detection based on multi-mode fusion for intelligent and connected vehicles

X Zhang, L He, R Lv, C **, Y Wang - IEEE Access, 2023 - ieeexplore.ieee.org

Autonomous driving technology faces significant safety challenges due to the lack of a
global perspective and the limitations of long-range perception capabilities. It is widely …

Cited by 5, 2 versions

Saved to My library

Mask3d: Pre-training 2d vision transformers by learning masked 3d priors

Improving 2d feature representations by 3d-aware fine-tuning
