- Academic Search

Adaptformer: Adapting vision transformers for scalable visual recognition

S Chen, C Ge, Z Tong, J Wang… - Advances in …, 2022 - proceedings.neurips.cc

Pretraining Vision Transformers (ViTs) has achieved great success in visual
recognition. A following scenario is to adapt a ViT to various image and video recognition …

Cited by 612

Imagenet-21k pretraining for the masses

T Ridnik, E Ben-Baruch, A Noy… - ar…

… better image captioning models, yet most of them rely on a separate object detector to extract regional features …

Cited by 121

Ml-decoder: Scalable and versatile classification head

T Ridnik, G Sharir, A Ben-Cohen… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this paper, we introduce ML-Decoder, a new attention-based classification head. ML-
Decoder predicts the existence of class labels via queries, and enables better utilization of …

Cited by 127

Re-labeling imagenet: from single to multi-labels, from global to localized labels

S Yun, SJ Oh, B Heo, D Han… - Proceedings of the …, 2021 - openaccess.thecvf.com

ImageNet has been the most popular image classification benchmark, but it is also the one
with a significant level of label noise. Recent studies have shown that many samples contain …

Cited by 178

MRN: A locally and globally mention-based reasoning network for document-level relation extraction

J Li, K Xu, F Li, H Fei, Y Ren, D Ji - Findings of the Association for …, 2021 - aclanthology.org

Document-level relation extraction aims to detect the relations within one document, which is
challenging since it requires complex reasoning using mentions, entities, local and global …

Cited by 144

Cdul: Clip-driven unsupervised learning for multi-label image classification

R Abdelfattah, Q Guo, X Li, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

This paper presents a CLIP-based unsupervised learning method for annotation-free multi-
label image classification, including three stages: initialization, training, and inference. At the …

Cited by 40

Triplet attention and dual-pool contrastive learning for clinic-driven multi-label medical image classification

Y Zhang, L Luo, Q Dou, PA Heng - Medical image analysis, 2023 - Elsevier

Multi-label classification (MLC) can attach multiple labels on single image, and has
achieved promising results on medical images. But existing MLC methods still face …

Cited by 47
