Hyperbolic contrastive learning for visual representations beyond objects
Although self-/un-supervised methods have led to rapid progress in visual representation
learning, these methods generally treat objects and scenes using the same lens. In this …
learning, these methods generally treat objects and scenes using the same lens. In this …
In or out? fixing imagenet out-of-distribution detection evaluation
Out-of-distribution (OOD) detection is the problem of identifying inputs which are unrelated to
the in-distribution task. The OOD detection performance when the in-distribution (ID) is …
the in-distribution task. The OOD detection performance when the in-distribution (ID) is …
Genecis: A benchmark for general conditional image similarity
We argue that there are many notions of'similarity'and that models, like humans, should be
able to adapt to these dynamically. This contrasts with most representation learning …
able to adapt to these dynamically. This contrasts with most representation learning …
Distilling model failures as directions in latent space
Existing methods for isolating hard subpopulations and spurious correlations in datasets
often require human intervention. This can make these methods labor-intensive and dataset …
often require human intervention. This can make these methods labor-intensive and dataset …
Effective human-AI teams via learned natural language rules and onboarding
People are relying on AI agents to assist them with various tasks. The human must know
when to rely on the agent, collaborate with the agent, or ignore its suggestions. In this work …
when to rely on the agent, collaborate with the agent, or ignore its suggestions. In this work …
The song describer dataset: a corpus of audio captions for music-and-language evaluation
We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality
audio-caption pairs, designed for the evaluation of music-and-language models. The …
audio-caption pairs, designed for the evaluation of music-and-language models. The …
Analyzing dataset annotation quality management in the wild
Data quality is crucial for training accurate, unbiased, and trustworthy machine learning
models as well as for their correct evaluation. Recent work, however, has shown that even …
models as well as for their correct evaluation. Recent work, however, has shown that even …
Automated classification of model errors on imagenet
While the ImageNet dataset has been driving computer vision research over the past
decade, significant label noise and ambiguity have made top-1 accuracy an insufficient …
decade, significant label noise and ambiguity have made top-1 accuracy an insufficient …
Understanding the detrimental class-level effects of data augmentation
Data augmentation (DA) encodes invariance and provides implicit regularization critical to a
model's performance in image classification tasks. However, while DA improves average …
model's performance in image classification tasks. However, while DA improves average …
Understanding and mitigating the label noise in pre-training on downstream tasks
Pre-training on large-scale datasets and then fine-tuning on downstream tasks have
become a standard practice in deep learning. However, pre-training data often contain label …
become a standard practice in deep learning. However, pre-training data often contain label …