A comprehensive survey of few-shot learning: Evolution, applications, challenges, and opportunities
Few-shot learning (FSL) has emerged as an effective learning method and shows great
potential. Despite the recent creative works in tackling FSL tasks, learning valid information …
potential. Despite the recent creative works in tackling FSL tasks, learning valid information …
A systematic review on affective computing: Emotion models, databases, and recent advances
Affective computing conjoins the research topics of emotion recognition and sentiment
analysis, and can be realized with unimodal or multimodal data, consisting primarily of …
analysis, and can be realized with unimodal or multimodal data, consisting primarily of …
Clip2scene: Towards label-efficient 3d scene understanding by clip
Abstract Contrastive Language-Image Pre-training (CLIP) achieves promising results in 2D
zero-shot and few-shot learning. Despite the impressive performance in 2D, applying CLIP …
zero-shot and few-shot learning. Despite the impressive performance in 2D, applying CLIP …
Lit: Zero-shot transfer with locked-image text tuning
This paper presents contrastive-tuning, a simple method employing contrastive training to
align image and text models while still taking advantage of their pre-training. In our empirical …
align image and text models while still taking advantage of their pre-training. In our empirical …
Finetuned language models are zero-shot learners
This paper explores a simple method for improving the zero-shot learning abilities of
language models. We show that instruction tuning--finetuning language models on a …
language models. We show that instruction tuning--finetuning language models on a …
Global attention mechanism: Retain information to enhance channel-spatial interactions
A variety of attention mechanisms have been studied to improve the performance of various
computer vision tasks. However, the prior methods overlooked the significance of retaining …
computer vision tasks. However, the prior methods overlooked the significance of retaining …
Language models enable zero-shot prediction of the effects of mutations on protein function
Modeling the effect of sequence variation on function is a fundamental problem for
understanding and designing proteins. Since evolution encodes information about function …
understanding and designing proteins. Since evolution encodes information about function …
Clip-forge: Towards zero-shot text-to-shape generation
Generating shapes using natural language can enable new ways of imagining and creating
the things around us. While significant recent progress has been made in text-to-image …
the things around us. While significant recent progress has been made in text-to-image …
Learning concise and descriptive attributes for visual recognition
Recent advances in foundation models present new opportunities for interpretable visual
recognition--one can first query Large Language Models (LLMs) to obtain a set of attributes …
recognition--one can first query Large Language Models (LLMs) to obtain a set of attributes …
Learning transferable visual models from natural language supervision
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined
object categories. This restricted form of supervision limits their generality and usability since …
object categories. This restricted form of supervision limits their generality and usability since …