Deep learning for visual understanding: A review

Y Guo, Y Liu, A Oerlemans, S Lao, S Wu, MS Lew - Neurocomputing, 2016 - Elsevier
Deep learning algorithms are a subset of the machine learning algorithms, which aim at
discovering multiple levels of distributed representations. Recently, numerous deep learning …

Domain adaptation for visual applications: A comprehensive survey

G Csurka - arxiv preprint arxiv:1702.05374, 2017 - arxiv.org
The aim of this paper is to give an overview of domain adaptation and transfer learning with
a specific view on visual applications. After a general motivation, we first position domain …

Learning transferable visual models from natural language supervision

A Radford, JW Kim, C Hallacy… - International …, 2021 - proceedings.mlr.press
State-of-the-art computer vision systems are trained to predict a fixed set of predetermined
object categories. This restricted form of supervision limits their generality and usability since …

Regionclip: Region-based language-image pretraining

Y Zhong, J Yang, P Zhang, C Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Contrastive language-image pretraining (CLIP) using image-text pairs has achieved
impressive results on image classification in both zero-shot and transfer learning settings …

Revisiting unreasonable effectiveness of data in deep learning era

C Sun, A Shrivastava, S Singh… - Proceedings of the …, 2017 - openaccess.thecvf.com
The success of deep learning in vision can be attributed to:(a) models with high capacity;(b)
increased computational power; and (c) availability of large-scale labeled data. Since 2012 …

icarl: Incremental classifier and representation learning

SA Rebuffi, A Kolesnikov, G Sperl… - Proceedings of the …, 2017 - openaccess.thecvf.com
A major open problem on the road to artificial intelligence is the development of
incrementally learning systems that learn about more and more concepts over time from a …

End-to-end incremental learning

FM Castro, MJ Marín-Jiménez, N Guil… - Proceedings of the …, 2018 - openaccess.thecvf.com
Although deep learning approaches have stood out in recent years due to their state-of-the-
art results, they continue to suffer from catastrophic forgetting, a dramatic decrease in overall …

Exploring visual relationship for image captioning

T Yao, Y Pan, Y Li, T Mei - Proceedings of the European …, 2018 - openaccess.thecvf.com
It is always well believed that modeling relationships between objects would be helpful for
representing and eventually describing an image. Nevertheless, there has not been …

Making deep neural networks robust to label noise: A loss correction approach

G Patrini, A Rozza, A Krishna Menon… - Proceedings of the …, 2017 - openaccess.thecvf.com
We present a theoretically grounded approach to train deep neural networks, including
recurrent networks, subject to class-dependent label noise. We propose two procedures for …

Ok-vqa: A visual question answering benchmark requiring external knowledge

K Marino, M Rastegari, A Farhadi… - Proceedings of the …, 2019 - openaccess.thecvf.com
Abstract Visual Question Answering (VQA) in its ideal form lets us study reasoning in the
joint space of vision and language and serves as a proxy for the AI task of scene …