Human uncertainty makes classification more robust

JC Peterson, RM Battleday… - Proceedings of the …, 2019 - openaccess.thecvf.com
The classification performance of deep neural networks has begun to asymptote at near-
perfect levels. However, their ability to generalize outside the training set and their …

Tvsum: Summarizing web videos using titles

Y Song, J Vallmitjana, A Stent… - Proceedings of the …, 2015 - openaccess.thecvf.com
Video summarization is a challenging problem in part because knowing which part of a
video is important requires prior knowledge about its main topic. We present TVSum, an …

HD-CNN: hierarchical deep convolutional neural networks for large scale visual recognition

Z Yan, H Zhang, R Piramuthu… - Proceedings of the …, 2015 - openaccess.thecvf.com
In image classification, visual separability between different object categories is highly
uneven, and some categories are more difficult to distinguish than others. Such difficult …

Large-scale object classification using label relation graphs

J Deng, N Ding, Y Jia, A Frome, K Murphy… - Computer Vision–ECCV …, 2014 - Springer
In this paper we study how to perform object classification in a principled way that exploits
the rich structure of real world labels. We develop a new model that allows encoding of …

B-CNN: branch convolutional neural network for hierarchical classification

X Zhu, M Bain - arxiv preprint arxiv:1709.09890, 2017 - arxiv.org
Convolutional Neural Network (CNN) image classifiers are traditionally designed to have
sequential convolutional layers with a single output layer. This is based on the assumption …

Generative models of visually grounded imagination

R Vedantam, I Fischer, J Huang, K Murphy - arxiv preprint arxiv …, 2017 - arxiv.org
It is easy for people to imagine what a man with pink hair looks like, even if they have never
seen such a person before. We call the ability to create images of novel semantic concepts …

Network of experts for large-scale image categorization

K Ahmed, MH Baig, L Torresani - … , The Netherlands, October 11–14, 2016 …, 2016 - Springer
We present a tree-structured network architecture for large-scale image classification. The
trunk of the network contains convolutional layers optimized over all classes. At a given …

Contemplating visual emotions: Understanding and overcoming dataset bias

R Panda, J Zhang, H Li, JY Lee, X Lu… - Proceedings of the …, 2018 - openaccess.thecvf.com
While machine learning approaches to visual emotion recognition offer great promise,
current methods consider training and testing models on small scale datasets covering …

Learning semantic relationships for better action retrieval in images

V Ramanathan, C Li, J Deng, W Han… - Proceedings of the …, 2015 - openaccess.thecvf.com
Human actions capture a wide variety of interactions between people and objects. As a
result, the set of possible actions is extremely large and it is difficult to obtain sufficient …

[KIRJA][B] Learning semantic image representations at a large scale

Y Jia - 2014 - search.proquest.com
I present my work towards learning a better computer vision system that learns and
generalizes object categories better, and behaves in ways closer to what human behave …