A survey and performance evaluation of deep learning methods for small object detection

Y Liu, P Sun, N Wergeles, Y Shang - Expert Systems with Applications, 2021 - Elsevier
In computer vision, significant advances have been made on object detection with the rapid
development of deep convolutional neural networks (CNN). This paper provides a …

Video description: A survey of methods, datasets, and evaluation metrics

N Aafaq, A Mian, W Liu, SZ Gilani, M Shah - ACM Computing Surveys …, 2019 - dl.acm.org
Video description is the automatic generation of natural language sentences that describe
the contents of a given video. It has applications in human-robot interaction, helping the …

The revisiting problem in simultaneous localization and mapping: A survey on visual loop closure detection

KA Tsintotas, L Bampis… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Where am I? This is one of the most critical questions that any intelligent system should
answer to decide whether it navigates to a previously visited area. This problem has long …

Gather-excite: Exploiting feature context in convolutional neural networks

J Hu, L Shen, S Albanie, G Sun… - Advances in neural …, 2018 - proceedings.neurips.cc
While the use of bottom-up local operators in convolutional neural networks (CNNs)
matches well some of the statistics of natural images, it may also prevent such models from …

Relation networks for object detection

H Hu, J Gu, Z Zhang, J Dai… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Although it is well believed for years that modeling relations between objects would help
object recognition, there has not been evidence that the idea is working in the deep learning …

InLoc: Indoor visual localization with dense matching and view synthesis

H Taira, M Okutomi, T Sattler… - Proceedings of the …, 2018 - openaccess.thecvf.com
We seek to predict the 6 degree-of-freedom (6DoF) pose of a query photograph with respect
to a large indoor 3D map. The contributions of this work are three-fold. First, we develop a …

Spatial pyramid-enhanced NetVLAD with weighted triplet loss for place recognition

J Yu, C Zhu, J Zhang, Q Huang… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
We propose an end-to-end place recognition model based on a novel deep neural network.
First, we propose to exploit the spatial pyramid structure of the images to enhance the vector …

Finding tiny faces

P Hu, D Ramanan - Proceedings of the IEEE conference on …, 2017 - openaccess.thecvf.com
Though tremendous strides have been made in object recognition, one of the remaining
open challenges is detecting small objects. We explore three aspects of the problem in the …

Malware classification with deep convolutional neural networks

M Kalash, M Rochan, N Mohammed… - 2018 9th IFIP …, 2018 - ieeexplore.ieee.org
In this paper, we propose a deep learning framework for malware classification. There has
been a huge increase in the volume of malware in recent years which poses a serious …

Visual semantic navigation using scene priors

W Yang, X Wang, A Farhadi, A Gupta… - arXiv preprint arXiv …, 2018 - arxiv.org
How do humans navigate to target objects in novel scenes? Do we use the
semantic/functional priors we have built over years to efficiently search and navigate? For …