STAT: Spatial-temporal attention mechanism for video captioning

C Yan, Y Tu, X Wang, Y Zhang, X Hao… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Video captioning refers to automatic generate natural language sentences, which
summarize the video contents. Inspired by the visual attention mechanism of human beings …

Unsupervised person re-identification: Clustering and fine-tuning

H Fan, L Zheng, C Yan, Y Yang - ACM Transactions on Multimedia …, 2018 - dl.acm.org
The superiority of deeply learned pedestrian representations has been reported in very
recent literature of person re-identification (re-ID). In this article, we consider the more …

Deep hyperspectral image sharpening

R Dian, S Li, A Guo, L Fang - IEEE transactions on neural …, 2018 - ieeexplore.ieee.org
Hyperspectral image (HSI) sharpening, which aims at fusing an observable low spatial
resolution (LR) HSI (LR-HSI) with a high spatial resolution (HR) multispectral image (HR …

Spatial and semantic convolutional features for robust visual object tracking

J Zhang, X **, J Sun, J Wang, AK Sangaiah - Multimedia Tools and …, 2020 - Springer
Robust and accurate visual tracking is a challenging problem in computer vision. In this
paper, we exploit spatial and semantic convolutional features extracted from convolutional …

Supervised hash coding with deep neural network for environment perception of intelligent vehicles

C Yan, H **e, D Yang, J Yin, Y Zhang… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Image content analysis is an important surround perception modality of intelligent vehicles.
In order to efficiently recognize the on-road environment based on image content analysis …

Unsupervised deep video hashing via balanced code for large-scale video retrieval

G Wu, J Han, Y Guo, L Liu, G Ding… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
This paper proposes a deep hashing framework, namely, unsupervised deep video hashing
(UDVH), for large-scale video similarity search with the aim to learn compact yet effective …

A scalable region-based level set method using adaptive bilateral filter for noisy image segmentation

H Yu, F He, Y Pan - Multimedia Tools and Applications, 2020 - Springer
Image segmentation plays an important role in the computer vision. However, it is extremely
challenging due to low resolution, high noise and blurry boundaries. Recently, region-based …

Joint transmission map estimation and dehazing using deep networks

H Zhang, V Sindagi, VM Patel - IEEE Transactions on Circuits …, 2019 - ieeexplore.ieee.org
Single image haze removal is an extremely challenging problem due to its inherent ill-posed
nature. Several prior-based and learning-based methods have been proposed in the …

Deep cascade learning

ES Marquez, JS Hare… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
In this paper, we propose a novel approach for efficient training of deep neural networks in a
bottom-up fashion using a layered structure. Our algorithm, which we refer to as deep …

Optimization of deep convolutional neural network for large scale image retrieval

C Bai, L Huang, X Pan, J Zheng, S Chen - Neurocomputing, 2018 - Elsevier
Feature extraction and similarity measurement are two key steps in image retrieval. AlexNet
is a classical deep convolutional neural network for image classification, but using it directly …