RGB-D saliency detection via cascaded mutual information minimization

J Zhang, DP Fan, Y Dai, X Yu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Existing RGB-D saliency detection models do not explicitly encourage RGB and depth to
achieve effective multi-modal learning. In this paper, we introduce a novel multi-stage …

Ensemble deep learning for skeleton-based action recognition using temporal sliding LSTM networks

I Lee, D Kim, S Kang, S Lee - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
This paper addresses the problems of feature representation of skeleton joints and the
modeling of temporal dynamics to recognize human actions. Traditional methods generally …

Fully deep blind image quality predictor

J Kim, S Lee - IEEE Journal of selected topics in signal …, 2016 - ieeexplore.ieee.org
In general, owing to the benefits obtained from original information, full-reference image
quality assessment (FR-IQA) achieves relatively higher prediction accuracy than no …

Graph edge convolutional neural networks for skeleton-based action recognition

X Zhang, C Xu, X Tian, D Tao - IEEE transactions on neural …, 2019 - ieeexplore.ieee.org
Body joints, directly obtained from a pose estimation model, have proven effective for action
recognition. Existing works focus on analyzing the dynamics of human joints. However …

Methods for reducing visual discomfort in stereoscopic 3D: A review

K Terzić, M Hansard - Signal Processing: Image Communication, 2016 - Elsevier
Visual discomfort is a significant obstacle to the wider use of stereoscopic 3D displays. Many
studies have identified the most common causes of discomfort, and a rich body of literature …

Deep video quality assessor: From spatio-temporal visual sensitivity to a convolutional neural aggregation network

W Kim, J Kim, S Ahn, J Kim… - Proceedings of the …, 2018 - openaccess.thecvf.com
Incorporating spatio-temporal human visual perception into video quality assessment (VQA)
remains a formidable issue. Previous statistical or computational models of spatio-temporal …

Binocular spatial activity and reverse saliency driven no-reference stereopair quality assessment

L Liu, B Liu, CC Su, H Huang, AC Bovik - Signal Processing: Image …, 2017 - Elsevier
We develop a new model for no-reference 3D stereopair quality assessment that considers
the impact of binocular fusion, rivalry, suppression, and a reverse saliency effect on the …

The future of collaborative human-artificial intelligence decision-making for mission planning

SE Kase, CP Hung, T Krayzman, JZ Hare… - Frontiers in …, 2022 - frontiersin.org
In an increasingly complex military operating environment, next generation wargaming
platforms can reduce risk, decrease operating costs, and improve overall outcomes. Novel …

From human pose similarity metric to 3D human pose estimator: Temporal propagating LSTM networks

K Lee, W Kim, S Lee - IEEE transactions on pattern analysis …, 2022 - ieeexplore.ieee.org
Predicting a 3D pose directly from a monocular image is a challenging problem. Most pose
estimation methods proposed in recent years have shown 'quantitatively' good results (below …

Implementation of a virtual training simulator based on 360° multi-view human action recognition

B Kwon, J Kim, K Lee, YK Lee, S Park, S Lee - IEEE Access, 2017 - ieeexplore.ieee.org
Virtual training has received a considerable amount of research attention in recent years
due to its potential for use in a variety of applications, such as virtual military training, virtual …