Not all pixels are equal: Difficulty-aware semantic segmentation via deep layer cascade

X Li, Z Liu, P Luo, C Change Loy… - Proceedings of the …, 2017 - openaccess.thecvf.com
We propose a novel deep layer cascade (LC) method to improve the accuracy and speed of
semantic segmentation. Unlike the conventional model cascade (MC) that is composed of …

Crafting a toolchain for image restoration by deep reinforcement learning

K Yu, C Dong, L Lin, CC Loy - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
We investigate a novel approach for image restoration by reinforcement learning. Unlike
existing studies that mostly train a single large network for a specialized task, we prepare a …

Empowering relational network by self-attention augmented conditional random fields for group activity recognition

RRA Pramono, YT Chen, WH Fang - European Conference on Computer …, 2020 - Springer
This paper presents a novel relational network for group activity recognition. The core of our
network is to augment the conditional random fields (CRF), amenable to learning inter …

Learning deep spatio-temporal dependence for semantic video segmentation

Z Qiu, T Yao, T Mei - IEEE Transactions on Multimedia, 2017 - ieeexplore.ieee.org
Semantically labeling every pixel in a video is a very challenging task as video is an
information-intensive media with complex spatio-temporal dependence. We present in this …

Budget-aware deep semantic video segmentation

B Mahasseni, S Todorovic… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
In this work, we study a poorly understood trade-off between accuracy and runtime costs for
deep semantic video segmentation. While recent work has demonstrated advantages of …

Relational reasoning for group activity recognition via self-attention augmented conditional random field

RRA Pramono, WH Fang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This paper presents a new relational network for group activity recognition. The essence of
the network is to integrate conditional random fields (CRFs) with self-attention to infer the …

Anytime dense prediction with confidence adaptivity

Z Liu, Z Xu, HJ Wang, T Darrell… - arxiv preprint arxiv …, 2021 - arxiv.org
Anytime inference requires a model to make a progression of predictions which might be
halted at any time. Prior research on anytime visual recognition has mostly focused on …

STPNet: A spatial-temporal propagation network for background subtraction

Y Yang, J Ruan, Y Zhang, X Cheng… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In background subtraction tasks, spatial and temporal contexts are beneficial in detecting
moving objects. The methods based on Deep Neural Networks in this task has explored …

Quadtree generating networks: Efficient hierarchical scene parsing with sparse convolutions

K Chitta, JM Alvarez, M Hebert - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Abstract Semantic segmentation with Convolutional Neural Networks is a memory-intensive
task due to the high spatial resolution of feature maps and output predictions. In this paper …

End-to-end background subtraction via a multi-scale spatio-temporal model

Y Yang, T Zhang, J Hu, D Xu, G **e - IEEE Access, 2019 - ieeexplore.ieee.org
Background subtraction is an important task in computer vision. Traditional approaches
usually utilize low-level visual features like color, texture, or edge to build background …