Not all pixels are equal: Difficulty-aware semantic segmentation via deep layer cascade
We propose a novel deep layer cascade (LC) method to improve the accuracy and speed of
semantic segmentation. Unlike the conventional model cascade (MC) that is composed of …
semantic segmentation. Unlike the conventional model cascade (MC) that is composed of …
Crafting a toolchain for image restoration by deep reinforcement learning
We investigate a novel approach for image restoration by reinforcement learning. Unlike
existing studies that mostly train a single large network for a specialized task, we prepare a …
existing studies that mostly train a single large network for a specialized task, we prepare a …
Empowering relational network by self-attention augmented conditional random fields for group activity recognition
This paper presents a novel relational network for group activity recognition. The core of our
network is to augment the conditional random fields (CRF), amenable to learning inter …
network is to augment the conditional random fields (CRF), amenable to learning inter …
Learning deep spatio-temporal dependence for semantic video segmentation
Semantically labeling every pixel in a video is a very challenging task as video is an
information-intensive media with complex spatio-temporal dependence. We present in this …
information-intensive media with complex spatio-temporal dependence. We present in this …
Budget-aware deep semantic video segmentation
In this work, we study a poorly understood trade-off between accuracy and runtime costs for
deep semantic video segmentation. While recent work has demonstrated advantages of …
deep semantic video segmentation. While recent work has demonstrated advantages of …
Relational reasoning for group activity recognition via self-attention augmented conditional random field
This paper presents a new relational network for group activity recognition. The essence of
the network is to integrate conditional random fields (CRFs) with self-attention to infer the …
the network is to integrate conditional random fields (CRFs) with self-attention to infer the …
Anytime dense prediction with confidence adaptivity
Anytime inference requires a model to make a progression of predictions which might be
halted at any time. Prior research on anytime visual recognition has mostly focused on …
halted at any time. Prior research on anytime visual recognition has mostly focused on …
STPNet: A spatial-temporal propagation network for background subtraction
Y Yang, J Ruan, Y Zhang, X Cheng… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In background subtraction tasks, spatial and temporal contexts are beneficial in detecting
moving objects. The methods based on Deep Neural Networks in this task has explored …
moving objects. The methods based on Deep Neural Networks in this task has explored …
Quadtree generating networks: Efficient hierarchical scene parsing with sparse convolutions
Abstract Semantic segmentation with Convolutional Neural Networks is a memory-intensive
task due to the high spatial resolution of feature maps and output predictions. In this paper …
task due to the high spatial resolution of feature maps and output predictions. In this paper …
End-to-end background subtraction via a multi-scale spatio-temporal model
Y Yang, T Zhang, J Hu, D Xu, G **e - IEEE Access, 2019 - ieeexplore.ieee.org
Background subtraction is an important task in computer vision. Traditional approaches
usually utilize low-level visual features like color, texture, or edge to build background …
usually utilize low-level visual features like color, texture, or edge to build background …