A review on 2D instance segmentation based on deep neural networks

W Gu, S Bai, L Kong - Image and Vision Computing, 2022 - Elsevier
Image instance segmentation involves labeling pixels of images with classes and instances,
which is one of the pivotal technologies in many domains, such as natural scenes …

Video abstraction: A systematic review and classification

BT Truong, S Venkatesh - ACM transactions on multimedia computing …, 2007 - dl.acm.org
The demand for various multimedia applications is rapidly increasing due to the recent
advance in the computing and network infrastructure, together with the widespread use of …

Egocentric video-language pretraining

KQ Lin, J Wang, M Soldan, M Wray… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Video-Language Pretraining (VLP), which aims to learn transferable representation
to advance a wide range of video-text downstream tasks, has recently received increasing …

Salient object detection in the deep learning era: An in-depth survey

W Wang, Q Lai, H Fu, J Shen, H Ling… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
As an essential problem in computer vision, salient object detection (SOD) has attracted an
increasing amount of research attention over the years. Recent advances in SOD are …

Saliency detection by multi-context deep learning

R Zhao, W Ouyang, H Li… - Proceedings of the IEEE …, 2015 - openaccess.thecvf.com
Low-level saliency cues or priors do not produce good enough saliency detection results
especially when the salient object presents in a low-contrast background with confusing …

Deep contrast learning for salient object detection

G Li, Y Yu - Proceedings of the IEEE conference on …, 2016 - openaccess.thecvf.com
Salient object detection has recently witnessed substantial progress due to powerful
features extracted using deep convolutional neural networks (CNNs). However, existing …

Video summarization with long short-term memory

K Zhang, WL Chao, F Sha, K Grauman - … 14, 2016, Proceedings, Part VII 14, 2016 - Springer
We propose a novel supervised learning technique for summarizing videos by automatically
selecting keyframes or key subshots. Casting the task as a structured prediction problem …

Tvsum: Summarizing web videos using titles

Y Song, J Vallmitjana, A Stent… - Proceedings of the …, 2015 - openaccess.thecvf.com
Video summarization is a challenging problem in part because knowing which part of a
video is important requires prior knowledge about its main topic. We present TVSum, an …

Contrast-based image attention analysis by using fuzzy growing

YF Ma, HJ Zhang - Proceedings of the eleventh ACM international …, 2003 - dl.acm.org
Visual attention analysis provides an alternative methodology to semantic image
understanding in many applications such as adaptive content delivery and region-based …

Video summarization with attention-based encoder–decoder networks

Z Ji, K **ong, Y Pang, X Li - … on Circuits and Systems for Video …, 2019 - ieeexplore.ieee.org
This paper addresses the problem of supervised video summarization by formulating it as a
sequence-to-sequence learning problem, where the input is a sequence of original video …