Temporal action localization in untrimmed videos via multi-stage cnns

Z Shou, D Wang, SF Chang - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
We address temporal action localization in untrimmed long videos. This is important
because videos in real applications are usually unconstrained and contain multiple action …

Multiple instance learning: A survey of problem characteristics and applications

MA Carbonneau, V Cheplygina, E Granger… - Pattern Recognition, 2018 - Elsevier
Multiple instance learning (MIL) is a form of weakly supervised learning where training
instances are arranged in sets, called bags, and a label is provided for the entire bag. This …

Wildcat: Weakly supervised learning of deep convnets for image classification, pointwise localization and segmentation

T Durand, T Mordan, N Thome… - Proceedings of the …, 2017 - openaccess.thecvf.com
This paper introduces WILDCAT, a deep learning method which jointly aims at aligning
image regions for gaining spatial invariance and learning strongly localized features. Our …

Devnet: A deep event network for multimedia event detection and evidence recounting

C Gan, N Wang, Y Yang, DY Yeung… - Proceedings of the …, 2015 - openaccess.thecvf.com
In this paper, we focus on complex event detection in internet videos while also providing
the key evidences of the detection results. Convolutional Neural Networks (CNNs) have …

Multi-stream multi-class fusion of deep networks for video classification

Z Wu, YG Jiang, X Wang, H Ye, X Xue - Proceedings of the 24th ACM …, 2016 - dl.acm.org
This paper studies deep network architectures to address the problem of video classification.
A multi-stream framework is proposed to fully utilize the rich multimodal information in …

Weldon: Weakly supervised learning of deep convolutional neural networks

T Durand, N Thome, M Cord - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
In this paper, we introduce a novel framework for WEakly supervised Learning of Deep
cOnvolutional neural Networks (WELDON). Our method is dedicated to automatically …

Less is more: Learning highlight detection from video duration

B **ong, Y Kalantidis, D Ghadiyaram… - Proceedings of the …, 2019 - openaccess.thecvf.com
Highlight detection has the potential to significantly ease video browsing, but existing
methods often suffer from expensive supervision requirements, where human viewers must …

Event-based media processing and analysis: A survey of the literature

C Tzelepis, Z Ma, V Mezaris, B Ionescu… - Image and Vision …, 2016 - Elsevier
Research on event-based processing and analysis of media is receiving an increasing
attention from the scientific community due to its relevance for an abundance of applications …

Ensemble learning with label proportions for bankruptcy prediction

Z Chen, W Chen, Y Shi - Expert Systems with Applications, 2020 - Elsevier
Corporate bankruptcy prediction is an interesting and important research topic that can be
conceived in many practical applications. Recently, machine learning based methods have …

Modeling multimodal clues in a hybrid deep learning framework for video classification

YG Jiang, Z Wu, J Tang, Z Li, X Xue… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Videos are inherently multimodal. This paper studies the problem of exploiting the abundant
multimodal clues for improved video classification performance. We introduce a novel hybrid …