Temporal action localization in untrimmed videos via multi-stage cnns
We address temporal action localization in untrimmed long videos. This is important
because videos in real applications are usually unconstrained and contain multiple action …
because videos in real applications are usually unconstrained and contain multiple action …
Multiple instance learning: A survey of problem characteristics and applications
Multiple instance learning (MIL) is a form of weakly supervised learning where training
instances are arranged in sets, called bags, and a label is provided for the entire bag. This …
instances are arranged in sets, called bags, and a label is provided for the entire bag. This …
Wildcat: Weakly supervised learning of deep convnets for image classification, pointwise localization and segmentation
This paper introduces WILDCAT, a deep learning method which jointly aims at aligning
image regions for gaining spatial invariance and learning strongly localized features. Our …
image regions for gaining spatial invariance and learning strongly localized features. Our …
Devnet: A deep event network for multimedia event detection and evidence recounting
In this paper, we focus on complex event detection in internet videos while also providing
the key evidences of the detection results. Convolutional Neural Networks (CNNs) have …
the key evidences of the detection results. Convolutional Neural Networks (CNNs) have …
Multi-stream multi-class fusion of deep networks for video classification
This paper studies deep network architectures to address the problem of video classification.
A multi-stream framework is proposed to fully utilize the rich multimodal information in …
A multi-stream framework is proposed to fully utilize the rich multimodal information in …
Weldon: Weakly supervised learning of deep convolutional neural networks
In this paper, we introduce a novel framework for WEakly supervised Learning of Deep
cOnvolutional neural Networks (WELDON). Our method is dedicated to automatically …
cOnvolutional neural Networks (WELDON). Our method is dedicated to automatically …
Less is more: Learning highlight detection from video duration
Highlight detection has the potential to significantly ease video browsing, but existing
methods often suffer from expensive supervision requirements, where human viewers must …
methods often suffer from expensive supervision requirements, where human viewers must …
Event-based media processing and analysis: A survey of the literature
Research on event-based processing and analysis of media is receiving an increasing
attention from the scientific community due to its relevance for an abundance of applications …
attention from the scientific community due to its relevance for an abundance of applications …
Ensemble learning with label proportions for bankruptcy prediction
Corporate bankruptcy prediction is an interesting and important research topic that can be
conceived in many practical applications. Recently, machine learning based methods have …
conceived in many practical applications. Recently, machine learning based methods have …
Modeling multimodal clues in a hybrid deep learning framework for video classification
Videos are inherently multimodal. This paper studies the problem of exploiting the abundant
multimodal clues for improved video classification performance. We introduce a novel hybrid …
multimodal clues for improved video classification performance. We introduce a novel hybrid …