A survey of video datasets for human action and activity recognition

JM Chaquet, EJ Carmona… - Computer Vision and …, 2013 - Elsevier
Vision-based human action and activity recognition has an increasing importance among
the computer vision community with applications to visual surveillance, video retrieval and …

Multiple instance learning: A survey of problem characteristics and applications

MA Carbonneau, V Cheplygina, E Granger… - Pattern Recognition, 2018 - Elsevier
Multiple instance learning (MIL) is a form of weakly supervised learning where training
instances are arranged in sets, called bags, and a label is provided for the entire bag. This …

W2vv++: Fully deep learning for ad-hoc video search

X Li, C Xu, G Yang, Z Chen, J Dong - Proceedings of the 27th ACM …, 2019 - dl.acm.org
Ad-hoc video search (AVS) is an important yet challenging problem in multimedia retrieval.
Different from previous concept-based methods, we propose a fully deep learning method …

Bi-level semantic representation analysis for multimedia event detection

X Chang, Z Ma, Y Yang, Z Zeng… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
Multimedia event detection has been one of the major endeavors in video event analysis. A
variety of approaches have been proposed recently to tackle this problem. Among others …

Event-based media processing and analysis: A survey of the literature

C Tzelepis, Z Ma, V Mezaris, B Ionescu… - Image and Vision …, 2016 - Elsevier
Research on event-based processing and analysis of media is receiving an increasing
attention from the scientific community due to its relevance for an abundance of applications …

Multi-modal event topic model for social event analysis

S Qian, T Zhang, C Xu, J Shao - IEEE transactions on …, 2015 - ieeexplore.ieee.org
With the massive growth of social events in Internet, it has become more and more difficult to
exactly find and organize the interesting events from massive social media data, which is …

Data-driven crowd understanding: A baseline for a large-scale crowd dataset

C Zhang, K Kang, H Li, X Wang, R Xie… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Crowd understanding has drawn increasing attention from the computer vision community,
and its progress is driven by the availability of public crowd datasets. In this paper, we …

A continuous learning framework for activity recognition using deep hybrid feature models

M Hasan, AK Roy-Chowdhury - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Most of the research on human activity recognition has focused on learning a static model,
considering that all the training instances are labeled and present in advance, while in …

Semantic concept discovery for large-scale zero-shot event detection

X Chang, Y Yang, AG Hauptmann… - … Joint Conference on …, 2015 - research.monash.edu
We focus on detecting complex events in unconstrained Internet videos. While most existing
works rely on the abundance of labeled training data, we consider a more difficult zero-shot …

Videostory: A new multimedia embedding for few-example recognition and translation of events

A Habibian, T Mensink, CGM Snoek - Proceedings of the 22nd ACM …, 2014 - dl.acm.org
This paper proposes a new video representation for few-example event recognition and
translation. Different from existing representations, which rely on either low-level features, or …