Hawkes processes for events in social media

MA Rizoiu, Y Lee, S Mishra, L **e - Frontiers of multimedia research, 2017 - dl.acm.org
This chapter provides an accessible introduction for point processes, and especially Hawkes
processes, for modeling discrete, inter-dependent events over continuous time. We start by …

Deep learning for video classification and captioning

Z Wu, T Yao, Y Fu, YG Jiang - Frontiers of multimedia research, 2017 - dl.acm.org
Today's digital contents are inherently multimedia: text, audio, image, video, and so on.
Video, in particular, has become a new way of communication between Internet users with …

Enhancing micro-video understanding by harnessing external sounds

L Nie, X Wang, J Zhang, X He, H Zhang… - Proceedings of the 25th …, 2017 - dl.acm.org
Different from traditional long videos, micro-videos are much shorter and usually recorded at
a specific place with mobile devices. To better understand the semantics of a micro-video …

Large-scale audio event discovery in one million youtube videos

A Jansen, JF Gemmeke, DPW Ellis… - … , Speech and Signal …, 2017 - ieeexplore.ieee.org
Internet videos provide a virtually boundless source of audio with a conspicuous lack of
localized annotations, presenting an ideal setting for unsupervised methods. With this …

Learning representations for nonspeech audio events through their similarities to speech patterns

H Phan, L Hertel, M Maass, R Mazur… - … /ACM Transactions on …, 2016 - ieeexplore.ieee.org
The human auditory system is very well matched to both human speech and environmental
sounds. Therefore, the question arises whether human speech material may provide useful …

A hierarchical system for word discovery exploiting DTW-based initialization

O Walter, T Korthals… - 2013 IEEE Workshop …, 2013 - ieeexplore.ieee.org
Discovering the linguistic structure of a language solely from spoken input asks for two
steps: phonetic and lexical discovery. The first is concerned with identifying the categorical …

Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices

J Heymann, O Walter… - … on Acoustics, Speech …, 2014 - ieeexplore.ieee.org
In this paper we present an algorithm for the unsupervised segmentation of a lattice
produced by a phoneme recognizer into words. Using a lattice rather than a single phoneme …

Social-sensed multimedia computing

P Cui - Frontiers of Multimedia Research, 2017 - dl.acm.org
Multimedia computing technology, as one of the most effective and pervasive technologies
in modern society, plays irreplaceable roles in bridging user needs with vast amounts of …

Unsupervised word segmentation from noisy input

J Heymann, O Walter… - 2013 IEEE workshop …, 2013 - ieeexplore.ieee.org
In this paper we present an algorithm for the unsupervised segmentation of a character or
phoneme lattice into words. Using a lattice at the input rather than a single string accounts …

Unsupervised hierarchical structure induction for deeper semantic analysis of audio

S Chaudhuri, B Raj - 2013 IEEE International Conference on …, 2013 - ieeexplore.ieee.org
Current audio analysis techniques rely on fairly shallow analysis of audio content, using
symbols or patterns extracted directly from the observed acoustics. We hypothesize that the …