Multimedia classification and event detection using double fusion
Abstract Multimedia Event Detection (MED) is a multimedia retrieval task with the goal of
finding videos of a particular event in video archives, given example videos and event …
finding videos of a particular event in video archives, given example videos and event …
Double fusion for multimedia event detection
Abstract Multimedia Event Detection is a multimedia retrieval task with the goal of finding
videos of a particular event in an internet video archive, given example videos and …
videos of a particular event in an internet video archive, given example videos and …
[PDF][PDF] Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification.
In this paper, we attempt to represent audio as a sequence of acoustic units using
unsupervised learning and use them for multi-class classification. We expect the acoustic …
unsupervised learning and use them for multi-class classification. We expect the acoustic …
A video indexing and retrieval computational prototype based on transcribed speech
N Spolaôr, HD Lee, WSR Takaki, LA Ensina… - Multimedia Tools and …, 2021 - Springer
Using the voice to interact with systems is attractive in medicine and other areas due to its
friendliness and flexibility. Video indexing and retrieval have benefited from this resource …
friendliness and flexibility. Video indexing and retrieval have benefited from this resource …
[PDF][PDF] Informedia@ trecvid 2011
The Informedia group participated in three tasks this year, including: Multimedia Event
Detection (MED), Semantic Indexing (SIN) and Surveillance Event Detection. Generally, all …
Detection (MED), Semantic Indexing (SIN) and Surveillance Event Detection. Generally, all …
Efficient genre-specific semantic video indexing
J Wu, M Worring - IEEE Transactions on Multimedia, 2011 - ieeexplore.ieee.org
Large video collections such as YouTube contain many different video genres, while in
many applications the user might be interested in one or two specific video genres only …
many applications the user might be interested in one or two specific video genres only …
Multimodal video concept detection via bag of auditory words and multiple kernel learning
State-of-the-art systems for video concept detection mainly rely on visual features. Some
previous approaches have also included audio features, either using low-level features such …
previous approaches have also included audio features, either using low-level features such …
Beyond audio and video retrieval: topic-oriented multimedia summarization
Given the deluge of multimedia content that is becoming available over the Internet, it is
increasingly important to be able to effectively examine and organize these large stores of …
increasingly important to be able to effectively examine and organize these large stores of …
On the applicability of speaker diarization to audio concept detection for multimedia retrieval
R Mertens, PS Huang, L Gottlieb… - 2011 IEEE …, 2011 - ieeexplore.ieee.org
Recently, audio concepts emerged as a useful building block in multimodal video retrieval
systems. Information like" this file contains laughter"," this file contains engine sounds" or" …
systems. Information like" this file contains laughter"," this file contains engine sounds" or" …
[PDF][PDF] Structured Models for Semantic Analysis of Audio Content
S Chaudhuri - 2013 - lti.cmu.edu
In the universe of audio signals, the notions of syntax, semantics, pragmatics, etc. have been
associated with a very limited set of domains, such as speech and language, and musical …
associated with a very limited set of domains, such as speech and language, and musical …