Multimedia data mining: state of the art and challenges

CA Bhatt, MS Kankanhalli - Multimedia Tools and Applications, 2011‏ - Springer
Advances in multimedia data acquisition and storage technology have led to the growth of
very large multimedia databases. Analyzing this huge amount of multimedia data to discover …

Multimedia content analysis-using both audio and visual clues

Y Wang, Z Liu, JC Huang - IEEE signal processing magazine, 2000‏ - ieeexplore.ieee.org
Multimedia content analysis refers to the computerized understanding of the semantic
meanings of a multimedia document, such as a video sequence with an accompanying …

Audio-visual integration in multimodal communication

T Chen, RR Rao - Proceedings of the IEEE, 1998‏ - ieeexplore.ieee.org
We review recent research that examines audio-visual integration in multimodal
communication. The topics include bimodality in human speech, human and automated lip …

Audio feature extraction and analysis for scene segmentation and classification

Z Liu, Y Wang, T Chen - Journal of VLSI signal processing systems for …, 1998‏ - Springer
Understanding of the scene content of a video sequence is very important for content-based
indexing and retrieval of multimedia databases. Research in this area in the past several …

A hidden Markov model framework for video segmentation using audio and image features

JS Boreczky, LD Wilcox - Proceedings of the 1998 IEEE …, 1998‏ - ieeexplore.ieee.org
This paper describes a technique for segmenting video using hidden Markov models
(HMM). Video is segmented into regions defined by shots, shot boundaries, and camera …

Classification TV programs based on audio information using hidden Markov model

Z Liu, J Huang, Y Wang - 1998 IEEE Second Workshop on …, 1998‏ - ieeexplore.ieee.org
This paper describes a technique for classifying TV broadcast video using a hidden Markov
model (HMM). Here we consider the problem of discriminating five types of TV programs …

A deep learning-based pipeline for mosquito detection and classification from wingbeat sounds

MS Yin, P Haddawy, T Ziemer, F Wetjen… - Multimedia Tools and …, 2023‏ - Springer
Mosquito vector-borne diseases such as malaria and dengue constitute some of the most
serious public health burdens in tropical and sub-tropical countries. Effective targeting of …

Video handling with music and speech detection

K Minami, A Akutsu, H Hamada… - IEEE MultiMedia, 1998‏ - ieeexplore.ieee.org
The audio-based approach to video indexing described by the authors detects music and
speech independently even when they occur simultaneously. The indexed video segments …

Content-based video parsing and indexing based on audio-visual interaction

S Tsekeridou, I Pitas - IEEE transactions on circuits and systems …, 2001‏ - ieeexplore.ieee.org
A content-based video parsing and indexing method is presented in this paper, which
analyzes both information sources (auditory and visual) and accounts for their inter-relations …

Characterizing Multimedia Information Environment through Multi-modal Clustering of YouTube Videos

N Yousefi, M Shaik, N Agarwal - arxiv preprint arxiv:2402.18702, 2024‏ - arxiv.org
This study aims to investigate the comprehensive characterization of information content in
multimedia (videos), particularly on YouTube. The research presents a multi-method …