Multimodal age and gender estimation for adaptive human-robot interaction: A systematic literature review

HA Younis, NIR Ruhaiyem, AA Badr, AK Abdul-Hassan… - Processes, 2023 - mdpi.com
Identifying the gender of a person and his age by way of speaking is considered a crucial
task in computer vision. It is a very important and active research topic with many areas of …

Acoustical sound database in real environments for sound scene understanding and hands-free speech recognition

S Nakamura, K Hiyane, F Asano, T Nishiura, T Yamada - 2000 - naist.repo.nii.ac.jp
This paper reports on a project for collection of the sound scene data. The sound scene data
is necessary for studies such as sound source localization, sound retrieval, sound …

Blind source separation combining independent component analysis and beamforming

H Saruwatari, S Kurita, K Takeda, F Itakura… - EURASIP Journal on …, 2003 - Springer
We describe a new method of blind source separation (BSS) on a microphone array
combining subband independent component analysis (ICA) and beamforming. The …

Blind source separation based on a fast-convergence algorithm combining ICA and beamforming

H Saruwatari, T Kawamura, T Nishikawa… - … on Audio, speech …, 2006 - ieeexplore.ieee.org
We propose a new algorithm for blind source separation (BSS), in which independent
component analysis (ICA) and beamforming are combined to resolve the slow-convergence …

Vowels in infant-directed speech: More breathy and more variable, but not clearer

K Miyazawa, T Shinya, A Martin, H Kikuchi, R Mazuka - Cognition, 2017 - Elsevier
Infant-directed speech (IDS) is known to differ from adult-directed speech (ADS) in a number
of ways, and it has often been argued that some of these IDS properties facilitate infants' …

Perceptual limits in a simulated “Cocktail party”

T Kawashima, T Sato - Attention, Perception, & Psychophysics, 2015 - Springer
Numerosity judgments of simultaneous talkers were examined. Listeners were required to
report the number of talkers heard when this number varied (1 to 13). Spatial location of …

Blind source separation of acoustic signals based on multistage ICA combining frequency-domain ICA and time-domain ICA

T Nishikawa, H Saruwatari… - IEICE Transactions on …, 2003 - search.ieice.org
We propose a new algorithm for blind source separation (BSS), in which frequency-domain
independent component analysis (FDICA) and time-domain ICA (TDICA) are combined to …

Sound scene data collection in real acoustical environments

S Nakamura, K Hiyane, F Asano… - Journal of the Acoustical …, 1999 - jstage.jst.go.jp
This paper describes a sound scene database necessary for studies such as sound source
localization, sound retrieval, sound recognition and speech recognition in real acoustical …

Enhancement of speech dynamics for voice activity detection using DNN

S Dwijayanti, K Yamamori, M Miyoshi - EURASIP Journal on Audio …, 2018 - Springer
Voice activity detection (VAD) is an important preprocessing step for various speech
applications to identify speech and non-speech periods in input signals. In this paper, we …

Blind source separation for speech based on fast-convergence algorithm with ICA and beamforming

H Saruwatari, T Kawamura, K Shikano - 2001 - naist.repo.nii.ac.jp
We propose a new algorithm for blind source separation (BSS), in which independent
component analysis (ICA) and beamforming are combined to resolve the low-convergence …