„Google“ mokslinčius

J Li, L Deng, Y Gong… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org

New waves of consumer-centric applications, such as voice search and voice interaction
with mobile devices and home entertainment systems, increasingly require automatic …

Išsaugoti Cituoti Cituoja 692 Susiję straipsniai Visos 8 versijos

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

PLACES: Prompting language models for social conversation synthesis

M Chen, A Papangelis, C Tao, S Kim… - arxiv preprint arxiv …, 2023 - arxiv.org

Collecting high quality conversational data can be very expensive for most applications and
infeasible for others due to privacy, ethical, or similar concerns. A promising direction to …

Išsaugoti Cituoti Cituoja 64 Susiję straipsniai Visos 6 versijos HTML kopija

[Free GPT-4]
[DeepSeek]

[PDF] ustc.edu.cn

A regression approach to speech enhancement based on deep neural networks

Y Xu, J Du, LR Dai, CH Lee - IEEE/ACM transactions on audio …, 2014 - ieeexplore.ieee.org

In contrast to the conventional minimum mean square error (MMSE)-based noise reduction
techniques, we propose a supervised method to enhance speech by means of finding a …

Išsaugoti Cituoti Cituoja 1543 Susiję straipsniai Visos 7 versijos

[Free GPT-4]
[DeepSeek]

[PDF] tuni.fi

Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge

A Mesaros, T Heittola, E Benetos… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org

Public evaluation campaigns and datasets promote active development in target research
areas, allowing direct comparison of algorithms. The second edition of the challenge on …

Išsaugoti Cituoti Cituoja 389 Susiję straipsniai Visos 8 versijos

[Free GPT-4]
[DeepSeek]

[PDF] archive.org

Towards scaling up classification-based speech separation

Y Wang, DL Wang - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org

Formulating speech separation as a binary classification problem has been shown to be
effective. While good separation performance is achieved in matched test conditions using …

Išsaugoti Cituoti Cituoja 557 Susiję straipsniai Visos 6 versijos

Noise robust automatic speech recognition: review and analysis

M Dua, Akanksha, S Dua - International Journal of Speech Technology, 2023 - Springer

Abstract Automatic Speech Recognition (ASR) system is an emerging technology used in
various fields such as robotics, traffic controls, and healthcare, etc. The leading cause of …

Išsaugoti Cituoti Cituoja 11 Susiję straipsniai Visos 2 versijos

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Enabling reproducible research in sensor-based transportation mode recognition with the Sussex-Huawei dataset

L Wang, H Gjoreski, M Ciliberto, S Mekki… - IEEE …, 2019 - ieeexplore.ieee.org

Transportation and locomotion mode recognition from multimodal smartphone sensors is
useful for providing just-in-time context-aware assistance. However, the field is currently …

Išsaugoti Cituoti Cituoja 183 Susiję straipsniai Visos 7 versijos

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

The PASCAL CHiME speech separation and recognition challenge

J Barker, E Vincent, N Ma, H Christensen… - Computer Speech & …, 2013 - Elsevier

Distant microphone speech recognition systems that operate with human-like robustness
remain a distant goal. The key difficulty is that operating in everyday listening conditions …

Išsaugoti Cituoti Cituoja 288 Susiję straipsniai Visos 14 versijos

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

The signal separation evaluation campaign (2007–2010): Achievements and remaining challenges

E Vincent, S Araki, F Theis, G Nolte, P Bofill, H Sawada… - Signal Processing, 2012 - Elsevier

We present the outcomes of three recent evaluation campaigns in the field of audio and
biomedical source separation. These campaigns have witnessed a boom in the range of …

Išsaugoti Cituoti Cituoja 231 Susiję straipsniai Visos 17 versijos

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Coherent-to-diffuse power ratio estimation for dereverberation

A Schwarz, W Kellermann - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org

The estimation of the time-and frequency-dependent coherent-to-diffuse power ratio (CDR)
from the measured spatial coherence between two omnidirectional microphones is …

Išsaugoti Cituoti Cituoja 157 Susiję straipsniai Visos 5 versijos

Kurti įspėjimą

Cituoti

Išplėstinė paieška

Išsaugota skiltyje „Mano biblioteka“

The CHiME corpus: A resource and a challenge for computational hearing in multisource environments.

An overview of noise-robust automatic speech recognition

PLACES: Prompting language models for social conversation synthesis

A regression approach to speech enhancement based on deep neural networks

Detection and classification of acoustic scenes and events: Outcome of the DCASE 2016 challenge

Towards scaling up classification-based speech separation

Noise robust automatic speech recognition: review and analysis

Enabling reproducible research in sensor-based transportation mode recognition with the Sussex-Huawei dataset

The PASCAL CHiME speech separation and recognition challenge

The signal separation evaluation campaign (2007–2010): Achievements and remaining challenges

Coherent-to-diffuse power ratio estimation for dereverberation