„Google“ mokslinčius

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Išsaugoti Cituoti Cituoja 242 Susiję straipsniai Visos 7 versijos

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Išsaugoti Cituoti Cituoja 410 Susiję straipsniai Visos 10 versijos

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A review of speaker diarization: Recent advances with deep learning

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022 - Elsevier

Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …

Išsaugoti Cituoti Cituoja 430 Susiję straipsniai Visos 7 versijos

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A comparative study on transformer vs rnn in speech applications

S Karita, N Chen, T Hayashi, T Hori… - 2019 IEEE automatic …, 2019 - ieeexplore.ieee.org

Sequence-to-sequence models have been widely used in end-to-end speech processing,
for example, automatic speech recognition (ASR), speech translation (ST), and text-to …

Išsaugoti Cituoti Cituoja 909 Susiję straipsniai Visos 9 versijos

[Free GPT-4]
[DeepSeek]

[PDF] inaoep.mx

Automatic speech recognition: a survey

M Malik, MK Malik, K Mehmood… - Multimedia Tools and …, 2021 - Springer

Recently great strides have been made in the field of automatic speech recognition (ASR) by
using various deep learning techniques. In this study, we present a thorough comparison …

Išsaugoti Cituoti Cituoja 395 Susiję straipsniai Visos 8 versijos

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

S Watanabe, M Mandel, J Barker, E Vincent… - arxiv preprint arxiv …, 2020 - arxiv.org

Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the
6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge …

Išsaugoti Cituoti Cituoja 368 Susiję straipsniai Visos 8 versijos HTML kopija

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Librimix: An open-source dataset for generalizable speech separation

J Cosentino, M Pariente, S Cornell, A Deleforge… - arxiv preprint arxiv …, 2020 - arxiv.org

In recent years, wsj0-2mix has become the reference dataset for single-channel speech
separation. Most deep learning-based speech separation models today are benchmarked …

Išsaugoti Cituoti Cituoja 321 Susiję straipsniai Visos 5 versijos HTML kopija

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Wavesplit: End-to-end speech separation by speaker clustering

N Zeghidour, D Grangier - IEEE/ACM Transactions on Audio …, 2021 - ieeexplore.ieee.org

We introduce Wavesplit, an end-to-end source separation system. From a single mixture, the
model infers a representation for each source and then estimates each source signal given …

Išsaugoti Cituoti Cituoja 311 Susiję straipsniai Visos 8 versijos

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

A review of the recent progress in battery informatics

C Ling - npj Computational Materials, 2022 - nature.com

Batteries are of paramount importance for the energy storage, consumption, and
transportation in the current and future society. Recently machine learning (ML) has …

Išsaugoti Cituoti Cituoja 121 Susiję straipsniai Visos 5 versijos

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multi-task self-supervised learning for robust speech recognition

M Ravanelli, J Zhong, S Pascual… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

Despite the growing interest in unsupervised learning, extracting meaningful knowledge
from unlabelled audio remains an open challenge. To take a step in this direction, we …

Išsaugoti Cituoti Cituoja 339 Susiję straipsniai Visos 7 versijos

Kurti įspėjimą

Cituoti

Išplėstinė paieška

Išsaugota skiltyje „Mano biblioteka“

The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines

A review of deep learning techniques for speech processing

Self-supervised speech representation learning: A review

A review of speaker diarization: Recent advances with deep learning

A comparative study on transformer vs rnn in speech applications

Automatic speech recognition: a survey

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

Librimix: An open-source dataset for generalizable speech separation

Wavesplit: End-to-end speech separation by speaker clustering

A review of the recent progress in battery informatics

Multi-task self-supervised learning for robust speech recognition