Google Scholar

WavLLM: Towards robust and adaptive speech large language model

S Hu, L Zhou, S Liu, S Chen, L Meng, H Hao… - arxiv preprint arxiv …, 2024 - arxiv.org

The recent advancements in large language models (LLMs) have revolutionized the field of
natural language processing, progressively broadening their scope to multimodal …

Cited by 48

Generalizing across domains via cross-gradient training

S Shankar, V Piratla, S Chakrabarti… - arxiv preprint arxiv …, 2018 - arxiv.org

We present CROSSGRAD, a method to use multi-domain training data to learn a classifier
that generalizes to new domains. CROSSGRAD does not need an adaptation phase via …

Cited by 617

Light gated recurrent units for speech recognition

M Ravanelli, P Brakel, M Omologo… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org

A field that has directly benefited from the recent advances in deep learning is automatic
speech recognition (ASR). Despite the great achievements of the past decades, however, a …

Cited by 450

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E Vincent, S Watanabe, AA Nugraha, J Barker… - Computer Speech & …, 2017 - Elsevier

Speech enhancement and automatic speech recognition (ASR) are most often evaluated in
matched (or multi-condition) settings where the acoustic conditions of the training data …

Cited by 430

Speech and language processing

D Jurafsky - 2000 - academia.edu

"This book is an absolute necessity for instructors at all levels, as well as an indispensable
reference for researchers. Introducing NLP, computational linguistics, and speech …

Cited by 17855

Domain adaptation with structural correspondence learning

J Blitzer, R McDonald, F Pereira - Proceedings of the 2006 …, 2006 - aclanthology.org

Discriminative learning methods are widely used in natural language processing. These
methods work best when their training and test data are drawn from the same distribution …

Cited by 2023

Weakly supervised learning with multi-stream CNN-LSTM-HMMs to discover sequential parallelism in sign language videos

O Koller, NC Camgoz, H Ney… - IEEE transactions on …, 2019 - ieeexplore.ieee.org

In this work we present a new approach to the field of weakly supervised learning in the
video domain. Our method is relevant to sequence learning problems which can be split up …

Cited by 354

The 2005 music information retrieval evaluation exchange (MIREX 2005): Preliminary overview

JS Downie, K West, A Ehmann… - 6th int. conf. on music …, 2005 - inria.hal.science

This paper is an extended abstract which provides a brief preliminary overview of the 2005
Music Information Retrieval Evaluation eXchange (MIREX 2005). The MIREX organizational …

Cited by 149

Shallow parsing with conditional random fields

F Sha, F Pereira - Proceedings of the 2003 human language …, 2003 - aclanthology.org

Conditional random fields for sequence labeling offer advantages over both generative
models like HMMs and classifiers applied at each sequence position. Among sequence …

Cited by 1969

MCYT baseline corpus: a bimodal biometric database

J Ortega-Garcia, J Fierrez-Aguilar, D Simon… - IEE Proceedings-Vision …, 2003 - IET

The current need for large multimodal databases to evaluate automatic biometric recognition
systems has motivated the development of the MCYT bimodal database. The main purpose …

Cited by 922

Saved in My library

Some statistical issues in the comparison of speech recognition algorithms

Wavllm: Towards robust and adaptive speech large language model
