- Academic Search

Unsupervised speech recognition

A Baevski, WN Hsu, A Conneau… - Advances in Neural …, 2021 - proceedings.neurips.cc

Despite rapid progress in the recent past, current speech recognition systems still require
labeled training data which limits this technology to a small fraction of the languages spoken …

Cited by 331 - 6 versions


[HTML] sciencedirect.com

[HTML] Unsupervised automatic speech recognition: A review

H Aldarmaki, A Ullah, S Ram, N Zaki - Speech Communication, 2022 - Elsevier

Automatic Speech Recognition (ASR) systems can be trained to achieve
remarkable performance given large amounts of manually transcribed speech, but large …

Cited by 75 - 8 versions


[PDF] neurips.cc

Unsupervised learning of spoken language with visual context

D Harwath, A Torralba, J Glass - Advances in neural …, 2016 - proceedings.neurips.cc

Humans learn to speak before they can read or write, so why can't computers do the same?
In this paper, we present a deep neural network model capable of rudimentary spoken …

Cited by 299 - 12 versions


[PDF] jhu.edu

Query-by-example keyword spotting using long short-term memory networks

G Chen, C Parada, TN Sainath - 2015 IEEE international …, 2015 - ieeexplore.ieee.org

We present a novel approach to query-by-example keyword spotting (KWS) using a long
short-term memory (LSTM) recurrent neural network-based feature extractor. In our …

Cited by 217 - 8 versions


[PDF] aclanthology.org

[PDF] A nonparametric Bayesian approach to acoustic model discovery

C Lee, J Glass - Proceedings of the 50th Annual Meeting of the …, 2012 - aclanthology.org

We investigate the problem of acoustic modeling in which prior language-specific
knowledge and transcribed data are unavailable. We present an unsupervised model that …

Cited by 272 - 9 versions


[PDF] arxiv.org

Deep convolutional acoustic word embeddings using word-pair side information

H Kamper, W Wang, K Livescu - 2016 IEEE International …, 2016 - ieeexplore.ieee.org

Recent studies have been revisiting whole words as the basic modelling unit in speech
recognition and query applications, instead of phonetic units. Such whole-word segmental …

Cited by 197 - 8 versions


[PDF] iitkgp.ac.in

Recent developments in spoken term detection: a survey

A Mandal, KR Prasanna Kumar, P Mitra - International Journal of Speech …, 2014 - Springer

Spoken term detection (STD) provides an efficient means for content based indexing of
speech. However, achieving high detection performance, faster speed, detecting out-of …

Cited by 76 - 7 versions

RIO: A pervasive RFID-based touch gesture interface

S Pradhan, E Chai, K Sundaresan, L Qiu… - Proceedings of the 23rd …, 2017 - dl.acm.org

In this paper, we design and develop RIO, a novel battery-free touch sensing user interface
(UI) primitive for future IoT and smart spaces. RIO enables UIs to be constructed using off-the …

Cited by 156 - 3 versions


[PDF] arxiv.org

Learning hierarchical discrete linguistic units from visually-grounded speech

D Harwath, WN Hsu, J Glass - arXiv preprint arXiv:1911.09602, 2019 - arxiv.org

In this paper, we present a method for learning discrete linguistic units by incorporating
vector quantization layers into neural models of visually grounded speech. We show that our …

Cited by 105 - 8 versions


[PDF] cuhk.edu.hk

Mispronunciation detection and diagnosis in L2 English speech using multidistribution deep neural networks

K Li, X Qian, H Meng - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org

This paper investigates the use of multidistribution deep neural networks (DNNs) for
mispronunciation detection and diagnosis (MDD), to circumvent the difficulties encountered …

Cited by 186 - 13 versions


Unsupervised spoken keyword spotting via segmental DTW on Gaussian posteriorgrams

Unsupervised speech recognition
