Viet Anh Trinh
Title
Cited by
Year
Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
VA Trinh, S Braun
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 26, Year: 2022
Importantaug: a data augmentation agent for speech
VA Trinh, HS Kavaki, MI Mandel
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 14, Year: 2022
New Dataset and Strong Baselines for the Grammatical Error Correction of Russian
VA Trinh, A Rozovskaya
Findings of ACL, 2021
Cited by: 14, Year: 2021
Directly Comparing the Listening Strategies of Humans and Machines
VA Trinh, M Mandel
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 312 - 323, 2020
Cited by: 13, Year: 2020
Bubble Cooperative Networks for identifying important speech cues
VA Trinh, B McFee, MI Mandel
Proceeding of the Interspeech 2018, 2018
Cited by: 10, Year: 2018
Automatic Speech Recognition Tuned for Child Speech in the Classroom
R Southwell, W Ward, VA Trinh, C Clevenger, C Clevenger, E Watts, ...
ICASSP, 2024
Cited by: 8, Year: 2024
Large scale evaluation of importance maps in automatic speech recognition
VA Trinh, M Mandel
Proceeding of the Interspeech 2020, 2020
Cited by: 6, Year: 2020
Concatenative Resynthesis with Improved Training Signals for Speech Enhancement
AR Syed, VA Trinh, MI Mandel
Proceeding of the Interspeech 2018, 0
Cited by: 6*
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation
VA Trinh, P Ghahremani, B King, J Droppo, A Stolcke, R Maas
Proceeding of the Interspeech 2022, 2022
Cited by: 5, Year: 2022
Combining spatial clustering with LSTM speech models for multichannel speech enhancement
F Grezes, Z Ni, VA Trinh, M Mandel
arXiv preprint arXiv:2012.03388, 2020
Cited by: 4, Year: 2020
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
F Grezes, Z Ni, VA Trinh, M Mandel
arXiv preprint arXiv:2012.01576, 2020
Cited by: 3, Year: 2020
Towards Accurate and Real-Time End-of-Speech Estimation
Y Fan, C Vaz, D He, J Heymann, VA Trinh, Z Zhang, V Ravichandran
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 2, Year: 2023
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Z Ni, F Grezes, VA Trinh, MI Mandel
arXiv preprint arXiv:2012.02191, 2020
Cited by: 2, Year: 2020
Tracking Classroom Movement Patterns with Person Re-ID
X He, J Wang, VA Trinh, A McReynolds, J Whitehill
Educational Data Mining, 2024
Cited by: 1, Year: 2024
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing
VA Trinh, R Southwell, Y Guan, X He, Z Wang, J Whitehill
arXiv preprint arXiv:2406.06582, 2024
Cited by: 1, Year: 2024
Adaptive Endpointing with Deep Contextual Multi-Armed Bandits
A Stolcke, A Raju, C Vaz, D He, V Ravichandran, VA Trinh
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 1, Year: 2023
Identifying, Evaluating and Applying Importance Maps for Speech
VA Trinh
City University of New York, 2022
Cited by: 1, Year: 2022
Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy?
Y Guan, VA Trinh, V Voleti, J Whitehill
arXiv preprint arXiv:2409.09221, 2024
Year: 2024
Two-Pass Endpoint Detection for Speech Recognition
A Raju, A Khare, D He, I Sklyar, L Chen, S Alptekin, VA Trinh, Z Zhang, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
Year: 2023
Articles 1–19