Прати
Kunal Dhawan
Kunal Dhawan
Research Scientist, NVIDIA
Верификована је имејл адреса на cs.cmu.edu - Почетна страница
Наслов
Навело
Навело
Година
IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition
S Ganji, K Dhawan, R Sinha
Speech Communication 110, 76-89, 2019
292019
Enhancing speaker diarization with large language models: A contextual beam search approach
TJ Park, K Dhawan, N Koluguri, J Balam
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
182024
Discrete audio representation as an alternative to mel-spectrograms for speaker and speech recognition
KC Puvvada, NR Koluguri, K Dhawan, J Balam, B Ginsburg
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
142024
Less is more: Accurate speech recognition & translation without web-scale data
KC Puvvada, P Żelasko, H Huang, O Hrinchuk, NR Koluguri, K Dhawan, ...
arXiv preprint arXiv:2406.19674, 2024
122024
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer
K Dhawan, D Rekesh, B Ginsburg
arXiv preprint arXiv:2306.08753, 2023
12*2023
Hindi-English code-switching speech corpus
G Sreeram, K Dhawan, R Sinha
arXiv preprint arXiv:1810.00662, 2018
102018
Spectral codecs: Spectrogram-based audio codecs for high quality speech synthesis
R Langman, A Jukić, K Dhawan, NR Koluguri, B Ginsburg
arXiv preprint arXiv:2406.05298, 2024
92024
Novel textual features for language modeling of intra-sentential code-switching data
S Ganji, K Dhawan, R Sinha
Computer Speech & Language 64, 101099, 2020
82020
Joint language identification of code-switching speech using attention-based e2e network
G Sreeram, K Dhawan, K Priyadarshi, R Sinha
2020 International Conference on Signal Processing and Communications (SPCOM …, 2020
82020
Property-aware multi-speaker data simulation: A probabilistic modelling technique for synthetic data generation
TJ Park, H Huang, C Hooper, N Koluguri, K Dhawan, A Jukic, J Balam, ...
arXiv preprint arXiv:2310.12371, 2023
72023
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System
TJ Park, H Huang, A Jukic, K Dhawan, KC Puvvada, N Koluguri, N Karpov, ...
arXiv preprint arXiv:2310.12378, 2023
72023
Phonetic word embeddings
R Sharma, K Dhawan, B Pailla
arXiv preprint arXiv:2109.14796, 2021
62021
Investigating target set reduction for end-to-end speech recognition of hindi-english code-switching data
K Dhawan, G Sreeram, K Priyadarshi, R Sinha
2020 National conference on communications (NCC), 1-5, 2020
62020
Large language model based generative error correction: A challenge and baselines for speech recognition, speaker tagging, and emotion recognition
CHH Yang, T Park, Y Gong, Y Li, Z Chen, YT Lin, C Chen, Y Hu, ...
2024 IEEE Spoken Language Technology Workshop (SLT), 371-378, 2024
52024
Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks
C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ...
arXiv preprint arXiv:2411.05361, 2024
52024
Sortformer: Seamless integration of speaker diarization and asr by bridging timestamps and tokens
T Park, I Medennikov, K Dhawan, W Wang, H Huang, NR Koluguri, ...
arXiv preprint arXiv:2409.06656, 2024
22024
Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR
W Wang, K Dhawan, T Park, KC Puvvada, I Medennikov, S Majumdar, ...
2024 IEEE Spoken Language Technology Workshop (SLT), 1224-1231, 2024
12024
VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning
Y Peng, KC Puvvada, Z Chen, P Zelasko, H Huang, K Dhawan, K Hu, ...
arXiv preprint arXiv:2410.17485, 2024
12024
NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
H Huang, T Park, K Dhawan, I Medennikov, KC Puvvada, NR Koluguri, ...
arXiv preprint arXiv:2408.13106, 2024
12024
Evaluating speech production-based acoustic features for COVID-19 classification using cough signals
BT Nellore, G Sreeram, K Dhawan, PB Reddy
2021 IEEE 18th India Council International Conference (INDICON), 1-5, 2021
12021
Систем тренутно не може да изврши ову радњу. Пробајте поново касније.
Чланци 1–20