Soumi Maiti

Hivatkozott rá

	Összes	2020 óta
Hivatkozások	530	515
h-index	14	14
i10-index	18	18

340

170

255

201820192020202120222023202420255 10 18 17 30 90 331 26

Nyilvános hozzáférés

Összes megtekintése

14 cikk

1 cikk

elérhető

nem érhető el

Finanszírozási megbízások alapján

Társszerzők

Shinji WatanabeCarnegie Mellon UniversityE-mail megerősítve itt: cmu.edu
Yifan PengCarnegie Mellon UniversityE-mail megerősítve itt: andrew.cmu.edu
Michael I MandelAssociate Professor of Computer and Information Science at Brooklyn College, CUNYE-mail megerősítve itt: sci.brooklyn.cuny.edu
Takaaki SaekiGoogle DeepMindE-mail megerősítve itt: google.com
Erik MarchiApple Inc.E-mail megerősítve itt: tum.de
Alistair ConkieAppleE-mail megerősítve itt: apple.com
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)E-mail megerősítve itt: google.com
Hakan ErdoganGoogleE-mail megerősítve itt: google.com
Scott WisdomGoogle DeepMindE-mail megerősítve itt: google.com
Kevin WilsonGoogleE-mail megerősítve itt: google.com
Yooncheol JuSpeech synthesis AI researcher, 42dot.Inc, Hyundai Motor GroupE-mail megerősítve itt: 42dot.ai
Srinivas BangaloreInteractionsE-mail megerősítve itt: interactions.com
Svetlana StoyanchevResearch Scientist, AT&T LabsE-mail megerősítve itt: research.att.com

Követés

Soumi Maiti

Carnegie Mellon University

E-mail megerősítve itt: andrew.cmu.edu - Kezdőlap

Machine Learning Speech Processing


Cím Rendezés hivatkozások szerint Rendezés év szerint Rendezés cím szerint	Hivatkozott rá Hivatkozott rá	Év
Voxtlm: Unified decoder-only models for consolidating speech recognition, synthesis and speech, text continuation tasks S Maiti, Y Peng, S Choi, J Jung, X Chang, S Watanabe ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	54	2024
Reproducing whisper-style training using an open-source toolkit and publicly available data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	46	2023
Improving massively multilingual asr with auxiliary ctc objectives W Chen, B Yan, J Shi, Y Peng, S Maiti, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	43	2023
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	37	2024
Speechlmscore: Evaluating speech generation using speech language model S Maiti, Y Peng, T Saeki, S Watanabe ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	34	2023
EEND-SS: Joint end-to-end neural speaker diarization and speech separation for flexible number of speakers S Maiti, Y Ueda, S Watanabe, C Zhang, M Yu, SX Zhang, Y Xu 2022 IEEE Spoken Language Technology Workshop (SLT), 480-487, 2023	34	2023
Reducing barriers to self-supervised learning: Hubert pre-training with academic compute W Chen, X Chang, Y Peng, Z Ni, S Maiti, S Watanabe arXiv preprint arXiv:2306.06672, 2023	27	2023
Parametric resynthesis with neural vocoders S Maiti, MI Mandel 2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019	26	2019
Generating multilingual voices using speaker space translation based on bilingual speaker data S Maiti, E Marchi, A Conkie ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	24	2020
Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement S Maiti, MI Mandel ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	24	2020
End-to-end diarization for variable number of speakers with local-global networks and discriminative speaker embeddings S Maiti, H Erdogan, K Wilson, S Wisdom, S Watanabe, JR Hershey ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	23	2021
SpeechBERTScore: Reference-aware automatic evaluation of speech generation leveraging nlp evaluation metrics T Saeki, S Maiti, S Takamichi, S Watanabe, H Saruwatari arXiv preprint arXiv:2401.16812, 2024	15	2024
ESPnet-ST-v2: Multipurpose spoken language translation toolkit B Yan, J Shi, Y Tang, H Inaguma, Y Peng, S Dalmia, P Polák, ... arXiv preprint arXiv:2304.04596, 2023	14	2023
Learning to speak from text: Zero-shot multilingual text-to-speech with unsupervised text pretraining T Saeki, S Maiti, X Li, S Watanabe, S Takamichi, H Saruwatari arXiv preprint arXiv:2301.12596, 2023	14	2023
Speech denoising by parametric resynthesis S Maiti, MI Mandel ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	13	2019
Joint prediction and denoising for large-scale multilingual self-supervised learning W Chen, J Shi, B Yan, D Berrebbi, W Zhang, Y Peng, X Chang, S Maiti, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	12	2023
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner. Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe Interspeech, 16-20, 2022	11	2022
Unsupervised data selection for tts: Using arabic broadcast news as a case study M Baali, T Hayashi, H Mubarak, S Maiti, S Watanabe, W El-Hajj, A Ali arXiv preprint arXiv:2301.09099, 2023	10	2023
Predicting interaction quality in customer service dialogs S Stoyanchev, S Maiti, S Bangalore Advanced Social Interaction with Agents: 8th International Workshop on …, 2018	9	2018
CMU’s IWSLT 2023 simultaneous speech translation system B Yan, J Shi, S Maiti, W Chen, X Li, Y Peng, S Arora, S Watanabe Proceedings of the 20th International Conference on Spoken Language …, 2023	8	2023

A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.

Cikkek 1–20

Hivatkozások évente

Ismétlődő hivatkozások

Összevont hivatkozások

Társszerzők hozzáadásaTársszerzők

Követés

Hivatkozott rá

Társszerzők