Ambuj Mehrish

Citée par

	Toutes	Depuis 2020
Citations	598	571
indice h	10	9
indice i10	10	8

400

200

100

300

20162017201820192020202120222023202420253 4 6 13 12 12 15 79 382 68

Accès public

Tout afficher

6 articles

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Soujanya PoriaAssistant Professor, Singapore University of Technology and DesignAdresse e-mail validée de sutd.edu.sg
Navonil MajumderSingapore University of Technology and DesignAdresse e-mail validée de sutd.edu.sg
A V SubramanyamProfessorAdresse e-mail validée de iiitd.ac.in
Rishabh BhardwajSingapore University of Technology and DesignAdresse e-mail validée de mymail.sutd.edu.sg
Sabu EmmanuelSingapore Institute of Technology (SIT), SingaporeAdresse e-mail validée de singaporetech.edu.sg
marie tahonLIUM / Le Mans UniversitéAdresse e-mail validée de univ-lemans.fr
Anthony LarcherProfessor Le Mans UniversitéAdresse e-mail validée de univ-lemans.fr
Mohan KankanhalliProfessor of Computer Science, National University of SingaporeAdresse e-mail validée de comp.nus.edu.sg
Shishir SharmaMcGill UniversityAdresse e-mail validée de mail.mcgill.ca

Suivre

Ambuj Mehrish

Research Fellow, Singapore University of Technology and Design, Singapore

Adresse e-mail validée de sutd.edu.sg

Signal Processing Multimedia Forensics Speech and Language Processing Deep Learning


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
A review of deep learning techniques for speech processing A Mehrish, N Majumder, R Bharadwaj, R Mihalcea, S Poria Information Fusion 99, 101869, 2023	242	2023
Text-to-audio generation using instruction guided latent diffusion model D Ghosal, N Majumder, A Mehrish, S Poria Proceedings of the 31st ACM International Conference on Multimedia, 3590-3598, 2023	193	2023
Evaluating parameter-efficient transfer learning approaches on sure benchmark for speech understanding Y Li, A Mehrish, R Bhardwaj, N Majumder, B Cheng, S Zhao, A Zadeh, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	20	2023
Speaker embeddings for diarization of broadcast data in the allies challenge A Larcher, A Mehrish, M Tahon, S Meignier, J Carrive, D Doukhan, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	19	2021
Robust PRNU estimation from probabilistic raw measurements A Mehrish, AV Subramanyam, S Emmanuel Signal Processing: Image Communication 66, 30-41, 2018	16	2018
Joint spatial and discrete cosine transform domain-based counter forensics for adaptive contrast enhancement A Mehrish, AV Subramanyam, S Emmanuel IEEE access 7, 27183-27195, 2019	14	2019
Multimedia signatures for vehicle forensics A Mehrish, AV Subramanyam, M Kankanhalli 2017 IEEE International Conference on Multimedia and Expo (ICME), 685-690, 2017	14	2017
Sensor pattern noise estimation using probabilistically estimated RAW values A Mehrish, AV Subramanyam, S Emmanuel IEEE Signal Processing Letters 23 (5), 693-697, 2016	14	2016
Anti-forensic technique for median filtering using L₁-L₂ TV model S Sharma, AV Subramanyam, M Jain, A Mehrish, S Emmanuel 2016 IEEE International Workshop on Information Forensics and Security (WIFS …, 2016	11	2016
Adaptermix: Exploring the efficacy of mixture of adapters for low-resource tts adaptation A Mehrish, AR Kashyap, L Yingting, N Majumder, S Poria arXiv preprint arXiv:2305.18028, 2023	10	2023
Improving text-to-audio models with synthetic captions Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ... arXiv preprint arXiv:2406.15487, 2024	9	2024
Egocentric analysis of dash-cam videos for vehicle forensics A Mehrish, P Singh, P Jain, AV Subramanyam, M Kankanhalli IEEE Transactions on Circuits and Systems for Video Technology 30 (9), 3000-3014, 2019	7	2019
Cm-tts: Enhancing real time text-to-speech synthesis efficiency through weighted samplers and consistency models X Li, F Bu, A Mehrish, Y Li, J Han, B Cheng, S Poria arXiv preprint arXiv:2404.00569, 2024	6	2024
Learning accent representation with multi-level vae towards controllable speech synthesis J Melechovsky, A Mehrish, D Herremans, B Sisman 2022 IEEE Spoken Language Technology Workshop (SLT), 928-935, 2023	6	2023
Accented text-to-speech synthesis with a conditional variational autoencoder J Melechovsky, A Mehrish, B Sisman, D Herremans arXiv preprint arXiv:2211.03316, 2022	5	2022
Towards lifelong human assisted speaker diarization M Shamsi, A Larcher, L Barrault, S Meignier, Y Prokopalo, M Tahon, ... Computer Speech & Language 77, 101437, 2023	4	2023
Precoding based on signal-to-leakage and noise ratio to reduce ICI in MIMO-OFDM systems A Mehrish, H Kumar, A Goswami International Journal of Computer Applications 975, 8887, 2014	3	2014
HyperTTS: Parameter efficient adaptation in text to speech using hypernetworks Y Li, R Bhardwaj, A Mehrish, B Cheng, S Poria arXiv preprint arXiv:2404.04645, 2024	2	2024
Text-to-audio generation using instruction-tuned llm and latent diffusion model G Deepanway, M Navonil, M Ambuj, P Soujanya arXiv preprint arXiv:2304.13731, 2023	2	2023
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training J Melechovsky, A Mehrish, B Sisman, D Herremans arXiv preprint arXiv:2406.01018, 2024	1	2024

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs