Suivre
Ambuj Mehrish
Titre
Citée par
Citée par
Année
A review of deep learning techniques for speech processing
A Mehrish, N Majumder, R Bharadwaj, R Mihalcea, S Poria
Information Fusion 99, 101869, 2023
2422023
Text-to-audio generation using instruction guided latent diffusion model
D Ghosal, N Majumder, A Mehrish, S Poria
Proceedings of the 31st ACM International Conference on Multimedia, 3590-3598, 2023
1932023
Evaluating parameter-efficient transfer learning approaches on sure benchmark for speech understanding
Y Li, A Mehrish, R Bhardwaj, N Majumder, B Cheng, S Zhao, A Zadeh, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
202023
Speaker embeddings for diarization of broadcast data in the allies challenge
A Larcher, A Mehrish, M Tahon, S Meignier, J Carrive, D Doukhan, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
192021
Robust PRNU estimation from probabilistic raw measurements
A Mehrish, AV Subramanyam, S Emmanuel
Signal Processing: Image Communication 66, 30-41, 2018
162018
Joint spatial and discrete cosine transform domain-based counter forensics for adaptive contrast enhancement
A Mehrish, AV Subramanyam, S Emmanuel
IEEE access 7, 27183-27195, 2019
142019
Multimedia signatures for vehicle forensics
A Mehrish, AV Subramanyam, M Kankanhalli
2017 IEEE International Conference on Multimedia and Expo (ICME), 685-690, 2017
142017
Sensor pattern noise estimation using probabilistically estimated RAW values
A Mehrish, AV Subramanyam, S Emmanuel
IEEE Signal Processing Letters 23 (5), 693-697, 2016
142016
Anti-forensic technique for median filtering using L1-L2 TV model
S Sharma, AV Subramanyam, M Jain, A Mehrish, S Emmanuel
2016 IEEE International Workshop on Information Forensics and Security (WIFS …, 2016
112016
Adaptermix: Exploring the efficacy of mixture of adapters for low-resource tts adaptation
A Mehrish, AR Kashyap, L Yingting, N Majumder, S Poria
arXiv preprint arXiv:2305.18028, 2023
102023
Improving text-to-audio models with synthetic captions
Z Kong, S Lee, D Ghosal, N Majumder, A Mehrish, R Valle, S Poria, ...
arXiv preprint arXiv:2406.15487, 2024
92024
Egocentric analysis of dash-cam videos for vehicle forensics
A Mehrish, P Singh, P Jain, AV Subramanyam, M Kankanhalli
IEEE Transactions on Circuits and Systems for Video Technology 30 (9), 3000-3014, 2019
72019
Cm-tts: Enhancing real time text-to-speech synthesis efficiency through weighted samplers and consistency models
X Li, F Bu, A Mehrish, Y Li, J Han, B Cheng, S Poria
arXiv preprint arXiv:2404.00569, 2024
62024
Learning accent representation with multi-level vae towards controllable speech synthesis
J Melechovsky, A Mehrish, D Herremans, B Sisman
2022 IEEE Spoken Language Technology Workshop (SLT), 928-935, 2023
62023
Accented text-to-speech synthesis with a conditional variational autoencoder
J Melechovsky, A Mehrish, B Sisman, D Herremans
arXiv preprint arXiv:2211.03316, 2022
52022
Towards lifelong human assisted speaker diarization
M Shamsi, A Larcher, L Barrault, S Meignier, Y Prokopalo, M Tahon, ...
Computer Speech & Language 77, 101437, 2023
42023
Precoding based on signal-to-leakage and noise ratio to reduce ICI in MIMO-OFDM systems
A Mehrish, H Kumar, A Goswami
International Journal of Computer Applications 975, 8887, 2014
32014
HyperTTS: Parameter efficient adaptation in text to speech using hypernetworks
Y Li, R Bhardwaj, A Mehrish, B Cheng, S Poria
arXiv preprint arXiv:2404.04645, 2024
22024
Text-to-audio generation using instruction-tuned llm and latent diffusion model
G Deepanway, M Navonil, M Ambuj, P Soujanya
arXiv preprint arXiv:2304.13731, 2023
22023
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training
J Melechovsky, A Mehrish, B Sisman, D Herremans
arXiv preprint arXiv:2406.01018, 2024
12024
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20