Yinghao Aaron Li
PhD Student, Columbia University
Verified email at columbia.edu
Title
Cited by
Year
StarGANv2-VC: A diverse, unsupervised, non-parallel framework for natural-sounding voice conversion
YA Li, A Zare, N Mesgarani
arXiv preprint arXiv:2107.10394, 2021
Cited by 111, 2021
StyleTTS 2: Towards human-level text-to-speech through style diffusion and adversarial training with large speech language models
YA Li, C Han, V Raghavan, G Mischler, N Mesgarani
Advances in Neural Information Processing Systems 36, 2024
Cited by 98, 2024
Simple framework for constructing functional spiking recurrent neural networks
R Kim, Y Li, TJ Sejnowski
Proceedings of the National Academy of Sciences 116 (45), 22811-22820, 2019
Cited by 76, 2019
StyleTTS: A style-based generative model for natural and diverse text-to-speech synthesis
YA Li, C Han, N Mesgarani
IEEE Journal of Selected Topics in Signal Processing, 2025
Cited by 48, 2025
StyleTTS-VC: One-shot voice conversion by knowledge transfer from style-based TTS models
YA Li, C Han, N Mesgarani
2022 IEEE Spoken Language Technology Workshop (SLT), 920-927, 2023
Cited by 19, 2023
Phoneme-level BERT for enhanced prosody of text-to-speech with grapheme predictions
YA Li, C Han, X Jiang, N Mesgarani
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 16, 2023
Contextual feature extraction hierarchies converge in large language models and the brain
G Mischler, YA Li, S Bickel, AD Mehta, N Mesgarani
Nature Machine Intelligence, 1-11, 2024
Cited by 14, 2024
SLMGAN: Exploiting speech language model representations for unsupervised zero-shot voice conversion in GANs
YA Li, C Han, N Mesgarani
2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023
Cited by 8, 2023
Speech Slytherin: Examining the performance and efficiency of Mamba for speech separation, recognition, and synthesis
X Jiang, YA Li, AN Florea, C Han, N Mesgarani
arXiv preprint arXiv:2407.09732, 2024
Cited by 7, 2024
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
YA Li, C Han, X Jiang, N Mesgarani
arXiv preprint arXiv:2309.09493, 2023
Cited by 6, 2023
Learning the synaptic and intrinsic membrane dynamics underlying working memory in spiking neural network models
Y Li, R Kim, TJ Sejnowski
Neural Computation 33 (12), 3264-3287, 2021
Cited by 5, 2021
Improved decoding of attentional selection in multi-talker environments with self-supervised learned speech representation
C Han, V Choudhari, YA Li, N Mesgarani
2023 45th Annual International Conference of the IEEE Engineering in …, 2023
Cited by 4, 2023
Style-talker: Finetuning audio language model and style-based text-to-speech model for fast spoken dialogue generation
YA Li, X Jiang, J Darefsky, G Zhu, N Mesgarani
arXiv preprint arXiv:2408.11849, 2024
Cited by 3, 2024
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
YA Li, X Jiang, C Han, N Mesgarani
arXiv preprint arXiv:2409.10058, 2024
Cited by 2, 2024
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience
X Jiang, C Han, YA Li, N Mesgarani
arXiv preprint arXiv:2402.03710, 2024
Cited by 2, 2024
Supervised spike sorting using deep convolutional siamese network and hierarchical clustering
Y Li, S Tang, VR de Sa
unpublished thesis, 2019
Cited by 2, 2019
Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation
X Jiang, C Han, YA Li, N Mesgarani
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 1, 2024
DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization
YA Li, R Kumar, Z Jin
arXiv preprint arXiv:2410.11097, 2024
2024
The impact of musical expertise on disentangled and contextual neural encoding of music revealed by generative music models
G Mischler, YA Li, S Bickel, AD Mehta, N Mesgarani
bioRxiv, 2024.12.20.629729, 2024
2024
DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes
X Jiang, YA Li, N Mesgarani
arXiv preprint arXiv:2305.18441, 2023
2023