Yinghao Aaron Li

Geciteerd door

	Alles	Sinds 2020
Citaties	422	421
h-index	8	8
i10-index	7	7

240

120

180

2020202120222023202420258 21 31 93 228 40

Openbare toegang

Alles bekijken

6 artikelen

0 artikelen

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Nima MesgaraniAssociate Professor, Columbia UniversityGeverifieerd e-mailadres voor ee.columbia.edu
Cong HanGoogle, Columbia UniversityGeverifieerd e-mailadres voor columbia.edu
Xilin JiangPhD student, Columbia UniversityGeverifieerd e-mailadres voor columbia.edu
Gavin MischlerPhD Student at Columbia UniversityGeverifieerd e-mailadres voor columbia.edu
Robert KimMD/PhD Student, UCSD, Salk InstituteGeverifieerd e-mailadres voor ucsd.edu
Terrence SejnowskiFrancis Crick Professor, Salk Institute, Distingished Professor, UC San DiegoGeverifieerd e-mailadres voor salk.edu
Ali ZareColumbia UniversityGeverifieerd e-mailadres voor columbia.edu
Vinay S RaghavanPostdoctoral Fellow, The City College of New YorkGeverifieerd e-mailadres voor ccny.cuny.edu
Vishal ChoudhariElectrical Engineering Ph.D. Student, Columbia UniversityGeverifieerd e-mailadres voor columbia.edu
Virginia de SaProfessor of Cognitive Science, Associate Director of the Halicioglu Data Science Institute, UCSDGeverifieerd e-mailadres voor ucsd.edu
Zeyu JinAdobe ResearchGeverifieerd e-mailadres voor adobe.com
Shuai TangQuant Research @ Jump Trading
Rithesh KumarAdobe ResearchGeverifieerd e-mailadres voor adobe.com

Volgen

Yinghao Aaron Li

PhD Student, Columbia University

Geverifieerd e-mailadres voor columbia.edu

Computational Neuroscience Voice Conversion Speech Synthesis


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
Starganv2-vc: A diverse, unsupervised, non-parallel framework for natural-sounding voice conversion YA Li, A Zare, N Mesgarani arXiv preprint arXiv:2107.10394, 2021	111	2021
Styletts 2: Towards human-level text-to-speech through style diffusion and adversarial training with large speech language models YA Li, C Han, V Raghavan, G Mischler, N Mesgarani Advances in Neural Information Processing Systems 36, 2024	98	2024
Simple framework for constructing functional spiking recurrent neural networks R Kim, Y Li, TJ Sejnowski Proceedings of the national academy of sciences 116 (45), 22811-22820, 2019	76	2019
Styletts: A style-based generative model for natural and diverse text-to-speech synthesis YA Li, C Han, N Mesgarani IEEE Journal of Selected Topics in Signal Processing, 2025	48	2025
Styletts-vc: One-shot voice conversion by knowledge transfer from style-based tts models YA Li, C Han, N Mesgarani 2022 IEEE Spoken Language Technology Workshop (SLT), 920-927, 2023	19	2023
Phoneme-level bert for enhanced prosody of text-to-speech with grapheme predictions YA Li, C Han, X Jiang, N Mesgarani ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	16	2023
Contextual feature extraction hierarchies converge in large language models and the brain G Mischler, YA Li, S Bickel, AD Mehta, N Mesgarani Nature Machine Intelligence, 1-11, 2024	14	2024
Slmgan: Exploiting speech language model representations for unsupervised zero-shot voice conversion in gans YA Li, C Han, N Mesgarani 2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023	8	2023
Speech slytherin: Examining the performance and efficiency of mamba for speech separation, recognition, and synthesis X Jiang, YA Li, AN Florea, C Han, N Mesgarani arXiv preprint arXiv:2407.09732, 2024	7	2024
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform YA Li, C Han, X Jiang, N Mesgarani arXiv preprint arXiv:2309.09493, 2023	6	2023
Learning the synaptic and intrinsic membrane dynamics underlying working memory in spiking neural network models Y Li, R Kim, TJ Sejnowski Neural Computation 33 (12), 3264-3287, 2021	5	2021
Improved decoding of attentional selection in multi-talker environments with self-supervised learned speech representation C Han, V Choudhari, YA Li, N Mesgarani 2023 45th Annual International Conference of the IEEE Engineering in …, 2023	4	2023
Style-talker: Finetuning audio language model and style-based text-to-speech model for fast spoken dialogue generation YA Li, X Jiang, J Darefsky, G Zhu, N Mesgarani arXiv preprint arXiv:2408.11849, 2024	3	2024
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion YA Li, X Jiang, C Han, N Mesgarani arXiv preprint arXiv:2409.10058, 2024	2	2024
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience X Jiang, C Han, YA Li, N Mesgarani arXiv preprint arXiv:2402.03710, 2024	2	2024
Supervised spike sorting using deep convolutional siamese network and hierarchical clustering Y Li, S Tang, VR de Sa unpublished thesis, 2019	2	2019
Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation X Jiang, C Han, YA Li, N Mesgarani ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
DMDSpeech: Distilled Diffusion Model Surpassing The Teacher in Zero-shot Speech Synthesis via Direct Metric Optimization YA Li, R Kumar, Z Jin arXiv preprint arXiv:2410.11097, 2024		2024
The impact of musical expertise on disentangled and contextual neural encoding of music revealed by generative music models G Mischler, YA Li, S Bickel, AD Mehta, N Mesgarani bioRxiv, 2024.12. 20.629729, 2024		2024
DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes X Jiang, YA Li, N Mesgarani arXiv preprint arXiv:2305.18441, 2023		2023

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs