Obserwuj
Jan Melechovsky
Tytuł
Cytowane przez
Cytowane przez
Rok
Mustango: Toward controllable text-to-music generation
J Melechovsky, Z Guo, D Ghosal, N Majumder, D Herremans, S Poria
arXiv preprint arXiv:2311.08355, 2023
482023
Comparison of automated acoustic methods for oral diadochokinesis assessment in amyotrophic lateral sclerosis
M Novotny, J Melechovsky, K Rozenstoks, T Tykalova, P Kryze, M Kanok, ...
Journal of Speech, Language, and Hearing Research 63 (10), 3453-3460, 2020
212020
Alzheimer’s dementia speech (audio vs. text): Multi-modal machine learning at high vs. low resolution
P Priyadarshinee, CJ Clarke, J Melechovsky, CMY Lin, B BT, JM Chen
Applied Sciences 13 (7), 4244, 2023
132023
Learning accent representation with multi-level vae towards controllable speech synthesis
J Melechovsky, A Mehrish, D Herremans, B Sisman
2022 IEEE Spoken Language Technology Workshop (SLT), 928-935, 2023
62023
Accented text-to-speech synthesis with a conditional variational autoencoder
J Melechovsky, A Mehrish, B Sisman, D Herremans
arXiv preprint arXiv:2211.03316, 2022
52022
MidiCaps: A large-scale MIDI dataset with text captions
J Melechovsky, A Roy, D Herremans
arXiv preprint arXiv:2406.02255, 2024
22024
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training
J Melechovsky, A Mehrish, B Sisman, D Herremans
arXiv preprint arXiv:2406.01018, 2024
12024
Drum Kit sound localization tests on binaural hearing model with ANN
J Melechovský, J Bouše, F Rund, E Koshkina
2018 28th International Conference Radioelektronika (RADIOELEKTRONIKA), 1-5, 2018
12018
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech
J Melechovsky, A Mehrish, B Sisman, D Herremans
arXiv preprint arXiv:2410.13342, 2024
2024
ADDRESSING MULTI-MODAL MULTI-MODEL MULTI-FEATURE CUES IN ALZHEIMER’S DEMENTIA
CJ Clarke, J Melechovsky, CMY Lin, P Priyadarshinee, BT Balamurali, ...
2022
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–10