Consistency trajectory models: Learning probability flow ode trajectory of diffusion D Kim, CH Lai, WH Liao, N Murata, Y Takida, T Uesaka, Y He, Y Mitsufuji, ... arXiv preprint arXiv:2310.02279, 2023 | 120 | 2023 |
Sq-vae: Variational bayes on discrete representation with self-annealed stochastic quantization Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ... arXiv preprint arXiv:2205.07547, 2022 | 67 | 2022 |
Manifold preserving guided diffusion Y He, N Murata, CH Lai, Y Takida, T Uesaka, D Kim, WH Liao, Y Mitsufuji, ... arXiv preprint arXiv:2311.16424, 2023 | 35 | 2023 |
Automatic music mixing with deep learning and out-of-domain data MA Martínez-Ramírez, WH Liao, G Fabbro, S Uhlich, C Nagashima, ... arXiv preprint arXiv:2208.11428, 2022 | 27 | 2022 |
Automatic piano transcription with hierarchical frequency-time transformer K Toyama, T Akama, Y Ikemiya, Y Takida, WH Liao, Y Mitsufuji arXiv preprint arXiv:2307.04305, 2023 | 24 | 2023 |
The Sound Demixing Challenge 2023$\unicode {x2013} $ Music Demixing Track G Fabbro, S Uhlich, CH Lai, W Choi, M Martínez-Ramírez, W Liao, ... arXiv preprint arXiv:2308.06979, 2023 | 21 | 2023 |
Music mixing style transfer: A contrastive learning approach to disentangle audio effects J Koo, MA Martínez-Ramírez, WH Liao, S Uhlich, K Lee, Y Mitsufuji ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 19 | 2023 |
Musicmagus: Zero-shot text-to-music editing via diffusion models Y Zhang, Y Ikemiya, G Xia, N Murata, MA Martínez-Ramírez, WH Liao, ... arXiv preprint arXiv:2402.06178, 2024 | 17 | 2024 |
Automatic DJ transitions with differentiable audio effects and generative adversarial networks BY Chen, WH Hsu, WH Liao, MAM Ramírez, Y Mitsufuji, YH Yang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 17 | 2022 |
Preventing posterior collapse induced by oversmoothing in gaussian VAE Y Takida, WH Liao, T Uesaka, S Takahashi, Y Mitsufuji arXiv preprint arXiv:2102.08663 3 (5), 6, 2021 | 17 | 2021 |
On the modeling of sound textures based on the STFT representation WH Liao, A Roebel, AWY Su Proc. of the 16th Int. Conference on Digital Audio Effects (DAFx-13), 33, 2013 | 16 | 2013 |
Preventing oversmoothing in VAE via generalized variance parameterization Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji Neurocomputing 509, 137-156, 2022 | 11 | 2022 |
On stretching gaussian noises with the phase vocoder WH Liao, A Roebel, AWY Su Proc. of the 15th Int. Conference on Digital Audio Effects (DAFx-12), 41, 2012 | 11 | 2012 |
Vrdmg: Vocal restoration via diffusion posterior sampling with multiple guidance C Hernandez-Olivan, K Saito, N Murata, CH Lai, MA Martínez-Ramirez, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9 | 2024 |
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ... arXiv preprint arXiv:2401.00365, 2023 | 8 | 2023 |
Transparency in music-generative AI: A systematic literature review R Batlle-Roca, E Gómez, WH Liao, X Serra, Y Mitsufuji | 8 | 2023 |
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning Y Zhang, Y Ikemiya, W Choi, N Murata, MA Martínez-Ramírez, L Lin, G Xia, ... arXiv preprint arXiv:2405.18386, 2024 | 7 | 2024 |
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher D Kim, CH Lai, WH Liao, Y Takida, N Murata, T Uesaka, Y Mitsufuji, ... arXiv preprint arXiv:2405.14822, 2024 | 7 | 2024 |
Modelling and transformation of sound textures and environmental sounds WH Liao Université Pierre et Marie Curie-Paris VI; National Cheng Kung University …, 2015 | 7 | 2015 |
Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription F Cwitkowitz, KW Cheuk, W Choi, MA Martínez-Ramírez, K Toyama, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |