Predict & cluster: Unsupervised skeleton based action recognition K Su, X Liu, E Shlizerman Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 248 | 2020 |
Incremental learning meets transfer learning: Application to multi-site prostate MRI segmentation C You, J Xiang, K Su, X Zhang, S Dong, J Onofrey, L Staib, JS Duncan International Workshop on Distributed, Collaborative, and Federated Learning …, 2022 | 70 | 2022 |
Audeo: Audio generation for a silent performance video K Su, X Liu, E Shlizerman Advances in Neural Information Processing Systems 33, 3325-3337, 2020 | 69 | 2020 |
How does it sound? K Su, X Liu, E Shlizerman Advances in Neural Information Processing Systems 34, 29258-29273, 2021 | 35 | 2021 |
INRAS: Implicit neural representation for audio scenes K Su, M Chen, E Shlizerman Advances in Neural Information Processing Systems 35, 8144-8158, 2022 | 32 | 2022 |
Multi-instrumentalist net: Unsupervised generation of music from body movements K Su, X Liu, E Shlizerman arXiv preprint arXiv:2012.03478, 2020 | 25 | 2020 |
Physics-driven diffusion models for impact sound synthesis from videos K Su, K Qian, E Shlizerman, A Torralba, C Gan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 24 | 2023 |
Clustering and recognition of spatiotemporal features through interpretable embedding of sequence to sequence recurrent neural networks K Su, E Shlizerman Frontiers in Artificial Intelligence 3, 70, 2020 | 15 | 2020 |
Bi-MAML: Balanced incremental approach for meta learning Y Zheng, J Xiang, K Su, E Shlizerman arXiv preprint arXiv:2006.07412, 2020 | 13 | 2020 |
Parameter identification of spatial–temporal varying processes by a multi-robot system in realistic diffusion fields W Wu, J You, Y Zhang, M Li, K Su Robotica 39 (5), 842-861, 2021 | 10 | 2021 |
V2Meow: Meowing to the visual beat via music generation K Su, JY Li, Q Huang, D Kuzmin, J Lee, C Donahue, F Sha, A Jansen, ... arXiv preprint arXiv:2305.06594, 2023 | 9 | 2023 |
Be everywhere - hear everything (BEE): Audio scene reconstruction by sparse audio-visual samples M Chen, K Su, E Shlizerman Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 8 | 2023 |
Cooperative parameter identification of advection-diffusion processes using a mobile sensor network J You, Y Zhang, M Li, K Su, F Zhang, W Wu 2017 American Control Conference (ACC), 3230-3236, 2017 | 7 | 2017 |
V2Meow: meowing to the visual beat via video-to-music generation K Su, JY Li, Q Huang, D Kuzmin, J Lee, C Donahue, F Sha, A Jansen, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4952-4960, 2024 | 5 | 2024 |
Tell What You Hear From What You See--Video to Audio Generation Through Text X Liu, K Su, E Shlizerman arXiv preprint arXiv:2411.05679, 2024 | 3 | 2024 |
UniMuMo: Unified Text, Music and Motion Generation H Yang, K Su, Y Zhang, J Chen, K Qian, G Liu, C Gan arXiv preprint arXiv:2410.04534, 2024 | 2 | 2024 |
From vision to audio and beyond: A unified model for audio-visual representation and generation K Su, X Liu, E Shlizerman arXiv preprint arXiv:2409.19132, 2024 | 1 | 2024 |
Let the beat follow you - creating interactive drum sounds from body rhythm X Liu, K Su, E Shlizerman Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 1 | 2024 |
Experimental validation of diffusion coefficient identification using a multi-robot system M Li, K Su, Y Zhang, J You, W Wu 2016 IEEE MIT Undergraduate Research Technology Conference (URTC), 1-4, 2016 | 1 | 2016 |
Diff4Steer: Steerable Diffusion Prior for Generative Music Retrieval with Semantic Guidance X Bao, JY Li, ZY Wan, K Su, T Denk, J Lee, D Kuzmin, F Sha arXiv preprint arXiv:2412.04746, 2024 | | 2024 |