Segui
Kun Su
Titolo
Citata da
Citata da
Anno
Predict & cluster: Unsupervised skeleton based action recognition
K Su, X Liu, E Shlizerman
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
2482020
Incremental learning meets transfer learning: Application to multi-site prostate mri segmentation
C You, J Xiang, K Su, X Zhang, S Dong, J Onofrey, L Staib, JS Duncan
International Workshop on Distributed, Collaborative, and Federated Learning …, 2022
702022
Audeo: Audio generation for a silent performance video
K Su, X Liu, E Shlizerman
Advances in Neural Information Processing Systems 33, 3325-3337, 2020
692020
How does it sound?
K Su, X Liu, E Shlizerman
Advances in Neural Information Processing Systems 34, 29258-29273, 2021
352021
Inras: Implicit neural representation for audio scenes
K Su, M Chen, E Shlizerman
Advances in Neural Information Processing Systems 35, 8144-8158, 2022
322022
Multi-instrumentalist net: Unsupervised generation of music from body movements
K Su, X Liu, E Shlizerman
arXiv preprint arXiv:2012.03478, 2020
252020
Physics-driven diffusion models for impact sound synthesis from videos
K Su, K Qian, E Shlizerman, A Torralba, C Gan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
242023
Clustering and recognition of spatiotemporal features through interpretable embedding of sequence to sequence recurrent neural networks
K Su, E Shlizerman
Frontiers in artificial intelligence 3, 70, 2020
152020
Bi-maml: Balanced incremental approach for meta learning
Y Zheng, J Xiang, K Su, E Shlizerman
arXiv preprint arXiv:2006.07412, 2020
132020
Parameter identification of spatial–temporal varying processes by a multi-robot system in realistic diffusion fields
W Wu, J You, Y Zhang, M Li, K Su
Robotica 39 (5), 842-861, 2021
102021
V2meow: Meowing to the visual beat via music generation
K Su, JY Li, Q Huang, D Kuzmin, J Lee, C Donahue, F Sha, A Jansen, ...
arXiv preprint arXiv:2305.06594, 2023
92023
Be everywhere-hear everything (bee): Audio scene reconstruction by sparse audio-visual samples
M Chen, K Su, E Shlizerman
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
82023
Cooperative parameter identification of advection-diffusion processes using a mobile sensor network
J You, Y Zhang, M Li, K Su, F Zhang, W Wu
2017 American Control Conference (ACC), 3230-3236, 2017
72017
V2Meow: meowing to the visual beat via video-to-music generation
K Su, JY Li, Q Huang, D Kuzmin, J Lee, C Donahue, F Sha, A Jansen, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4952-4960, 2024
52024
Tell What You Hear From What You See--Video to Audio Generation Through Text
X Liu, K Su, E Shlizerman
arXiv preprint arXiv:2411.05679, 2024
32024
UniMuMo: Unified Text, Music and Motion Generation
H Yang, K Su, Y Zhang, J Chen, K Qian, G Liu, C Gan
arXiv preprint arXiv:2410.04534, 2024
22024
From vision to audio and beyond: A unified model for audio-visual representation and generation
K Su, X Liu, E Shlizerman
arXiv preprint arXiv:2409.19132, 2024
12024
Let the beat follow you-creating interactive drum sounds from body rhythm
X Liu, K Su, E Shlizerman
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
12024
Experimental validation of diffusion coefficient identification using a multi-robot system
M Li, K Su, Y Zhang, J You, W Wu
2016 IEEE MIT Undergraduate Research Technology Conference (URTC), 1-4, 2016
12016
Diff4Steer: Steerable Diffusion Prior for Generative Music Retrieval with Semantic Guidance
X Bao, JY Li, ZY Wan, K Su, T Denk, J Lee, D Kuzmin, F Sha
arXiv preprint arXiv:2412.04746, 2024
2024
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–20