Predict & cluster: Unsupervised skeleton based action recognition K Su, X Liu, E Shlizerman Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 252 | 2020 |
Audeo: Audio generation for a silent performance video K Su, X Liu, E Shlizerman Advances in Neural Information Processing Systems 33, 3325-3337, 2020 | 72 | 2020 |
MuseChat: A Conversational Music Recommendation System for Videos Z Dong, X Liu, B Chen, P Polak, P Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 27 | 2024 |
How does it sound? K Su, X Liu, E Shlizerman Advances in Neural Information Processing Systems 34, 29258-29273, 2021 | 27 | 2021 |
Multi-instrumentalist net: Unsupervised generation of music from body movements K Su, X Liu, E Shlizerman arXiv preprint arXiv:2012.03478, 2020 | 25 | 2020 |
Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering X Liu, Z Dong, P Zhang Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 17 | 2024 |
How does it sound? generation of rhythmic soundtracks for human movement videos K Su, X Liu, E Shlizerman Conf. Neural Inf. Process. Syst 35, 0-10, 2021 | 8 | 2021 |
Calo-VQ: Vector-Quantized Two-Stage Generative Model in Calorimeter Simulation Q Liu, C Shimmin, X Liu, E Shlizerman, S Li, SC Hsu arXiv preprint arXiv:2405.06605, 2024 | 7 | 2024 |
Calochallenge 2022: A community challenge for fast calorimeter simulation C Krause, MF Giannelli, G Kasieczka, B Nachman, D Salamani, D Shih, ... arXiv preprint arXiv:2410.21611, 2024 | 6 | 2024 |
Tell What You Hear From What You See--Video to Audio Generation Through Text X Liu, K Su, E Shlizerman arXiv preprint arXiv:2411.05679, 2024 | 3 | 2024 |
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments X Liu, S Paul, M Chatterjee, A Cherian Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3765-3773, 2024 | 2 | 2024 |
Active Sparse Conversations for Improved Audio-Visual Embodied Navigation X Liu, S Paul, M Chatterjee, A Cherian arXiv preprint arXiv:2306.04047v2, 2023 | 2 | 2023 |
Let the Beat Follow You-Creating Interactive Drum Sounds From Body Rhythm X Liu, K Su, E Shlizerman Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 1 | 2024 |
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation K Su, X Liu, E Shlizerman Forty-first International Conference on Machine Learning, 0 | 1* | |
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation K Su, X Liu, E Shlizerman Proceedings of the 41st International Conference on Machine Learning, PMLR …, 2024 | | 2024 |