Susan Liang
Verified email at ur.rochester.edu
Title
Cited by
Year
Video understanding with large language models: A survey
Y Tang, J Bi, S Xu, L Song, S Liang, T Wang, D Zhang, J An, J Lin, R Zhu, ...
arXiv preprint arXiv:2312.17432, 2023
Cited by 63, 2023
Unicon: Unified context network for robust active speaker detection
Y Zhang, S Liang, S Yang, X Liu, Z Wu, S Shan, X Chen
Proceedings of the 29th ACM international conference on multimedia, 3964-3972, 2021
Cited by 45, 2021
Av-nerf: Learning neural fields for real-world audio-visual scene synthesis
S Liang, C Huang, Y Tian, A Kumar, C Xu
Advances in Neural Information Processing Systems 36, 37472-37490, 2023
Cited by 21, 2023
Neural acoustic context field: Rendering realistic room impulse response with neural fields
S Liang, C Huang, Y Tian, A Kumar, C Xu
arXiv preprint arXiv:2309.15977, 2023
Cited by 12, 2023
Random smooth-based certified defense against text adversarial attack
Z Zhang, W Yao, S Liang, C Xu
Findings of the Association for Computational Linguistics: EACL 2024, 1251-1265, 2024
Cited by 9, 2024
Unicon+: Ictcas-ucas submission to the ava-activespeaker task at activitynet challenge 2022
Y Zhang, S Liang, S Yang, S Shan
arXiv preprint arXiv:2206.10861, 2022
Cited by 6, 2022
Learning to transform dynamically for better adversarial transferability
R Zhu, Z Zhang, S Liang, Z Liu, C Xu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 5, 2024
DAVIS: High-quality audio-visual separation with generative diffusion models
C Huang, S Liang, Y Tian, A Kumar, C Xu
Cited by 5, 2023
Ictcas-ucas-tal submission to the ava-activespeaker task at activitynet challenge 2021
Y Zhang, S Liang, S Yang, X Liu, Z Wu, S Shan
The ActivityNet Large-Scale Activity Recognition Challenge 1 (3), 4, 2021
Cited by 4, 2021
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
Y Tang, J Guo, H Hua, S Liang, M Feng, X Li, R Mao, C Huang, J Bi, ...
arXiv preprint arXiv:2411.10979, 2024
Cited by 1, 2024
Scaling Concept With Text-Guided Diffusion Models
C Huang, S Liang, Y Tang, Y Tian, A Kumar, C Xu
arXiv preprint arXiv:2410.24151, 2024
Cited by 1, 2024
Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models?
Z Zhang, X Liang, M Feng, S Liang, C Xu
arXiv preprint arXiv:2410.10160, 2024
Cited by 1, 2024
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
S Liang, C Huang, Y Tian, A Kumar, C Xu
Proceedings of the Asian Conference on Computer Vision, 1011-1027, 2024
Cited by 1, 2024
Scalable CP Decomposition for Tensor Learning using GPU Tensor Cores
Z Zhang, Z Liu, S Liang, Z Wang, Y Zhu, C Ding, C Xu
arXiv preprint arXiv:2311.13693, 2023
Cited by 1, 2023
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives
Z Zhang, S Liang, D Shimada, C Xu
arXiv preprint arXiv:2502.11858, 2025
2025
From 16-Bit to 1-Bit: Visual KV Cache Quantization for Memory-Efficient Multimodal Large Language Models
Z Zhang, Y Zhu, S Liang, Z Wang, J Liu, H Lin, M Zhao, C Xu, K Wan, ...
arXiv preprint arXiv:2502.14882, 2025
2025
Generative AI for Cel-Animation: A Survey
Y Tang, J Guo, P Liu, Z Wang, H Hua, JX Zhong, Y Xiao, C Huang, L Song, ...
arXiv preprint arXiv:2501.06250, 2025
2025
Neural radiance field systems and methods for synthesis of audio-visual scenes
XU Chenliang, S Liang, C Huang, Y Tian, FNUA Kumar
US Patent App. 18/431,491, 2024
2024
Approximated Likelihood Ratio: A Forward-Only and Parallel Framework for Boosting Neural Network Training
Z Zhang, J Jiang, Z Liu, S Liang, Y Peng, C Xu
arXiv preprint arXiv:2403.12320, 2024
2024
High-Quality Visually-Guided Sound Separation from Diverse Categories
C Huang, S Liang, Y Tian, A Kumar, C Xu
Proceedings of the Asian Conference on Computer Vision, 35-49, 2024
2024