Follow
Xiangming Gu
Title
Cited by
Cited by
Year
Agent smith: A single image can jailbreak one million multimodal llm agents exponentially fast
X Gu, X Zheng, T Pang, C Du, Q Liu, Y Wang, J Jiang, M Lin
International Conference on Machine Learning (ICML 2024), 2024
372024
On memorization in diffusion models
X Gu, C Du, T Pang, C Li, M Lin, Y Wang
Transactions on Machine Learning Research (TMLR 2025), 2025
352025
Boosting monocular 3d human pose estimation with part aware attention
Y Xue, J Chen, X Gu, H Ma, H Ma
IEEE Transactions on Image Processing (TIP 2022) 31, 4278-4291, 2022
292022
Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
L Ou*, X Gu*, Y Wang
International Society for Music Information Retrieval Conference (ISMIR 2022), 2022
282022
MM-ALT: A multimodal automatic lyric transcription system
X Gu, L Ou, D Ong, Y Wang
Proceedings of the 30th ACM International Conference on Multimedia (MM 2022 …, 2022
142022
Extrapolative continuous-time bayesian neural network for fast training-free test-time adaptation
H Huang, X Gu, H Wang, C Xiao, H Liu, Y Wang
Advances in Neural Information Processing Systems (NeurIPS 2022) 35, 36000-36013, 2022
112022
Laser endoscopic manipulator using spring-reinforced multi-DoF soft actuator
B Zhang, P Yang, X Gu, H Liao
IEEE Robotics and Automation Letters (RAL 2021) 6 (4), 7736-7743, 2021
92021
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
X Gu, L Ou, W Zeng, J Zhang, N Wong, Y Wang
ACM Transactions on Multimedia Computing, Communications and Applications …, 2024
8*2024
Distilling a deep neural network into a Takagi-Sugeno-Kang fuzzy inference system
X Gu, X Cheng
arXiv preprint arXiv:2010.04974, 2020
82020
When attention sink emerges in language models: An empirical view
X Gu, T Pang, C Du, Q Liu, F Zhang, C Du, Y Wang, M Lin
International Conference on Learning Representations (ICLR 2025), 2025
5*2025
Elucidate gender fairness in singing voice transcription
X Gu, W Zeng, Y Wang
Proceedings of the 31st ACM International Conference on Multimedia (MM 2023 …, 2023
42023
Disentangled adversarial domain adaptation for phonation mode detection in singing and speech
Y Wang, W Wei, X Gu, X Guan, Y Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP 2023), 2023
32023
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization
W Wei, H Huang, X Gu, H Wang, Y Wang
Transactions on Machine Learning Research (TMLR 2022), 2022
22022
On Calibration of LLM-based Guard Models for Reliable Content Moderation
H Liu, H Huang, H Wang, X Gu, Y Wang
International Conference on Learning Representations (ICLR 2025), 2025
12025
Spring-reinforced pneumatic actuator and soft robotic applications
B Zhang, X Gu, J Liu, J Kang, C Hu, H Liao
Smart Materials and Structures 33 (10), 105017, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–15