BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis H Liu, Z Zhu, N Iwamoto, Y Peng, Z Li, Y Zhou, E Bozkurt, B Zheng European Conference on Computer Vision (ECCV), 2022 | 139 | 2022 |
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling H Liu, Z Zhu, G Becherini, Y Peng, M Su, N Iwamoto, B Zheng, MJ Black Computer Vision and Pattern Recognition Conference (CVPR), 2024 | 44* | 2024 |
DisCo: Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gestures Synthesis H Liu, N Iwamoto, Z Zhu, Z Li, Y Zhou, E Bozkurt, B Zheng ACM International Conference on Multimedia (ACMMM), 2022 | 37 | 2022 |
Reinforcement learning based neural architecture search for audio tagging H Liu, C Zhang International Joint Conference on Neural Networks (IJCNN), 2020 | 12 | 2020 |
Improving Ultrasound Tongue Image Reconstruction from Lip Images Using Self-supervised Learning and Attention Mechanism H Liu, J Zhang ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD Workshop), 2021 | 7 | 2021 |
Resolution irrelevant encoding and difficulty balanced loss based network independent supervision for multi-person pose estimation H Liu, D Luo, S Du, T Ikenaga International Conference on Human System Interaction (HSI), 2020 | 5 | 2020 |
TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio Motion Embedding and Diffusion Interpolation H Liu, X Yang, T Akiyama, Y Huang, Q Li, S Kuriyama, T Taketomi International Conference on Learning Representations (LCLR, Oral), 2024 | 4 | 2024 |
SandGAN: Style-Mix Assisted Noise Distortion for Imbalanced Conditional Image Synthesis H Liu, Y Endo, J Lee, S Kamijo Neurocomputing, 2023 | 4 | 2023 |
Free-viewpoint Human Animation with Pose-correlated Reference Selection FT Hong, Z Xu, H Liu, Q Lin, L Song, Z Shu, Y Zhou, D Ceylan, D Xu arXiv preprint arXiv:2412.17290, 2024 | | 2024 |