Osv: One step is enough for high-quality image to video generation X Mao, Z Jiang, FY Wang, W Zhu, J Zhang, H Chen, M Chi, Y Wang arXiv preprint arXiv:2409.11367, 2024 | 5 | 2024 |
Mdt-a2g: Exploring masked diffusion transformers for co-speech gesture generation X Mao, Z Jiang, Q Wang, C Fu, J Zhang, J Wu, Y Wang, C Wang, W Li, ... Proceedings of the 32nd ACM International Conference on Multimedia, 3266-3274, 2024 | 1 | 2024 |
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion C Fu, Y Wang, J Zhang, Z Jiang, X Mao, J Wu, W Cao, C Wang, Y Ge, ... Proceedings of the 32nd ACM International Conference on Multimedia, 10794-10803, 2024 | 1 | 2024 |
VI3DRM: Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis H Chen, J Wu, Y Jin, J Peng, X Mao, M Chi, M Yao, B Peng, J Li, Y Cao arXiv preprint arXiv:2409.08207, 2024 | | 2024 |