SyncTalkFace: Talking face generation with precise lip-syncing via audio-lip memory SJ Park, M Kim, J Hong, J Choi, YM Ro Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 2062-2070, 2022 | 81 | 2022 |
Multi-modality associative bridging through memory: Speech sound recollected from face video M Kim, J Hong, SJ Park, YM Ro Proceedings of the IEEE/CVF International Conference on Computer Vision, 296-306, 2021 | 52 | 2021 |
Lip to speech synthesis with visual context attentional GAN M Kim, J Hong, YM Ro Advances in Neural Information Processing Systems 34, 2758-2770, 2021 | 48 | 2021 |
Watch or listen: Robust audio-visual speech recognition with visual corruption modeling and reliability scoring J Hong, M Kim, J Choi, YM Ro Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 38 | 2023 |
CroMM-VSR: Cross-modal memory augmented visual speech recognition M Kim, J Hong, SJ Park, YM Ro IEEE Transactions on Multimedia 24, 4342-4355, 2021 | 33 | 2021 |
Speech reconstruction with reminiscent sound via visual voice memory J Hong, M Kim, SJ Park, YM Ro IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3654-3667, 2021 | 25 | 2021 |
Visual context-driven audio feature enhancement for robust end-to-end audio-visual speech recognition J Hong, M Kim, D Yoo, YM Ro arXiv preprint arXiv:2207.06020, 2022 | 24 | 2022 |
Lip-to-speech synthesis in the wild with multi-task learning M Kim, J Hong, YM Ro ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 22 | 2023 |
DiffV2S: Diffusion-based video-to-speech synthesis with vision-guided speaker embedding J Choi, J Hong, YM Ro Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 12 | 2023 |
Face tells detailed expression: Generating comprehensive facial expression sentence through facial action units J Hong, HJ Lee, Y Kim, YM Ro International Conference on Multimedia Modeling, 100-111, 2019 | 9 | 2019 |
Intuitive multilingual audio-visual speech recognition with a single-trained model J Hong, SJ Park, YM Ro arXiv preprint arXiv:2310.14946, 2023 | 7 | 2023 |
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation SJ Park, CW Kim, H Rha, M Kim, J Hong, JH Yeo, YM Ro arXiv preprint arXiv:2406.07867, 2024 | 6 | 2024 |
VisageSynTalk: Unseen speaker video-to-speech synthesis via speech-visage feature selection J Hong, M Kim, YM Ro European Conference on Computer Vision, 452-468, 2022 | 6 | 2022 |
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion SJ Park, J Hong, M Kim, YM Ro arXiv preprint arXiv:2310.05934, 2023 | 5 | 2023 |
Unsupervised disentangling of viewpoint and residues variations by substituting representations for robust face recognition M Kim, J Hong, J Kim, HJ Lee, YM Ro 2020 25th International Conference on Pattern Recognition (ICPR), 8952-8959, 2021 | 1 | 2021 |
Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment J Hong, S Parekh, H Chen, J Donley, K Tan, B Xu, A Kumar arXiv preprint arXiv:2501.18157, 2025 | | 2025 |
Learning Style Correlation for Elaborate Few-Shot Classification J Kim, M Kim, JU Kim, HJ Lee, S Lee, J Hong, YM Ro 2020 IEEE International Conference on Image Processing (ICIP), 1791-1795, 2020 | | 2020 |
Comprehensive Facial Expression Synthesis Using Human-Interpretable Language J Hong, JU Kim, S Lee, YM Ro 2020 IEEE International Conference on Image Processing (ICIP), 1641-1645, 2020 | | 2020 |