Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models BK Lee, CW Kim, B Park, YM Ro arXiv preprint arXiv:2405.15574, 2024 | 16 | 2024 |
Moai: Mixture of all intelligence for large language and vision models BK Lee, B Park, C Won Kim, Y Man Ro European Conference on Computer Vision, 273-302, 2024 | 13 | 2024 |
Collavo: Crayon large language and vision model BK Lee, B Park, CW Kim, YM Ro arXiv preprint arXiv:2402.11248, 2024 | 13 | 2024 |
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation SJ Park, CW Kim, H Rha, M Kim, J Hong, JH Yeo, YM Ro arXiv preprint arXiv:2406.07867, 2024 | 6 | 2024 |
TroL: Traversal of Layers for Large Language and Vision Models BK Lee, S Chung, CW Kim, B Park, YM Ro arXiv preprint arXiv:2406.12246, 2024 | 5 | 2024 |
Phantom of latent for large language and vision models BK Lee, S Chung, CW Kim, B Park, YM Ro arXiv preprint arXiv:2409.14713, 2024 | 3 | 2024 |
Deep visual forced alignment: learning to align transcription with talking face video M Kim, CW Kim, YM Ro Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 8273-8281, 2023 | 1 | 2023 |
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language JH Yeo, CW Kim, H Kim, H Rha, S Han, WH Cheng, YM Ro arXiv preprint arXiv:2409.00986, 2024 | | 2024 |
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation S Han, SJ Park, CW Kim, YM Ro ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |