Pano-avqa: Grounded audio-visual question answering on 360deg videos H Yun, Y Yu, W Yang, K Lee, G Kim
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
91 2021 Transitional adaptation of pretrained models for visual storytelling Y Yu, J Chung, H Yun, J Kim, G Kim
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
34 2021 Multimodal knowledge alignment with reinforcement learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, P Ammanabrolu, R Zellers, ...
arXiv preprint arXiv:2205.12630, 2022
32 2022 Panoramic Vision Transformer for Saliency Detection in 360 Videos H Yun, S Lee, G Kim
European Conference on Computer Vision, 422-439, 2022
20 2022 Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, R Zellers, P Ammanabrolu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
16 2023 Character grounding and re-identification in story of videos and text descriptions Y Yu, J Kim, H Yun, J Chung, G Kim
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
7 2020 Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation H Yun, J Na, G Kim
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
5 2023 Spherical World-Locking for Audio-Visual Localization in Egocentric Videos H Yun, R Gao, I Ananthabhotla, A Kumar, J Donley, C Li, G Kim, VK Ithapu, ...
European Conference on Computer Vision, 256-274, 2024
1 2024 A mobile robot generating video summaries of seniors' indoor activities CY Yang, H Yun, S Varadaraj, JY Hsu
Proceedings of the 21st International Conference on Human-Computer …, 2019
1 2019