ViLT: Vision-and-language transformer without convolution or region supervision W Kim, B Son, I Kim Proceedings of the 38th International Conference on Machine Learning (ICML …, 2021 | 1907 | 2021 |
ChartSense: Interactive data extraction from chart images D Jung, W Kim, H Song, J Hwang, B Lee, B Kim, J Seo Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems …, 2017 | 195 | 2017 |
ViDT: An efficient and effective fully transformer-based object detector H Song, D Sun, S Chun, V Jampani, D Han, B Heo, W Kim, MH Yang arXiv preprint arXiv:2110.03921, 2021 | 110 | 2021 |
What Do Self-Supervised Vision Transformers Learn? N Park, W Kim, B Heo, T Kim, S Yun The Eleventh International Conference on Learning Representations, 2023 | 88 | 2023 |
CompoDiff: Versatile composed image retrieval with latent diffusion G Gu, S Chun, W Kim, HJ Jun, Y Kang, S Yun arXiv preprint arXiv:2303.11916, 2023 | 51 | 2023 |
ECCV Caption: Correcting false negatives by collecting machine-and-human-verified image-caption associations for MS-COCO S Chun, W Kim, S Park, M Chang, SJ Oh European Conference on Computer Vision, 1-19, 2022 | 45 | 2022 |
Language-only training of zero-shot composed image retrieval G Gu, S Chun, W Kim, Y Kang, S Yun Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 35 | 2024 |
An extendable, efficient and effective transformer-based object detector H Song, D Sun, S Chun, V Jampani, D Han, B Heo, W Kim, MH Yang arXiv preprint arXiv:2204.07962, 2022 | 22 | 2022 |
Unified chest X-ray and radiology report generation model with multi-view chest X-rays H Lee, W Kim, JH Kim, T Kim, J Kim, L Sunwoo, E Choi arXiv preprint arXiv:2302.12172 3 (7), 8, 2023 | 20 | 2023 |
Speeding up inference with user simulators through policy modulation HS Moon, S Do, W Kim, J Seo, M Chang, B Lee Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems …, 2022 | 18 | 2022 |
Diversified mutual learning for deep metric learning W Park, W Kim, K You, M Cho Computer Vision–ECCV 2020 Workshops: Glasgow, UK, August 23–28, 2020 …, 2020 | 12 | 2020 |
SwiftTuna: Responsive and incremental visual exploration of large-scale multidimensional data J Jo, W Kim, S Yoo, B Kim, J Seo 2017 IEEE Pacific Visualization Symposium (PacificVis), 131-140, 2017 | 12 | 2017 |
Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning W Kim, Y Lee Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019), 2019 | 11 | 2019 |
SeiT: Storage-efficient vision training with tokens using 1% of pixel storage S Park, S Chun, B Heo, W Kim, S Yun Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 10 | 2023 |
HYPE: Hyperbolic entailment filtering for underspecified images and texts W Kim, S Chun, T Kim, D Han, S Yun European Conference on Computer Vision, 247-265, 2024 | 6 | 2024 |
Vision-Language Generative Model for View-Specific Chest X-ray Generation H Lee, DY Lee, W Kim, JH Kim, T Kim, J Kim, L Sunwoo, E Choi arXiv preprint arXiv:2302.12172, 2023 | 5 | 2023 |
Pivotal role of language modeling in recommender systems: Enriching task-specific and task-agnostic representation learning K Shin, H Kwak, W Kim, J Jeong, S Jung, KM Kim, JW Ha, SW Lee arXiv preprint arXiv:2212.03760, 2022 | 5 | 2022 |
Correlation between alignment-uniformity and performance of dense contrastive representations JH Moon, W Kim, E Choi arXiv preprint arXiv:2210.08819, 2022 | 5 | 2022 |
Discrete Infomax Codes for Supervised Representation Learning Y Lee, W Kim, W Park, S Choi arXiv preprint arXiv:1905.11656, 2019 | 5 | 2019 |
Reducing task discrepancy of text encoders for zero-shot composed image retrieval J Byun, S Jeong, W Kim, S Chun, T Moon arXiv preprint arXiv:2406.09188, 2024 | 4 | 2024 |