Paul Hongsuck Seo
Title
Cited by
Year
Image question answering using convolutional neural network with dynamic parameter prediction
H Noh, PH Seo, B Han
Proceedings of the IEEE conference on computer vision and pattern …, 2016
Cited by 427, 2016
Vid2seq: Large-scale pretraining of a visual language model for dense video captioning
A Yang, A Nagrani, PH Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 238, 2023
End-to-end generative pretraining for multimodal video captioning
PH Seo, A Nagrani, A Arnab, C Schmid
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 204, 2022
Visual reference resolution using attention memory for visual dialog
PH Seo, A Lehrmann, B Han, L Sigal
Advances in neural information processing systems 30, 2017
Cited by 139, 2017
Marioqa: Answering questions by watching gameplay videos
J Mun, P Hongsuck Seo, I Jung, B Han
Proceedings of the IEEE International Conference on Computer Vision, 2867-2875, 2017
Cited by 118, 2017
Learning audio-video modalities from image captions
A Nagrani, PH Seo, B Seybold, A Hauth, S Manen, C Sun, C Schmid
European Conference on Computer Vision, 407-426, 2022
Cited by 97, 2022
Attentive semantic alignment with offset-aware correlation kernels
P Hongsuck Seo, J Lee, D Jung, B Han, M Cho
arXiv e-prints, arXiv: 1808.02128, 2018
Cited by 92*, 2018
Cat-seg: Cost aggregation for open-vocabulary semantic segmentation
S Cho, H Shin, S Hong, A Arnab, PH Seo, S Kim
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 88, 2024
Look before you speak: Visually contextualized utterances
PH Seo, A Nagrani, C Schmid
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by 84, 2021
Learning for single-shot confidence calibration in deep neural networks through stochastic inferences
S Seo, PH Seo, B Han
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
Cited by 84, 2019
Progressive attention networks for visual attribute prediction
PH Seo, Z Lin, S Cohen, X Shen, B Han
arXiv preprint arXiv:1606.02393, 2016
Cited by 81*, 2016
Cplanet: Enhancing image geolocalization by combinatorial partitioning of maps
PH Seo, T Weyand, J Sim, B Han
Proceedings of the European Conference on Computer Vision (ECCV), 536-551, 2018
Cited by 78, 2018
Zero-shot referring image segmentation with global-local context features
S Yu, PH Seo, J Son
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 54, 2023
Reinforcing an image caption generator using off-line human feedback
PH Seo, P Sharma, T Levinboim, B Han, R Soricut
Proceedings of the AAAI Conference on Artificial Intelligence 34 (03), 2693-2700, 2020
Cited by 29, 2020
Combinatorial inference against label noise
PH Seo, G Kim, B Han
Advances in neural information processing systems 32, 2019
Cited by 21, 2019
Ifseg: Image-free semantic segmentation via vision-language model
S Yun, SH Park, PH Seo, J Shin
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 16, 2023
Method and apparatus for correcting speech recognition error
GB Lee, JH Choi, IJ Lee, DH Lee, HS Seo, YH Kim, SH Ryu, SJ Koo
US Patent 9,318,102, 2016
Cited by 15, 2016
Conversational knowledge teaching agent that uses a knowledge base
K Lee, H Seo, J Choi, S Koo, GG Lee
Proceedings of the 16th Annual Meeting of the Special Interest Group on …, 2015
Cited by 14, 2015
Learning correlation structures for vision transformers
M Kim, PH Seo, C Schmid, M Cho
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 12, 2024
Avatar: Unconstrained audiovisual speech recognition
V Gabeur, PH Seo, A Nagrani, C Sun, K Alahari, C Schmid
arXiv preprint arXiv:2206.07684, 2022
Cited by 12, 2022