Paul Hongsuck Seo

Cited by

	All	Since 2020
Citations	1952	1555
h-index	16	16
i10-index	22	20

Citations per year

Year	Citations
2016	32
2017	79
2018	115
2019	150
2020	148
2021	176
2022	197
2023	366
2024	613
2025	51

Open access

1 article available
0 articles not available

Based on funding mandates

Co-authors

Bohyung Han, Professor, Seoul National University. Verified email at snu.ac.kr
Arsha Nagrani, Research Scientist, Google. Verified email at google.com
Cordelia Schmid, Research director INRIA. Verified email at inria.fr
Anurag Arnab, Google DeepMind. Verified email at google.com
Minsu Cho, Mu-Eun-Jae Endowed Chair Professor, Associate Professor of CSE & AI, POSTECH. Verified email at postech.ac.kr
Gary Geunbae Lee, Professor of Computer Science and Engineering, POSTECH. Verified email at postech.ac.kr
Seungryong Kim, Associate Professor, KAIST. Verified email at kaist.ac.kr
Antoine Miech, Google DeepMind. Verified email at google.com
Ivan Laptev, Professor at MBZUAI, on leave from INRIA. Verified email at inria.fr
Jordi Pont-Tuset, Research Scientist at Google DeepMind. Verified email at google.com
Josef Sivic, Czech Technical University, CIIRC, ELLIS Unit Prague. Verified email at cvut.cz
Kyusong Lee, Language Technologies Institute, Carnegie Mellon University. Verified email at andrew.cmu.edu
Hyeonwoo Noh, OpenAI. Verified email at openai.com
Sunghwan Hong, Ph.D Candidate Computer Science, Korea University. Verified email at korea.ac.kr
Seokju Cho, KAIST. Verified email at kaist.ac.kr
Jeany Son, GIST. Verified email at gist.ac.kr
Zhe L. Lin, Senior Principal Scientist, Adobe Research. Verified email at adobe.com
Scott Cohen, Adobe Research. Verified email at adobe.com
Andreas M. Lehrmann, Facebook Reality Labs. Verified email at fb.com
Leonid Sigal, Professor, University of British Columbia. Verified email at cs.ubc.ca

Paul Hongsuck Seo

Korea University

Verified email at korea.ac.kr

Multimodal Interactive Intelligence Vision Speech and Language Understanding


Title	Authors	Venue	Cited by	Year
Image question answering using convolutional neural network with dynamic parameter prediction	H Noh, PH Seo, B Han	Proceedings of the IEEE conference on computer vision and pattern …, 2016	427	2016
Vid2Seq: Large-scale pretraining of a visual language model for dense video captioning	A Yang, A Nagrani, PH Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	238	2023
End-to-end generative pretraining for multimodal video captioning	PH Seo, A Nagrani, A Arnab, C Schmid	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	204	2022
Visual reference resolution using attention memory for visual dialog	PH Seo, A Lehrmann, B Han, L Sigal	Advances in neural information processing systems 30, 2017	139	2017
MarioQA: Answering questions by watching gameplay videos	J Mun, P Hongsuck Seo, I Jung, B Han	Proceedings of the IEEE International Conference on Computer Vision, 2867-2875, 2017	118	2017
Learning audio-video modalities from image captions	A Nagrani, PH Seo, B Seybold, A Hauth, S Manen, C Sun, C Schmid	European Conference on Computer Vision, 407-426, 2022	97	2022
Attentive semantic alignment with offset-aware correlation kernels	P Hongsuck Seo, J Lee, D Jung, B Han, M Cho	arXiv e-prints, arXiv: 1808.02128, 2018	92*	2018
CAT-Seg: Cost aggregation for open-vocabulary semantic segmentation	S Cho, H Shin, S Hong, A Arnab, PH Seo, S Kim	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	88	2024
Look before you speak: Visually contextualized utterances	PH Seo, A Nagrani, C Schmid	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	84	2021
Learning for single-shot confidence calibration in deep neural networks through stochastic inferences	S Seo, PH Seo, B Han	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	84	2019
Progressive attention networks for visual attribute prediction	PH Seo, Z Lin, S Cohen, X Shen, B Han	arXiv preprint arXiv:1606.02393, 2016	81*	2016
CPlaNet: Enhancing image geolocalization by combinatorial partitioning of maps	PH Seo, T Weyand, J Sim, B Han	Proceedings of the European Conference on Computer Vision (ECCV), 536-551, 2018	78	2018
Zero-shot referring image segmentation with global-local context features	S Yu, PH Seo, J Son	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	54	2023
Reinforcing an image caption generator using off-line human feedback	PH Seo, P Sharma, T Levinboim, B Han, R Soricut	Proceedings of the AAAI Conference on Artificial Intelligence 34 (03), 2693-2700, 2020	29	2020
Combinatorial inference against label noise	PH Seo, G Kim, B Han	Advances in neural information processing systems 32, 2019	21	2019
IFSeg: Image-free semantic segmentation via vision-language model	S Yun, SH Park, PH Seo, J Shin	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	16	2023
Method and apparatus for correcting speech recognition error	GB Lee, JH Choi, IJ Lee, DH Lee, HS Seo, YH Kim, SH Ryu, SJ Koo	US Patent 9,318,102, 2016	15	2016
Conversational knowledge teaching agent that uses a knowledge base	K Lee, H Seo, J Choi, S Koo, GG Lee	Proceedings of the 16th Annual Meeting of the Special Interest Group on …, 2015	14	2015
Learning correlation structures for vision transformers	M Kim, PH Seo, C Schmid, M Cho	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	12	2024
AVATAR: Unconstrained audiovisual speech recognition	V Gabeur, PH Seo, A Nagrani, C Sun, K Alahari, C Schmid	arXiv preprint arXiv:2206.07684, 2022	12	2022
