Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer Z Lu, S He, X Zhu, L Zhang, YZ Song, T Xiang ICCV 2021, 2021 | 211 | 2021 |
Image Captioning through Image Transformer S He, W Liao, HR Tavakoli, M Yang, B Rosenhahn, N Pugeault Asian Conference on Computer Vision 2020, 2020 | 146 | 2020 |
Diffused heads: Diffusion models beat gans on talking-face generation M Stypułkowski, K Vougioukas, S He, M Zięba, S Petridis, M Pantic Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 132 | 2024 |
Style-Based Global Appearance Flow for Virtual Try-On S He, YZ Song, T Xiang CVPR 2022, 2022 | 127 | 2022 |
Text-Based Person Search with Limited Data X Han, S He, L Zhang, T Xiang BMVC 2021, 2021 | 112 | 2021 |
Hybrid Graph Neural Networks for Few-Shot Learning T Yu, S He, YZ Song, T Xiang AAAI 2022, 2021 | 64 | 2021 |
Human Attention in Image Captioning: Dataset and Analysis S He, HR Tavakoli, A Borji, N Pugeault The IEEE International Conference on Computer Vision, 8529-8538, 2019 | 62* | 2019 |
Context-Aware Layout to Image Generation with Enhanced Object Appearance S He, W Liao, MY Yang, Y Yang, YZ Song, B Rosenhahn, T Xiang CVPR 2021, 2021 | 61 | 2021 |
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Y Cong, M Xu, C Simon, S Chen, J Ren, Y Xie, JM Perez-Rua, ... ICLR 2024, 2023 | 60 | 2023 |
GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation S Chen, M Xu, J Ren, Y Cong, S He, Y Xie, A Sinha, P Luo, T Xiang, ... CVPR 2024, 2023 | 52* | 2023 |
Understanding and visualizing deep visual saliency models S He, HR Tavakoli, A Borji, Y Mi, N Pugeault Proceedings of the ieee conference on computer vision and pattern …, 2019 | 52 | 2019 |
Disentangled Lifespan Face Synthesis S He, W Liao, MY Yang, YZ Song, B Rosenhahn, T Xiang ICCV 2021, 2021 | 35 | 2021 |
Prediction calibration for generalized few-shot semantic segmentation Z Lu, S He, D Li, YZ Song, T Xiang IEEE transactions on image processing 32, 3311-3323, 2023 | 21 | 2023 |
What catches the eye? Visualizing and understanding deep saliency models S He, A Borji, Y Mi, N Pugeault arXiv preprint arXiv:1803.05753, 2018 | 17 | 2018 |
Aggregated sparse attention for steering angle prediction S He, D Kangin, Y Mi, N Pugeault 2018 24th International Conference on Pattern Recognition (ICPR), 2398-2403, 2018 | 8 | 2018 |
Deep saliency: What is learnt by a deep network about saliency? S He, N Pugeault arXiv preprint arXiv:1801.04261, 2018 | 7 | 2018 |
Uigr: Unified interactive garment retrieval X Han, S He, L Zhang, YZ Song, T Xiang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 6 | 2022 |
Mardini: Masked autoregressive diffusion for video generation at scale H Liu, S Liu, Z Zhou, M Xu, Y Xie, X Han, JC Pérez, D Liu, K Kahatapitiya, ... arXiv preprint arXiv:2410.20280, 2024 | 5 | 2024 |
A Spherical Approach to Planar Semantic Segmentation C Zhang, S He, S Liwicki British Machine Vision Conference 2020, 2020 | 5 | 2020 |
Adaptive caching for faster video generation with diffusion transformers K Kahatapitiya, H Liu, S He, D Liu, M Jia, C Zhang, MS Ryoo, T Xie arXiv preprint arXiv:2411.02397, 2024 | 3 | 2024 |