DocFormer: End-to-End Transformer for Document Understanding RM Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 313* | 2021 |
Mixgen: A new multi-modal data augmentation X Hao, Y Zhu, S Appalaraju, A Zhang, W Zhang, B Li, M Li Proceedings of the IEEE/CVF winter conference on applications of computer …, 2023 | 108 | 2023 |
Latr: Layout-aware transformer for scene-text vqa AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 105 | 2022 |
Image similarity using Deep CNN and Curriculum Learning S Appalaraju, V Chaoji GHCI 2017, 2017 | 104 | 2017 |
Scalable logo recognition using proxies I Fehérvári, S Appalaraju 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), 715-725, 2019 | 64 | 2019 |
Saliency Driven Perceptual Image Compression Y Patel, S Appalaraju, R Manmatha Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021 | 53 | 2021 |
Artificial intelligence system for image similarity analysis using optimized image pair selection and multi-scale convolutional neural networks S Appalaraju, VS Chaoji US Patent 10,467,526, 2019 | 47 | 2019 |
Docformerv2: Local features for document understanding S Appalaraju, P Tang, Q Dong, N Sankaran, Y Zhou, R Manmatha Proceedings of the AAAI Conference on Artificial Intelligence 38 (2), 709-718, 2024 | 40 | 2024 |
Deep perceptual compression Y Patel, S Appalaraju, R Manmatha arXiv preprint arXiv:1907.08310, 2019 | 30 | 2019 |
Human perceptual evaluations for image compression Y Patel, S Appalaraju, R Manmatha arXiv preprint arXiv:1908.04187, 2019 | 29 | 2019 |
Unbiased evaluation of deep metric learning algorithms I Fehervari, A Ravichandran, S Appalaraju arXiv preprint arXiv:1911.12528, 2019 | 28 | 2019 |
Towards Good Practices in Self-supervised Representation Learning S Appalaraju, Y Zhu, Y Xie, I Fehérvári Neural Information Processing Systems (NeurIPS Self-Supervision Workshop 2020), 2020 | 20 | 2020 |
Yoro-lightweight end to end visual grounding CH Ho, S Appalaraju, B Jasani, R Manmatha, N Vasconcelos European Conference on Computer Vision, 3-23, 2022 | 19 | 2022 |
Seetek: Very large-scale open-set logo recognition with text-aware metric learning C Li, I Fehérvári, X Zhao, I Macedo, S Appalaraju Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022 | 17 | 2022 |
Hierarchical auto-regressive image compression system S Appalaraju, Y Patel, R Manmatha US Patent 10,965,948, 2021 | 10 | 2021 |
Learned lossy image compression codec S Appalaraju, R Manmatha, Y Patel US Patent 10,909,728, 2021 | 10 | 2021 |
Identifying Software Products to Test S Appalaraju, A Wanjari, V Bhargava US Patent 10,089,661, 2018 | 10 | 2018 |
Latr: Layoutaware transformer for scene-text vqa. 2022 IEEE AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha CVF Conference on Computer Vision and Pattern Recognition (CVPR), 16527-16537, 2021 | 9 | 2021 |
Enhancing vision-language pre-training with rich supervisions Y Gao, K Shi, P Zhu, E Belval, O Nuriel, S Appalaraju, S Ghadar, Z Tu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 8 | 2024 |
Simcon loss with multiple views for text supervised semantic segmentation Y Patel, Y Xie, Y Zhu, S Appalaraju, R Manmatha arXiv preprint arXiv:2302.03432, 2023 | 8 | 2023 |