Hanoona Abdul Rasheed
PhD Computer Vision Student at MBZUAI
Verified email at mbzuai.ac.ae
Title
Cited by
Year
Maple: Multi-modal prompt learning
MU Khattak, H Rasheed, M Maaz, S Khan, FS Khan
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 711; Year: 2023
Video-chatgpt: Towards detailed video understanding via large vision and language models
M Maaz, H Rasheed, S Khan, FS Khan
arXiv preprint arXiv:2306.05424, 2023
Cited by: 589; Year: 2023
UNETR++: delving into efficient and accurate 3D medical image segmentation
AM Shaker, M Maaz, H Rasheed, S Khan, MH Yang, FS Khan
IEEE Transactions on Medical Imaging, 2024
Cited by: 172; Year: 2024
Bridging the gap between object and image-level representations for open-vocabulary detection
H Bangalath, M Maaz, MU Khattak, SH Khan, F Shahbaz Khan
Advances in Neural Information Processing Systems 35, 33781-33794, 2022
Cited by: 171; Year: 2022
Fine-tuned clip models are efficient video learners
H Rasheed, MU Khattak, M Maaz, S Khan, FS Khan
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 165; Year: 2023
Glamm: Pixel grounding large multimodal model
H Rasheed, M Maaz, S Shaji, A Shaker, S Khan, H Cholakkal, RM Anwer, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 164; Year: 2024
Class-agnostic object detection with multi-modal transformer
M Maaz, H Rasheed, S Khan, FS Khan, RM Anwer, MH Yang
European conference on computer vision, 512-531, 2022
Cited by: 119*; Year: 2022
Swiftformer: Efficient additive attention for transformer-based real-time mobile vision applications
A Shaker, M Maaz, H Rasheed, S Khan, MH Yang, FS Khan
Proceedings of the IEEE/CVF international conference on computer vision …, 2023
Cited by: 111; Year: 2023
Videogpt+: Integrating image and video encoders for enhanced video understanding
M Maaz, H Rasheed, S Khan, F Khan
arXiv preprint arXiv:2406.09418, 2024
Cited by: 28; Year: 2024
Pg-video-llava: Pixel grounding large video-language models
S Munasinghe, R Thushara, M Maaz, HA Rasheed, S Khan, M Shah, ...
arXiv preprint arXiv:2311.13435, 2023
Cited by: 28; Year: 2023
Palo: A polyglot large multimodal model for 5b people
M Maaz, H Rasheed, A Shaker, S Khan, H Cholakkal, RM Anwer, ...
arXiv preprint arXiv:2402.14818, 2024
Cited by: 10; Year: 2024
Self-supervised learning for fine-grained visual categorization
M Maaz, HA Rasheed, D Gaddam
arXiv preprint arXiv:2105.08788, 2021
Cited by: 3; Year: 2021
System and method for 3d medical image segmentation
A Shaker, M Muhammad, H Rasheed, S Khan, FS Khan
US Patent App. 18/307,058, 2024
Year: 2024
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
AS Youssief, M Maaz, H Rasheed, S Khan, MH Yang, FS Khan
Year: 2023
A System For Analyzing Milk Composition By a Reflection Probe
T Francis, M Shankara, H Rasheed, D Joy
IN Patent 19/2,021, 2021
Year: 2021
UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
ASM Maaz, H Rasheed, S Khan, MH Yang, FS Khan