Foundation Models Defining a New Era in Vision: a Survey and Outlook

M Awais, M Naseer, S Khan, RM Anwer… - … on Pattern Analysis …, 2025‏ - ieeexplore.ieee.org
Vision systems that see and reason about the compositional nature of visual scenes are
fundamental to understanding our world. The complex relations between objects and their …

Foundational models in medical imaging: A comprehensive survey and future vision

B Azad, R Azad, S Eskandari, A Bozorgpour… - arxiv preprint arxiv …, 2023‏ - arxiv.org
Foundation models, large-scale, pre-trained deep-learning models adapted to a wide range
of downstream tasks have gained significant interest lately in various deep-learning …

Multimodal foundation models: From specialists to general-purpose assistants

C Li, Z Gan, Z Yang, J Yang, L Li… - … and Trends® in …, 2024‏ - nowpublishers.com
Neural compression is the application of neural networks and other machine learning
methods to data compression. Recent advances in statistical machine learning have opened …

[HTML][HTML] The segment anything model (sam) for remote sensing applications: From zero to one shot

LP Osco, Q Wu, EL De Lemos, WN Gonçalves… - International Journal of …, 2023‏ - Elsevier
Segmentation is an essential step for remote sensing image processing. This study aims to
advance the application of the Segment Anything Model (SAM), an innovative image …

Remoteclip: A vision language foundation model for remote sensing

F Liu, D Chen, Z Guan, X Zhou, J Zhu… - … on Geoscience and …, 2024‏ - ieeexplore.ieee.org
General-purpose foundation models have led to recent breakthroughs in artificial
intelligence (AI). In remote sensing, self-supervised learning (SSL) and masked image …

Graphadapter: Tuning vision-language models with dual knowledge graph

X Li, D Lian, Z Lu, J Bai, Z Chen… - Advances in Neural …, 2023‏ - proceedings.neurips.cc
Adapter-style efficient transfer learning (ETL) has shown excellent performance in the tuning
of vision-language models (VLMs) under the low-data regime, where only a few additional …

Vision language models in autonomous driving: A survey and outlook

X Zhou, M Liu, E Yurtsever, BL Zagar… - IEEE Transactions …, 2024‏ - ieeexplore.ieee.org
The applications of Vision-Language Models (VLMs) in the field of Autonomous Driving (AD)
have attracted widespread attention due to their outstanding performance and the ability to …

Position: What can large language models tell us about time series analysis

M **, Y Zhang, W Chen… - 41st …, 2024‏ - research-repository.griffith.edu.au
Time series analysis is essential for comprehending the complexities inherent in various real-
world systems and applications. Although large language models (LLMs) have recently …

On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?

M Zanella, I Ben Ayed - … of the IEEE/CVF Conference on …, 2024‏ - openaccess.thecvf.com
The development of large vision-language models notably CLIP has catalyzed research into
effective adaptation techniques with a particular focus on soft prompt tuning. Conjointly test …

Low-rank few-shot adaptation of vision-language models

M Zanella, I Ben Ayed - … of the IEEE/CVF Conference on …, 2024‏ - openaccess.thecvf.com
Recent progress in the few-shot adaptation of Vision-Language Models (VLMs) has further
pushed their generalization capabilities at the expense of just a few labeled samples within …