Artificial general intelligence for medical imaging analysis

X Li, L Zhao, L Zhang, Z Wu, Z Liu… - IEEE Reviews in …, 2024 - ieeexplore.ieee.org
Large-scale Artificial General Intelligence (AGI) models, including Large Language Models
(LLMs) such as ChatGPT/GPT-4, have achieved unprecedented success in a variety of …

Evaluation of OpenAI o1: Opportunities and challenges of AGI

T Zhong, Z Liu, Y Pan, Y Zhang, Y Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
This comprehensive study evaluates the performance of OpenAI's o1-preview large
language model across a diverse array of complex reasoning tasks, spanning multiple …

Towards generalist biomedical AI

T Tu, S Azizi, D Driess, M Schaekermann, M Amin… - NEJM AI, 2024 - ai.nejm.org
Background: Medicine is inherently multimodal, requiring the simultaneous interpretation
and integration of insights between many data modalities spanning text, imaging, genomics …

A systematic survey of prompt engineering on vision-language foundation models

J Gu, Z Han, S Chen, A Beirami, B He, G Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Prompt engineering is a technique that involves augmenting a large pre-trained model with
task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be …

MLLM-as-a-judge: Assessing multimodal LLM-as-a-judge with vision-language benchmark

D Chen, R Chen, S Zhang, Y Liu, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) have gained significant attention recently,
showing remarkable potential in artificial general intelligence. However, assessing the utility …

MetaTool benchmark for large language models: Deciding whether to use tools and which to use

Y Huang, J Shi, Y Li, C Fan, S Wu, Q Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have garnered significant attention due to their impressive
natural language processing (NLP) capabilities. Recently, many studies have focused on …

SAMAug: Point prompt augmentation for Segment Anything Model

H Dai, C Ma, Z Yan, Z Liu, E Shi, Y Li, P Shu… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper introduces SAMAug, a novel visual point augmentation method for the Segment
Anything Model (SAM) that enhances interactive image segmentation performance …

A comprehensive review of multimodal large language models: Performance and challenges across different tasks

J Wang, H Jiang, Y Liu, C Ma, X Zhang, Y Pan… - arXiv preprint arXiv …, 2024 - arxiv.org
In an era defined by the explosive growth of data and rapid technological advancements,
Multimodal Large Language Models (MLLMs) stand at the forefront of artificial intelligence …

Evaluating large language models for radiology natural language processing

Z Liu, T Zhong, Y Li, Y Zhang, Y Pan, Z Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
The rise of large language models (LLMs) has marked a pivotal shift in the field of natural
language processing (NLP). LLMs have revolutionized a multitude of domains, and they …

Has multimodal learning delivered universal intelligence in healthcare? A comprehensive survey

Q Lin, Y Zhu, X Mei, L Huang, J Ma, K He, Z Peng… - Information …, 2024 - Elsevier
The rapid development of artificial intelligence has constantly reshaped the field of
intelligent healthcare and medicine. As a vital technology, multimodal learning has …