Artificial intelligence in liver cancer—New tools for research and patient management

J Calderaro, L Žigutytė, D Truhn, A Jaffe… - Nature Reviews …, 2024 - nature.com
Liver cancer has high incidence and mortality globally. Artificial intelligence (AI) has
advanced rapidly, influencing cancer care. AI systems are already approved for clinical use …

[HTML][HTML] Large language models illuminate a progressive pathway to artificial intelligent healthcare assistant

M Yuan, P Bao, J Yuan, Y Shen, Z Chen, Y **e, J Zhao… - Medicine Plus, 2024 - Elsevier
With the rapid development of artificial intelligence, large language models (LLMs) have
shown promising capabilities in mimicking human-level language comprehension and …

A multimodal generative AI copilot for human pathology

MY Lu, B Chen, DFK Williamson, RJ Chen, M Zhao… - Nature, 2024 - nature.com
Computational pathology, has witnessed considerable progress in the development of both
task-specific predictive models and task-agnostic self-supervised vision encoders …

MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models

MS Sepehri, Z Fabian, M Soltanolkotabi… - arxiv preprint arxiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) have tremendous potential to improve the
accuracy, availability, and cost-effectiveness of healthcare by providing automated solutions …

Omnimedvqa: A new large-scale comprehensive evaluation benchmark for medical lvlm

Y Hu, T Li, Q Lu, W Shao, J He… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Large Vision-Language Models (LVLMs) have demonstrated remarkable
capabilities in various multimodal tasks. However their potential in the medical domain …

Can chatgpt detect deepfakes? a study of using multimodal large language models for media forensics

S Jia, R Lyu, K Zhao, Y Chen, Z Yan… - Proceedings of the …, 2024 - openaccess.thecvf.com
DeepFakes which refer to AI-generated media content have become an increasing concern
due to their use as a means for disinformation. Detecting DeepFakes is currently solved with …

Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine

Q **, F Chen, Y Zhou, Z Xu, JM Cheung, R Chen… - npj Digital …, 2024 - nature.com
Recent studies indicate that Generative Pre-trained Transformer 4 with Vision (GPT-4V)
outperforms human physicians in medical challenge tasks. However, these evaluations …

Dense connector for mllms

H Yao, W Wu, T Yang, YX Song… - Advances in …, 2025 - proceedings.neurips.cc
Do we fully leverage the potential of visual encoder in Multimodal Large Language Models
(MLLMs)? The recent outstanding performance of MLLMs in multimodal understanding has …

Scifibench: Benchmarking large multimodal models for scientific figure interpretation

J Roberts, K Han, N Houlsby… - Advances in Neural …, 2025 - proceedings.neurips.cc
Large multimodal models (LMMs) have proven flexible and generalisable across many tasks
and fields. Although they have strong potential to aid scientific research, their capabilities in …

Charting new territories: Exploring the geographic and geospatial capabilities of multimodal llms

J Roberts, T Lüddecke, R Sheikh… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multimodal large language models (MLLMs) have shown remarkable capabilities across a
broad range of tasks but their knowledge and abilities in the geographic and geospatial …