LOOK-M: Look-once optimization in KV cache for efficient multimodal long-context inference

Z Wan, Z Wu, C Liu, J Huang, Z Zhu, P Jin… - arXiv preprint arXiv …, 2024 - arxiv.org
Long-context Multimodal Large Language Models (MLLMs) demand substantial
computational resources for inference as the growth of their multimodal Key-Value (KV) …

IMITATE: Clinical prior guided hierarchical vision-language pre-training

C Liu, S Cheng, M Shi, A Shah, W Bai… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In medical Vision-Language Pre-training (VLP), significant work focuses on extracting text
and image features from clinical reports and medical images. Yet, existing methods may …

Foundation Models in Electrocardiogram: A Review

Y Han, X Liu, X Zhang, C Ding - arXiv preprint arXiv:2410.19877, 2024 - arxiv.org
The electrocardiogram (ECG) is ubiquitous across various healthcare domains, such as
cardiac arrhythmia detection and sleep monitoring, making ECG analysis critically essential …

D2O: Dynamic discriminative operations for efficient generative inference of large language models

Z Wan, X Wu, Y Zhang, Y Xin, C Tao, Z Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Efficient inference in Large Language Models (LLMs) is impeded by the growing memory
demands of key-value (KV) caching, especially for longer sequences. Traditional KV cache …

MedTsLLM: Leveraging LLMs for multimodal medical time series analysis

N Chan, F Parker, W Bennett, T Wu, MY Jia… - arXiv preprint arXiv …, 2024 - arxiv.org
The complexity and heterogeneity of data in many real-world applications pose significant
challenges for traditional machine learning and signal processing techniques. For instance …

CCAM: Cross-channel association mining for ubiquitous sleep staging

S Ma, Y Zhang, Y Liu, Y Chen, W Yang… - IEEE Internet of …, 2024 - ieeexplore.ieee.org
Accurate sleep staging is crucial for wearable sensor-based sleep monitoring and health
interventions. Polysomnography (PSG) signals, rich in information from multiple …

Benchmarking and boosting radiology report generation for 3D high-resolution medical images

C Liu, Z Wan, Y Wang, H Shen, H Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Automatic radiology report generation can significantly benefit the labor-intensive process of
report writing by radiologists, especially for 3D radiographs like CT scans, which are crucial …

Beyond model adaptation at test time: A survey

Z Xiao, CGM Snoek - arXiv preprint arXiv:2411.03687, 2024 - arxiv.org
Machine learning algorithms have achieved remarkable success across various disciplines,
use cases and applications, under the prevailing assumption that training and test samples …

Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model

J Jin, H Wang, H Li, J Li, J Pan, S Hong - arXiv preprint arXiv:2502.10707, 2025 - arxiv.org
Electrocardiogram (ECG) is essential for the clinical diagnosis of arrhythmias and other
heart diseases, but deep learning methods based on ECG often face limitations due to the …

MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization

K Zhu, P Xia, Y Li, H Zhu, S Wang, H Yao - arXiv preprint arXiv …, 2024 - arxiv.org
The advancement of Large Vision-Language Models (LVLMs) has propelled their
application in the medical field. However, Medical LVLMs (Med-LVLMs) encounter factuality …