U-KAN makes strong backbone for medical image segmentation and generation

C Li, X Liu, W Li, C Wang, H Liu, Y Liu, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
U-Net has become a cornerstone in various visual applications such as image segmentation
and diffusion probability models. While numerous innovative designs and improvements …

GTP-4o: Modality-prompted heterogeneous graph learning for omni-modal biomedical representation

C Li, X Liu, C Wang, Y Liu, W Yu, J Shao… - European Conference on …, 2024 - Springer
Recent advances in learning multi-modal representation have witnessed the success in
biomedical domains. While established techniques enable handling multi-modal …

LGS: A light-weight 4D Gaussian splatting for efficient surgical scene reconstruction

H Liu, Y Liu, C Li, W Li, Y Yuan - International Conference on Medical …, 2024 - Springer
The advent of 3D Gaussian Splatting (3D-GS) techniques and their dynamic scene modeling
variants, 4D-GS, offers promising prospects for real-time rendering of dynamic surgical …

EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting

C Li, BY Feng, Y Liu, H Liu, C Wang, W Yu… - … Conference on Medical …, 2024 - Springer
3D reconstruction of biological tissues from a collection of endoscopic images is a
key to unlock various important downstream surgical applications with 3D capabilities …

GaussianStego: A generalizable stenography pipeline for generative 3D Gaussians splatting

C Li, H Liu, Z Fan, W Li, Y Liu, P Pan… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in large generative models and real-time neural rendering using
point-based techniques pave the way for a future of widespread visual data distribution …

DiffRect: Latent diffusion label rectification for semi-supervised medical image segmentation

X Liu, W Li, Y Yuan - … Conference on Medical Image Computing and …, 2024 - Springer
Semi-supervised medical image segmentation aims to leverage limited annotated data and
rich unlabeled data to perform accurate segmentation. However, existing semi-supervised …

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

W Li, X Liu, J Ma, Y Yuan - European Conference on Computer Vision, 2024 - Springer
Open-vocabulary object detection (OVD) utilizes image-level cues to expand the linguistic
space of region proposals, thereby facilitating the detection of diverse novel classes. Recent …

Bora: Biomedical generalist video generation model

W Sun, X You, R Zheng, Z Yuan, X Li, L He, Q Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Generative models hold promise for revolutionizing medical education, robot-assisted
surgery, and data augmentation for medical AI development. Diffusion models can now …

SurGen: Text-guided diffusion model for surgical video generation

J Cho, S Schmidgall, C Zakka, M Mathur… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion-based video generation models have made significant strides, producing outputs
with improved visual fidelity, temporal coherence, and user control. These advancements …

P2SAM: Probabilistically Prompted SAMs Are Efficient Segmentator for Ambiguous Medical Images

Y Huang, C Li, Z Lin, H Liu, H Xu, Y Liu… - Proceedings of the …, 2024 - dl.acm.org
Generating diverse plausible outputs from a single input is crucial for addressing visual
ambiguities, exemplified in medical imaging where experts may provide varying semantic …