U-kan makes strong backbone for medical image segmentation and generation

C Li, X Liu, W Li, C Wang, H Liu, Y Liu, Z Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
U-Net has become a cornerstone in various visual applications such as image segmentation
and diffusion probability models. While numerous innovative designs and improvements …

Endora: Video Generation Models as Endoscopy Simulators

C Li, H Liu, Y Liu, BY Feng, W Li, X Liu, Z Chen… - … Conference on Medical …, 2024 - Springer
Generative models hold promise for revolutionizing medical education, robot-assisted
surgery, and data augmentation for machine learning. Despite progress in generating 2D …

Gtp-4o: Modality-prompted heterogeneous graph learning for omni-modal biomedical representation

C Li, X Liu, C Wang, Y Liu, W Yu, J Shao… - European conference on …, 2024 - Springer
Recent advances in learning multi-modal representation have witnessed the success in
biomedical domains. While established techniques enable handling multi-modal …

EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting

C Li, BY Feng, Y Liu, H Liu, C Wang, W Yu… - … Conference on Medical …, 2024 - Springer
Abstract 3D reconstruction of biological tissues from a collection of endoscopic images is a
key to unlock various important downstream surgical applications with 3D capabilities …

A review of 3d reconstruction techniques for deformable tissues in robotic surgery

M Xu, Z Guo, A Wang, L Bai, H Ren - International Conference on Medical …, 2024 - Springer
As a crucial and intricate task in robotic minimally invasive surgery, reconstructing surgical
scenes using stereo or monocular endoscopic video holds immense potential for clinical …

Deform3dgs: Flexible deformation for fast surgical scene reconstruction with gaussian splatting

S Yang, Q Li, D Shen, B Gong, Q Dou, Y ** - International Conference on …, 2024 - Springer
Tissue deformation poses a key challenge for accurate surgical scene reconstruction.
Despite yielding high reconstruction quality, existing methods suffer from slow rendering …

[HTML][HTML] Cardiovascular Medical Image and Analysis based on 3D Vision: A Comprehensive Survey

Z Wang, R Yi, X Wen, C Zhu, K Xu - Meta-Radiology, 2024 - Elsevier
With the rapid development of 3D vision and computer graphics technology, the way
humans interact with the world has undergone significant transformations. 3D vision-related …

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

W Li, X Liu, J Ma, Y Yuan - European Conference on Computer Vision, 2024 - Springer
Open-vocabulary object detection (OVD) utilizes image-level cues to expand the linguistic
space of region proposals, thereby facilitating the detection of diverse novel classes. Recent …

P2SAM: Probabilistically Prompted SAMs Are Efficient Segmentator for Ambiguous Medical Images

Y Huang, C Li, Z Lin, H Liu, H Xu, Y Liu… - Proceedings of the …, 2024 - dl.acm.org
Generating diverse plausible outputs from a single input is crucial for addressing visual
ambiguities, exemplified in medical imaging where experts may provide varying semantic …

EndoGS: deformable endoscopic tissues reconstruction with gaussian splatting

L Zhu, Z Wang, J Cui, Z **, G Lin, L Yu - International Conference on …, 2024 - Springer
Surgical 3D reconstruction is a critical area of research in robotic surgery, with recent works
adopting variants of dynamic radiance fields to achieve success in 3D reconstruction of …