U-kan makes strong backbone for medical image segmentation and generation
U-Net has become a cornerstone in various visual applications such as image segmentation
and diffusion probability models. While numerous innovative designs and improvements …
and diffusion probability models. While numerous innovative designs and improvements …
Endora: Video Generation Models as Endoscopy Simulators
Generative models hold promise for revolutionizing medical education, robot-assisted
surgery, and data augmentation for machine learning. Despite progress in generating 2D …
surgery, and data augmentation for machine learning. Despite progress in generating 2D …
Gtp-4o: Modality-prompted heterogeneous graph learning for omni-modal biomedical representation
Recent advances in learning multi-modal representation have witnessed the success in
biomedical domains. While established techniques enable handling multi-modal …
biomedical domains. While established techniques enable handling multi-modal …
EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting
Abstract 3D reconstruction of biological tissues from a collection of endoscopic images is a
key to unlock various important downstream surgical applications with 3D capabilities …
key to unlock various important downstream surgical applications with 3D capabilities …
A review of 3d reconstruction techniques for deformable tissues in robotic surgery
As a crucial and intricate task in robotic minimally invasive surgery, reconstructing surgical
scenes using stereo or monocular endoscopic video holds immense potential for clinical …
scenes using stereo or monocular endoscopic video holds immense potential for clinical …
Deform3dgs: Flexible deformation for fast surgical scene reconstruction with gaussian splatting
Tissue deformation poses a key challenge for accurate surgical scene reconstruction.
Despite yielding high reconstruction quality, existing methods suffer from slow rendering …
Despite yielding high reconstruction quality, existing methods suffer from slow rendering …
[HTML][HTML] Cardiovascular Medical Image and Analysis based on 3D Vision: A Comprehensive Survey
With the rapid development of 3D vision and computer graphics technology, the way
humans interact with the world has undergone significant transformations. 3D vision-related …
humans interact with the world has undergone significant transformations. 3D vision-related …
CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
Open-vocabulary object detection (OVD) utilizes image-level cues to expand the linguistic
space of region proposals, thereby facilitating the detection of diverse novel classes. Recent …
space of region proposals, thereby facilitating the detection of diverse novel classes. Recent …
P2SAM: Probabilistically Prompted SAMs Are Efficient Segmentator for Ambiguous Medical Images
Generating diverse plausible outputs from a single input is crucial for addressing visual
ambiguities, exemplified in medical imaging where experts may provide varying semantic …
ambiguities, exemplified in medical imaging where experts may provide varying semantic …
EndoGS: deformable endoscopic tissues reconstruction with gaussian splatting
Surgical 3D reconstruction is a critical area of research in robotic surgery, with recent works
adopting variants of dynamic radiance fields to achieve success in 3D reconstruction of …
adopting variants of dynamic radiance fields to achieve success in 3D reconstruction of …