U-kan makes strong backbone for medical image segmentation and generation
U-Net has become a cornerstone in various visual applications such as image segmentation
and diffusion probability models. While numerous innovative designs and improvements …
and diffusion probability models. While numerous innovative designs and improvements …
Gtp-4o: Modality-prompted heterogeneous graph learning for omni-modal biomedical representation
Recent advances in learning multi-modal representation have witnessed the success in
biomedical domains. While established techniques enable handling multi-modal …
biomedical domains. While established techniques enable handling multi-modal …
Lgs: A light-weight 4d gaussian splatting for efficient surgical scene reconstruction
The advent of 3D Gaussian Splatting (3D-GS) techniques and their dynamic scene modeling
variants, 4D-GS, offers promising prospects for real-time rendering of dynamic surgical …
variants, 4D-GS, offers promising prospects for real-time rendering of dynamic surgical …
EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting
Abstract 3D reconstruction of biological tissues from a collection of endoscopic images is a
key to unlock various important downstream surgical applications with 3D capabilities …
key to unlock various important downstream surgical applications with 3D capabilities …
Gaussianstego: A generalizable stenography pipeline for generative 3d gaussians splatting
Recent advancements in large generative models and real-time neural rendering using
point-based techniques pave the way for a future of widespread visual data distribution …
point-based techniques pave the way for a future of widespread visual data distribution …
Diffrect: Latent diffusion label rectification for semi-supervised medical image segmentation
Semi-supervised medical image segmentation aims to leverage limited annotated data and
rich unlabeled data to perform accurate segmentation. However, existing semi-supervised …
rich unlabeled data to perform accurate segmentation. However, existing semi-supervised …
CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
Open-vocabulary object detection (OVD) utilizes image-level cues to expand the linguistic
space of region proposals, thereby facilitating the detection of diverse novel classes. Recent …
space of region proposals, thereby facilitating the detection of diverse novel classes. Recent …
Bora: Biomedical generalist video generation model
Generative models hold promise for revolutionizing medical education, robot-assisted
surgery, and data augmentation for medical AI development. Diffusion models can now …
surgery, and data augmentation for medical AI development. Diffusion models can now …
Surgen: Text-guided diffusion model for surgical video generation
Diffusion-based video generation models have made significant strides, producing outputs
with improved visual fidelity, temporal coherence, and user control. These advancements …
with improved visual fidelity, temporal coherence, and user control. These advancements …
P2SAM: Probabilistically Prompted SAMs Are Efficient Segmentator for Ambiguous Medical Images
Generating diverse plausible outputs from a single input is crucial for addressing visual
ambiguities, exemplified in medical imaging where experts may provide varying semantic …
ambiguities, exemplified in medical imaging where experts may provide varying semantic …