[HTML][HTML] Digital twins as a unifying framework for surgical data science: the enabling role of geometric scene understanding

H Ding, L Seenivasan, BD Killeen, SM Cho… - Artificial Intelligence …, 2024 - oaepublish.com
Surgical data science is devoted to enhancing the quality, safety, and efficacy of
interventional healthcare. While the use of powerful machine learning algorithms is …

Rethinking exemplars for continual semantic segmentation in endoscopy scenes: Entropy-based mini-batch pseudo-replay

G Wang, L Bai, Y Wu, T Chen, H Ren - Computers in Biology and Medicine, 2023 - Elsevier
Endoscopy is a widely used technique for the early detection of diseases or robotic-assisted
minimally invasive surgery (RMIS). Numerous deep learning (DL)-based research works …

Gp-vls: A general-purpose vision language model for surgery

S Schmidgall, J Cho, C Zakka, W Hiesinger - arxiv preprint arxiv …, 2024 - arxiv.org
Surgery requires comprehensive medical knowledge, visual assessment skills, and
procedural expertise. While recent surgical AI models have focused on solving task-specific …

Endo-4dgs: Endoscopic monocular scene reconstruction with 4d gaussian splatting

Y Huang, B Cui, L Bai, Z Guo, M Xu, M Islam… - … Conference on Medical …, 2024 - Springer
In the realm of robot-assisted minimally invasive surgery, dynamic scene reconstruction can
significantly enhance downstream tasks and improve surgical outcomes. Neural Radiance …

General surgery vision transformer: A video pre-trained foundation model for general surgery

S Schmidgall, JW Kim, J Jopling, A Krieger - arxiv preprint arxiv …, 2024 - arxiv.org
The absence of openly accessible data and specialized foundation models is a major barrier
for computational research in surgery. Toward this,(i) we open-source the largest dataset of …

Joint sparse representations and coupled dictionary learning in multisource heterogeneous image pseudo-color fusion

L Bai, S Yao, K Gao, Y Huang, R Tang… - IEEE sensors …, 2023 - ieeexplore.ieee.org
Considering that coupled dictionary learning (CDL) method can obtain a reasonable linear
mathematical relationship between resource images, we propose a novel CDL-based …

Artificial intelligence for biomedical video generation

L Li, J Qiu, A Saha, L Li, P Li, M He, Z Guo… - arxiv preprint arxiv …, 2024 - arxiv.org
As a prominent subfield of Artificial Intelligence Generated Content (AIGC), video generation
has achieved notable advancements in recent years. The introduction of Sora-alike models …

Augmented reality navigation systems in endoscopy

R Metzger, P Suppa, Z Li, A Vemuri - Frontiers in Gastroenterology, 2024 - frontiersin.org
Navigation assistance has become part of our daily lives and its implementation in medicine
has been going on for the last 3 decades. Navigation is defined as the determination of a …

Scaling up self-supervised learning for improved surgical foundation models

TJM Jaspers, RLPD de Jong, Y Li, CHJ Kusters… - arxiv preprint arxiv …, 2025 - arxiv.org
Foundation models have revolutionized computer vision by achieving vastly superior
performance across diverse tasks through large-scale pretraining on extensive datasets …

SAF-IS: A spatial annotation free framework for instance segmentation of surgical tools

L Sestini, B Rosa, E De Momi, G Ferrigno… - Medical Image Analysis, 2025 - Elsevier
Instance segmentation of surgical instruments is a long-standing research problem, crucial
for the development of many applications for computer-assisted surgery. This problem is …