Deep learning in surgical workflow analysis: a review of phase and step recognition

KC Demir, H Schieber, T Weise, D Roth… - IEEE Journal of …, 2023‏ - ieeexplore.ieee.org
Objective: In the last two decades, there has been a growing interest in exploring surgical
procedures with statistical models to analyze operations at different semantic levels. This …

A vision transformer for decoding surgeon activity from surgical videos

D Kiyasseh, R Ma, TF Haque, BJ Miles… - Nature biomedical …, 2023‏ - nature.com
The intraoperative activity of a surgeon has substantial impact on postoperative outcomes.
However, for most surgical procedures, the details of intraoperative surgical actions, which …

Rendezvous: Attention mechanisms for the recognition of surgical action triplets in endoscopic videos

CI Nwoye, T Yu, C Gonzalez, B Seeliger… - Medical Image …, 2022‏ - Elsevier
Out of all existing frameworks for surgical workflow analysis in endoscopic videos, action
triplet recognition stands out as the only one aiming to provide truly fine-grained and …

Ophnet: A large-scale video benchmark for ophthalmic surgical workflow understanding

M Hu, P **a, L Wang, S Yan, F Tang, Z Xu… - … on Computer Vision, 2024‏ - Springer
Surgical scene perception via videos is critical for advancing robotic surgery, telesurgery,
and AI-assisted surgery, particularly in ophthalmology. However, the scarcity of diverse and …

[HTML][HTML] Preliminary Evidence of the Use of Generative AI in Health Care Clinical Services: Systematic Narrative Review

D Yim, J Khuntia, V Parameswaran… - JMIR Medical …, 2024‏ - medinform.jmir.org
Background: Generative artificial intelligence tools and applications (GenAI) are being
increasingly used in health care. Physicians, specialists, and other providers have started …

Cholectriplet2021: A benchmark challenge for surgical action triplet recognition

CI Nwoye, D Alapatt, T Yu, A Vardazaryan, F **a… - Medical Image …, 2023‏ - Elsevier
Context-aware decision support in the operating room can foster surgical safety and
efficiency by leveraging real-time feedback from surgical workflow analysis. Most existing …

Ophclip: Hierarchical retrieval-augmented learning for ophthalmic surgical video-language pretraining

M Hu, K Yuan, Y Shen, F Tang, X Xu, L Zhou… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Surgical practice involves complex visual interpretation, procedural skills, and advanced
medical knowledge, making surgical vision-language pretraining (VLP) particularly …

CholecTriplet2022: Show me a tool and tell me the triplet—An endoscopic vision challenge for surgical action triplet detection

CI Nwoye, T Yu, S Sharma, A Murali, D Alapatt… - Medical Image …, 2023‏ - Elsevier
Formalizing surgical activities as triplets of the used instruments, actions performed, and
target anatomies is becoming a gold standard approach for surgical activity modeling. The …

Dissecting self-supervised learning methods for surgical computer vision

S Ramesh, V Srivastav, D Alapatt, T Yu, A Murali… - Medical Image …, 2023‏ - Elsevier
The field of surgical computer vision has undergone considerable breakthroughs in recent
years with the rising popularity of deep neural network-based methods. However, standard …

RFMiD: Retinal Image Analysis for multi-Disease Detection Challenge

S Pachade, P Porwal, M Kokare, G Deshmukh… - Medical Image …, 2025‏ - Elsevier
In the last decades, many publicly available large fundus image datasets have been
collected for diabetic retinopathy, glaucoma, and age-related macular degeneration, and a …