Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

[HTML][HTML] Transformers in medical image analysis

K He, C Gan, Z Li, I Rekik, Z Yin, W Ji, Y Gao, Q Wang… - Intelligent …, 2023 - Elsevier
Transformers have dominated the field of natural language processing and have recently
made an impact in the area of computer vision. In the field of medical image analysis …

A vision transformer for decoding surgeon activity from surgical videos

D Kiyasseh, R Ma, TF Haque, BJ Miles… - Nature biomedical …, 2023 - nature.com
The intraoperative activity of a surgeon has substantial impact on postoperative outcomes.
However, for most surgical procedures, the details of intraoperative surgical actions, which …

Rendezvous: Attention mechanisms for the recognition of surgical action triplets in endoscopic videos

CI Nwoye, T Yu, C Gonzalez, B Seeliger… - Medical Image …, 2022 - Elsevier
Out of all existing frameworks for surgical workflow analysis in endoscopic videos, action
triplet recognition stands out as the only one aiming to provide truly fine-grained and …

Ophnet: A large-scale video benchmark for ophthalmic surgical workflow understanding

M Hu, P **a, L Wang, S Yan, F Tang, Z Xu… - … on Computer Vision, 2024 - Springer
Surgical scene perception via videos is critical for advancing robotic surgery, telesurgery,
and AI-assisted surgery, particularly in ophthalmology. However, the scarcity of diverse and …

Skit: a fast key information video transformer for online surgical phase recognition

Y Liu, J Huo, J Peng, R Sparks… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper introduces SKiT, a fast Key information Transformer for phase recognition of
videos. Unlike previous methods that rely on complex models to capture long-term temporal …

Creating a digital twin of spinal surgery: A proof of concept

J Hein, F Giraud, L Calvet, A Schwarz… - Proceedings of the …, 2024 - openaccess.thecvf.com
Surgery digitalization is the process of creating a virtual replica of real-world surgery also
referred to as a surgical digital twin (SDT). It has significant applications in various fields …

Towards holistic surgical scene understanding

N Valderrama, P Ruiz Puentes, I Hernández… - … conference on medical …, 2022 - Springer
Most benchmarks for studying surgical interventions focus on a specific challenge instead of
leveraging the intrinsic complementarity among different tasks. In this work, we present a …

[HTML][HTML] Lovit: Long video transformer for surgical phase recognition

Y Liu, M Boels, LC Garcia-Peraza-Herrera… - Medical Image …, 2025 - Elsevier
Online surgical phase recognition plays a significant role towards building contextual tools
that could quantify performance and oversee the execution of surgical workflows. Current …

Deep learning in surgical workflow analysis: a review of phase and step recognition

KC Demir, H Schieber, T Weise, D Roth… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Objective: In the last two decades, there has been a growing interest in exploring surgical
procedures with statistical models to analyze operations at different semantic levels. This …