Error detection in egocentric procedural task videos
We present a new egocentric procedural error dataset containing videos with various types
of errors as well as normal videos and propose a new framework for procedural error …
of errors as well as normal videos and propose a new framework for procedural error …
Fact: Frame-action cross-attention temporal modeling for efficient action segmentation
We study supervised action segmentation whose goal is to predict framewise action labels
of a video. To capture temporal dependencies over long horizons prior works either improve …
of a video. To capture temporal dependencies over long horizons prior works either improve …
Progress-aware online action segmentation for egocentric procedural task videos
We address the problem of online action segmentation for egocentric procedural task
videos. While previous studies have mostly focused on offline action segmentation where …
videos. While previous studies have mostly focused on offline action segmentation where …
Deep learning for surgical workflow analysis: a survey of progresses, limitations, and trends
Y Li, Z Zhao, R Li, F Li - Artificial Intelligence Review, 2024 - Springer
Automatic surgical workflow analysis, which aims to recognize the ongoing surgical events
in videos, is fundamental for develo** context-aware computer-assisted systems. This …
in videos, is fundamental for develo** context-aware computer-assisted systems. This …
Ophclip: Hierarchical retrieval-augmented learning for ophthalmic surgical video-language pretraining
Surgical practice involves complex visual interpretation, procedural skills, and advanced
medical knowledge, making surgical vision-language pretraining (VLP) particularly …
medical knowledge, making surgical vision-language pretraining (VLP) particularly …
Surgformer: Surgical transformer with hierarchical temporal attention for surgical phase recognition
Existing state-of-the-art methods for surgical phase recognition either rely on the extraction
of spatial-temporal features at a short-range temporal resolution or adopt the sequential …
of spatial-temporal features at a short-range temporal resolution or adopt the sequential …
AI solutions for overcoming delays in telesurgery and telementoring to enhance surgical practice and education
Artificial intelligence (AI) has emerged as a transformative tool in surgery, particularly in
telesurgery and telementoring. However, its potential to enhance data transmission …
telesurgery and telementoring. However, its potential to enhance data transmission …
Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Resections with Pringle Maneuver
Pringle maneuver (PM) in laparoscopic liver resection aims to reduce blood loss and
provide a clear surgical view by intermittently blocking blood inflow of the liver, whereas …
provide a clear surgical view by intermittently blocking blood inflow of the liver, whereas …
Tunes: A temporal u-net with self-attention for video-based surgical phase recognition
Objective: To enable context-aware computer assistance in the operating room of the future,
cognitive systems need to understand automatically which surgical phase is being …
cognitive systems need to understand automatically which surgical phase is being …
Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin-based Scene Representation
Purpose: Surgical phase recognition (SPR) is an integral component of surgical data
science, enabling high-level surgical analysis. End-to-end trained neural networks that …
science, enabling high-level surgical analysis. End-to-end trained neural networks that …