SWAG: Long-term Surgical Workflow Prediction with Generative-based Anticipation

Boels, Maxence; Liu, Yang; Dasgupta, Prokar; Granados, Alejandro; Ourselin, Sebastien

arXiv:2412.18849 (cs)

[Submitted on 25 Dec 2024 (v1), last revised 15 Jun 2025 (this version, v4)]

Abstract: While existing approaches excel at recognising current surgical phases, they provide limited foresight and intraoperative guidance into future procedural steps. Similarly, current anticipation methods are constrained to predicting short-term and single events, neglecting the dense, repetitive, and long sequential nature of surgical workflows. To address these needs and limitations, we propose SWAG (Surgical Workflow Anticipative Generation), a framework that combines phase recognition and anticipation using a generative approach. This paper investigates two distinct decoding methods - single-pass (SP) and auto-regressive (AR) - to generate sequences of future surgical phases at minute intervals over long horizons. We propose a novel embedding approach using class transition probabilities to enhance the accuracy of phase anticipation. Additionally, we propose a generative framework using remaining time regression to classification (R2C). SWAG was evaluated on two publicly available datasets, Cholec80 and AutoLaparo21. Our single-pass model with class transition probability embeddings (SP*) achieves 32.1% and 41.3% F1 scores over 20 and 30 minutes on Cholec80 and AutoLaparo21, respectively. Moreover, our approach competes with existing methods on phase remaining time regression, achieving weighted mean absolute errors of 0.32 and 0.48 minutes for 2- and 3-minute horizons. SWAG demonstrates versatility across generative decoding frameworks and across classification and regression tasks, creating temporal continuity between surgical workflow recognition and anticipation. Our method provides steps towards intraoperative surgical workflow generation for anticipation. Project: this https URL.
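
The abstract mentions class transition probabilities but does not say how they are estimated or embedded. As a rough illustration only, and not the authors' implementation, the sketch below estimates a first-order transition matrix from phase label sequences sampled at fixed intervals. The function name `transition_probabilities`, the toy sequences, and the uniform fallback for unseen phases are all assumptions made for this example.

```python
def transition_probabilities(sequences, num_classes):
    """Estimate a row-normalised first-order transition matrix.

    sequences: list of phase-label sequences (ints in [0, num_classes)),
               assumed here to be sampled at a fixed interval.
    Returns a num_classes x num_classes list of lists where entry [i][j]
    is the empirical probability that phase i is followed by phase j.
    """
    counts = [[0] * num_classes for _ in range(num_classes)]
    for seq in sequences:
        # Count every adjacent (current, next) pair of labels.
        for cur, nxt in zip(seq, seq[1:]):
            counts[cur][nxt] += 1

    probs = []
    for row in counts:
        total = sum(row)
        if total == 0:
            # Assumption: a phase never seen as a source gets a uniform row.
            probs.append([1.0 / num_classes] * num_classes)
        else:
            probs.append([c / total for c in row])
    return probs


# Hypothetical toy data: two short procedures with three phases.
toy_sequences = [[0, 0, 1, 1, 2], [0, 1, 2, 2]]
P = transition_probabilities(toy_sequences, num_classes=3)
```

Each row of `P` is a probability distribution over the next phase. How such rows would be turned into embeddings and combined with visual features is specified in the paper itself, not here.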

Comments:	Accepted at IJCARS, Demo website: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2412.18849 [cs.CV]
	(or arXiv:2412.18849v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.18849

Submission history

From: Maxence Boels
[v1] Wed, 25 Dec 2024 09:29:57 UTC (18,231 KB)
[v2] Thu, 6 Feb 2025 18:54:37 UTC (28,300 KB)
[v3] Mon, 9 Jun 2025 10:02:29 UTC (8,824 KB)
[v4] Sun, 15 Jun 2025 09:17:03 UTC (8,824 KB)
