Surgical workflow recognition using two-stream mixed convolution network

Yuan Ding, Jingfan Fan*, Kun Pang, Heng Li, Tianyu Fu, Hong Song, Lingfeng Chen, Jian Yang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Surgical workflow recognition is the prerequisite for automatic indexing of surgical video databases and optimization of real-time operating scheduling, which is an important part of the modern operating room (OR). In this paper, we propose a surgical phase recognition method based on a two-stream mixed convolutional network (TsMCNet) to automatically recognize surgical workflow. TsMCNet optimizes the visual and temporal features learned from surgical videos by integrating 2D and 3D convolutional networks (CNNs) to form a spatio-temporal complementary architecture. Specifically, temporal branch (3D CNN) is responsible for learning the spatio-temporal features among adjacent frames, whereas the parallel visual branch (2D CNN) is focused on capturing the deep visual features of each frame. Extensive experiments on a public surgical video dataset (MICCAI 2016 Workflow Challenge) demonstrated outstanding performance of our proposed method, exceeding that of state-of-the-art methods (e.g., 86.2% accuracy and 83.0% F1 score).

Original languageEnglish
Title of host publicationProceedings - 2020 3rd International Conference on Advanced Electronic Materials, Computers and Software Engineering, AEMCSE 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages264-269
Number of pages6
ISBN (Electronic)9781728181431
DOIs
Publication statusPublished - Apr 2020
Event3rd International Conference on Advanced Electronic Materials, Computers and Software Engineering, AEMCSE 2020 - Shenzhen, China
Duration: 24 Apr 202026 Apr 2020

Publication series

NameProceedings - 2020 3rd International Conference on Advanced Electronic Materials, Computers and Software Engineering, AEMCSE 2020

Conference

Conference3rd International Conference on Advanced Electronic Materials, Computers and Software Engineering, AEMCSE 2020
Country/TerritoryChina
CityShenzhen
Period24/04/2026/04/20

Keywords

  • Convolutional neural network
  • Spatio-temporal features
  • Surgical video analysis
  • Temporal information

Fingerprint

Dive into the research topics of 'Surgical workflow recognition using two-stream mixed convolution network'. Together they form a unique fingerprint.

Cite this