Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network

To enhance surgeons' efficiency and safety of patients, minimally invasive surgery (MIS) is widely used in a variety of clinical surgeries. Real-time surgical tool detection plays an important role in MIS. However, most methods of surgical tool detection may not achieve a good trade-off between...

Full description

Bibliographic Details
Main Authors: Pan Shi, Zijian Zhao, Sanyuan Hu, Faliang Chang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9301279/
id doaj-dc8c76ddc94b4861b8f7b1abd597d5f7
record_format Article
spelling doaj-dc8c76ddc94b4861b8f7b1abd597d5f72021-03-30T04:27:32ZengIEEEIEEE Access2169-35362020-01-01822885322886210.1109/ACCESS.2020.30462589301279Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural NetworkPan Shi0https://orcid.org/0000-0001-8240-4663Zijian Zhao1https://orcid.org/0000-0002-7849-814XSanyuan Hu2Faliang Chang3https://orcid.org/0000-0003-1276-2267School of Control Science and Engineering, Shandong University, Jinan, ChinaSchool of Control Science and Engineering, Shandong University, Jinan, ChinaDepartment of General Surgery, First Affiliated Hospital of Shandong First Medical University, Jinan, ChinaSchool of Control Science and Engineering, Shandong University, Jinan, ChinaTo enhance surgeons' efficiency and safety of patients, minimally invasive surgery (MIS) is widely used in a variety of clinical surgeries. Real-time surgical tool detection plays an important role in MIS. However, most methods of surgical tool detection may not achieve a good trade-off between detection speed and accuracy. We propose a real-time attention-guided convolutional neural network (CNN) for frame-by-frame detection of surgical tools in MIS videos, which comprises a coarse (CDM) and a refined (RDM) detection modules. The CDM is used to coarsely regress the parameters of locations to get the refined anchors and perform binary classification, which determines whether the anchor is a tool or background. The RDM subtly incorporates the attention mechanism to generate accurate detection results utilizing the refined anchors from CDM. Finally, a light-head module for more efficient surgical tool detection is proposed. The proposed method is compared to eight state-of-the-art detection algorithms using two public (EndoVis Challenge and ATLAS Dione) datasets and a new dataset we introduced (Cholec80-locations), which extends the Cholec80 dataset with spatial annotations of surgical tools. Our approach runs in real-time at 55.5 FPS and achieves 100, 94.05, and 91.65% mAP for the above three datasets, respectively. Our method achieves accurate, fast, and robust detection results by end-to-end training in MIS videos. The results demonstrate the effectiveness and superiority of our method over the eight state-of-the-art methods.https://ieeexplore.ieee.org/document/9301279/Attention mechanismconvolutional neural networklight-head modulereal-timesurgical tool detection
collection DOAJ
language English
format Article
sources DOAJ
author Pan Shi
Zijian Zhao
Sanyuan Hu
Faliang Chang
spellingShingle Pan Shi
Zijian Zhao
Sanyuan Hu
Faliang Chang
Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
IEEE Access
Attention mechanism
convolutional neural network
light-head module
real-time
surgical tool detection
author_facet Pan Shi
Zijian Zhao
Sanyuan Hu
Faliang Chang
author_sort Pan Shi
title Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
title_short Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
title_full Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
title_fullStr Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
title_full_unstemmed Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
title_sort real-time surgical tool detection in minimally invasive surgery based on attention-guided convolutional neural network
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description To enhance surgeons' efficiency and safety of patients, minimally invasive surgery (MIS) is widely used in a variety of clinical surgeries. Real-time surgical tool detection plays an important role in MIS. However, most methods of surgical tool detection may not achieve a good trade-off between detection speed and accuracy. We propose a real-time attention-guided convolutional neural network (CNN) for frame-by-frame detection of surgical tools in MIS videos, which comprises a coarse (CDM) and a refined (RDM) detection modules. The CDM is used to coarsely regress the parameters of locations to get the refined anchors and perform binary classification, which determines whether the anchor is a tool or background. The RDM subtly incorporates the attention mechanism to generate accurate detection results utilizing the refined anchors from CDM. Finally, a light-head module for more efficient surgical tool detection is proposed. The proposed method is compared to eight state-of-the-art detection algorithms using two public (EndoVis Challenge and ATLAS Dione) datasets and a new dataset we introduced (Cholec80-locations), which extends the Cholec80 dataset with spatial annotations of surgical tools. Our approach runs in real-time at 55.5 FPS and achieves 100, 94.05, and 91.65% mAP for the above three datasets, respectively. Our method achieves accurate, fast, and robust detection results by end-to-end training in MIS videos. The results demonstrate the effectiveness and superiority of our method over the eight state-of-the-art methods.
topic Attention mechanism
convolutional neural network
light-head module
real-time
surgical tool detection
url https://ieeexplore.ieee.org/document/9301279/
work_keys_str_mv AT panshi realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork
AT zijianzhao realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork
AT sanyuanhu realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork
AT faliangchang realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork
_version_ 1724181816246534144