Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network
To enhance surgeons' efficiency and safety of patients, minimally invasive surgery (MIS) is widely used in a variety of clinical surgeries. Real-time surgical tool detection plays an important role in MIS. However, most methods of surgical tool detection may not achieve a good trade-off between...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9301279/ |
id |
doaj-dc8c76ddc94b4861b8f7b1abd597d5f7 |
---|---|
record_format |
Article |
spelling |
doaj-dc8c76ddc94b4861b8f7b1abd597d5f72021-03-30T04:27:32ZengIEEEIEEE Access2169-35362020-01-01822885322886210.1109/ACCESS.2020.30462589301279Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural NetworkPan Shi0https://orcid.org/0000-0001-8240-4663Zijian Zhao1https://orcid.org/0000-0002-7849-814XSanyuan Hu2Faliang Chang3https://orcid.org/0000-0003-1276-2267School of Control Science and Engineering, Shandong University, Jinan, ChinaSchool of Control Science and Engineering, Shandong University, Jinan, ChinaDepartment of General Surgery, First Affiliated Hospital of Shandong First Medical University, Jinan, ChinaSchool of Control Science and Engineering, Shandong University, Jinan, ChinaTo enhance surgeons' efficiency and safety of patients, minimally invasive surgery (MIS) is widely used in a variety of clinical surgeries. Real-time surgical tool detection plays an important role in MIS. However, most methods of surgical tool detection may not achieve a good trade-off between detection speed and accuracy. We propose a real-time attention-guided convolutional neural network (CNN) for frame-by-frame detection of surgical tools in MIS videos, which comprises a coarse (CDM) and a refined (RDM) detection modules. The CDM is used to coarsely regress the parameters of locations to get the refined anchors and perform binary classification, which determines whether the anchor is a tool or background. The RDM subtly incorporates the attention mechanism to generate accurate detection results utilizing the refined anchors from CDM. Finally, a light-head module for more efficient surgical tool detection is proposed. The proposed method is compared to eight state-of-the-art detection algorithms using two public (EndoVis Challenge and ATLAS Dione) datasets and a new dataset we introduced (Cholec80-locations), which extends the Cholec80 dataset with spatial annotations of surgical tools. Our approach runs in real-time at 55.5 FPS and achieves 100, 94.05, and 91.65% mAP for the above three datasets, respectively. Our method achieves accurate, fast, and robust detection results by end-to-end training in MIS videos. The results demonstrate the effectiveness and superiority of our method over the eight state-of-the-art methods.https://ieeexplore.ieee.org/document/9301279/Attention mechanismconvolutional neural networklight-head modulereal-timesurgical tool detection |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Pan Shi Zijian Zhao Sanyuan Hu Faliang Chang |
spellingShingle |
Pan Shi Zijian Zhao Sanyuan Hu Faliang Chang Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network IEEE Access Attention mechanism convolutional neural network light-head module real-time surgical tool detection |
author_facet |
Pan Shi Zijian Zhao Sanyuan Hu Faliang Chang |
author_sort |
Pan Shi |
title |
Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network |
title_short |
Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network |
title_full |
Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network |
title_fullStr |
Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network |
title_full_unstemmed |
Real-Time Surgical Tool Detection in Minimally Invasive Surgery Based on Attention-Guided Convolutional Neural Network |
title_sort |
real-time surgical tool detection in minimally invasive surgery based on attention-guided convolutional neural network |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
To enhance surgeons' efficiency and safety of patients, minimally invasive surgery (MIS) is widely used in a variety of clinical surgeries. Real-time surgical tool detection plays an important role in MIS. However, most methods of surgical tool detection may not achieve a good trade-off between detection speed and accuracy. We propose a real-time attention-guided convolutional neural network (CNN) for frame-by-frame detection of surgical tools in MIS videos, which comprises a coarse (CDM) and a refined (RDM) detection modules. The CDM is used to coarsely regress the parameters of locations to get the refined anchors and perform binary classification, which determines whether the anchor is a tool or background. The RDM subtly incorporates the attention mechanism to generate accurate detection results utilizing the refined anchors from CDM. Finally, a light-head module for more efficient surgical tool detection is proposed. The proposed method is compared to eight state-of-the-art detection algorithms using two public (EndoVis Challenge and ATLAS Dione) datasets and a new dataset we introduced (Cholec80-locations), which extends the Cholec80 dataset with spatial annotations of surgical tools. Our approach runs in real-time at 55.5 FPS and achieves 100, 94.05, and 91.65% mAP for the above three datasets, respectively. Our method achieves accurate, fast, and robust detection results by end-to-end training in MIS videos. The results demonstrate the effectiveness and superiority of our method over the eight state-of-the-art methods. |
topic |
Attention mechanism convolutional neural network light-head module real-time surgical tool detection |
url |
https://ieeexplore.ieee.org/document/9301279/ |
work_keys_str_mv |
AT panshi realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork AT zijianzhao realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork AT sanyuanhu realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork AT faliangchang realtimesurgicaltooldetectioninminimallyinvasivesurgerybasedonattentionguidedconvolutionalneuralnetwork |
_version_ |
1724181816246534144 |