Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade

Surgical instrument detection in robot-assisted surgery videos is an import vision component for these systems. Most of the current deep learning methods focus on single-tool detection and suffer from low detection speed. To address this, the authors propose a novel frame-by-frame detection method u...

Full description

Bibliographic Details
Main Authors: Zijian Zhao, Tongbiao Cai, Faliang Chang, Xiaolin Cheng
Format: Article
Language:English
Published: Wiley 2019-10-01
Series:Healthcare Technology Letters
Subjects:
cnn
Online Access:https://digital-library.theiet.org/content/journals/10.1049/htl.2019.0064
id doaj-b2b63b4a8bdb4ba594bdc26563a7cded
record_format Article
spelling doaj-b2b63b4a8bdb4ba594bdc26563a7cded2021-04-02T05:35:58ZengWileyHealthcare Technology Letters2053-37132019-10-0110.1049/htl.2019.0064HTL.2019.0064Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascadeZijian Zhao0Tongbiao Cai1Faliang Chang2Xiaolin Cheng3School of Control Science and EngineeringSchool of Control Science and EngineeringSchool of Control Science and EngineeringLaboratory of Laparoscopic Technique and Engineering, Qilu Hospital of Shandong UniversitySurgical instrument detection in robot-assisted surgery videos is an import vision component for these systems. Most of the current deep learning methods focus on single-tool detection and suffer from low detection speed. To address this, the authors propose a novel frame-by-frame detection method using a cascading convolutional neural network (CNN) which consists of two different CNNs for real-time multi-tool detection. An hourglass network and a modified visual geometry group (VGG) network are applied to jointly predict the localisation. The former CNN outputs detection heatmaps representing the location of tool tip areas, and the latter performs bounding-box regression for tool tip areas on these heatmaps stacked with input RGB image frames. The authors’ method is tested on the publicly available EndoVis Challenge dataset and the ATLAS Dione dataset. The experimental results show that their method achieves better performance than mainstream detection methods in terms of detection accuracy and speed.https://digital-library.theiet.org/content/journals/10.1049/htl.2019.0064object detectionmedical image processingimage colour analysismedical roboticsregression analysissurgerylearning (artificial intelligence)convolutional neural netsrobot visionconvolutional neural network cascaderobot-assisted surgery videosvision componentsingle-tool detectioncascading convolutional neural networkcnnreal-time multitool detectionhourglass networkmodified vgg networkdetection heatmapstool tip areasbounding-box regressionauthorsmainstream detection methodsrgb image framesframe-by-frame detection methoddeep learning methodsendovis challenge datasetatlas dione datasetreal-time surgical instrument detectionreal-time multi-tool detection
collection DOAJ
language English
format Article
sources DOAJ
author Zijian Zhao
Tongbiao Cai
Faliang Chang
Xiaolin Cheng
spellingShingle Zijian Zhao
Tongbiao Cai
Faliang Chang
Xiaolin Cheng
Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
Healthcare Technology Letters
object detection
medical image processing
image colour analysis
medical robotics
regression analysis
surgery
learning (artificial intelligence)
convolutional neural nets
robot vision
convolutional neural network cascade
robot-assisted surgery videos
vision component
single-tool detection
cascading convolutional neural network
cnn
real-time multitool detection
hourglass network
modified vgg network
detection heatmaps
tool tip areas
bounding-box regression
authors
mainstream detection methods
rgb image frames
frame-by-frame detection method
deep learning methods
endovis challenge dataset
atlas dione dataset
real-time surgical instrument detection
real-time multi-tool detection
author_facet Zijian Zhao
Tongbiao Cai
Faliang Chang
Xiaolin Cheng
author_sort Zijian Zhao
title Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
title_short Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
title_full Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
title_fullStr Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
title_full_unstemmed Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
title_sort real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
publisher Wiley
series Healthcare Technology Letters
issn 2053-3713
publishDate 2019-10-01
description Surgical instrument detection in robot-assisted surgery videos is an import vision component for these systems. Most of the current deep learning methods focus on single-tool detection and suffer from low detection speed. To address this, the authors propose a novel frame-by-frame detection method using a cascading convolutional neural network (CNN) which consists of two different CNNs for real-time multi-tool detection. An hourglass network and a modified visual geometry group (VGG) network are applied to jointly predict the localisation. The former CNN outputs detection heatmaps representing the location of tool tip areas, and the latter performs bounding-box regression for tool tip areas on these heatmaps stacked with input RGB image frames. The authors’ method is tested on the publicly available EndoVis Challenge dataset and the ATLAS Dione dataset. The experimental results show that their method achieves better performance than mainstream detection methods in terms of detection accuracy and speed.
topic object detection
medical image processing
image colour analysis
medical robotics
regression analysis
surgery
learning (artificial intelligence)
convolutional neural nets
robot vision
convolutional neural network cascade
robot-assisted surgery videos
vision component
single-tool detection
cascading convolutional neural network
cnn
real-time multitool detection
hourglass network
modified vgg network
detection heatmaps
tool tip areas
bounding-box regression
authors
mainstream detection methods
rgb image frames
frame-by-frame detection method
deep learning methods
endovis challenge dataset
atlas dione dataset
real-time surgical instrument detection
real-time multi-tool detection
url https://digital-library.theiet.org/content/journals/10.1049/htl.2019.0064
work_keys_str_mv AT zijianzhao realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade
AT tongbiaocai realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade
AT faliangchang realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade
AT xiaolincheng realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade
_version_ 1724172442026377216