Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade
Surgical instrument detection in robot-assisted surgery videos is an import vision component for these systems. Most of the current deep learning methods focus on single-tool detection and suffer from low detection speed. To address this, the authors propose a novel frame-by-frame detection method u...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2019-10-01
|
Series: | Healthcare Technology Letters |
Subjects: | |
Online Access: | https://digital-library.theiet.org/content/journals/10.1049/htl.2019.0064 |
id |
doaj-b2b63b4a8bdb4ba594bdc26563a7cded |
---|---|
record_format |
Article |
spelling |
doaj-b2b63b4a8bdb4ba594bdc26563a7cded2021-04-02T05:35:58ZengWileyHealthcare Technology Letters2053-37132019-10-0110.1049/htl.2019.0064HTL.2019.0064Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascadeZijian Zhao0Tongbiao Cai1Faliang Chang2Xiaolin Cheng3School of Control Science and EngineeringSchool of Control Science and EngineeringSchool of Control Science and EngineeringLaboratory of Laparoscopic Technique and Engineering, Qilu Hospital of Shandong UniversitySurgical instrument detection in robot-assisted surgery videos is an import vision component for these systems. Most of the current deep learning methods focus on single-tool detection and suffer from low detection speed. To address this, the authors propose a novel frame-by-frame detection method using a cascading convolutional neural network (CNN) which consists of two different CNNs for real-time multi-tool detection. An hourglass network and a modified visual geometry group (VGG) network are applied to jointly predict the localisation. The former CNN outputs detection heatmaps representing the location of tool tip areas, and the latter performs bounding-box regression for tool tip areas on these heatmaps stacked with input RGB image frames. The authors’ method is tested on the publicly available EndoVis Challenge dataset and the ATLAS Dione dataset. The experimental results show that their method achieves better performance than mainstream detection methods in terms of detection accuracy and speed.https://digital-library.theiet.org/content/journals/10.1049/htl.2019.0064object detectionmedical image processingimage colour analysismedical roboticsregression analysissurgerylearning (artificial intelligence)convolutional neural netsrobot visionconvolutional neural network cascaderobot-assisted surgery videosvision componentsingle-tool detectioncascading convolutional neural networkcnnreal-time multitool detectionhourglass networkmodified vgg networkdetection heatmapstool tip areasbounding-box regressionauthorsmainstream detection methodsrgb image framesframe-by-frame detection methoddeep learning methodsendovis challenge datasetatlas dione datasetreal-time surgical instrument detectionreal-time multi-tool detection |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zijian Zhao Tongbiao Cai Faliang Chang Xiaolin Cheng |
spellingShingle |
Zijian Zhao Tongbiao Cai Faliang Chang Xiaolin Cheng Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade Healthcare Technology Letters object detection medical image processing image colour analysis medical robotics regression analysis surgery learning (artificial intelligence) convolutional neural nets robot vision convolutional neural network cascade robot-assisted surgery videos vision component single-tool detection cascading convolutional neural network cnn real-time multitool detection hourglass network modified vgg network detection heatmaps tool tip areas bounding-box regression authors mainstream detection methods rgb image frames frame-by-frame detection method deep learning methods endovis challenge dataset atlas dione dataset real-time surgical instrument detection real-time multi-tool detection |
author_facet |
Zijian Zhao Tongbiao Cai Faliang Chang Xiaolin Cheng |
author_sort |
Zijian Zhao |
title |
Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade |
title_short |
Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade |
title_full |
Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade |
title_fullStr |
Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade |
title_full_unstemmed |
Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade |
title_sort |
real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade |
publisher |
Wiley |
series |
Healthcare Technology Letters |
issn |
2053-3713 |
publishDate |
2019-10-01 |
description |
Surgical instrument detection in robot-assisted surgery videos is an import vision component for these systems. Most of the current deep learning methods focus on single-tool detection and suffer from low detection speed. To address this, the authors propose a novel frame-by-frame detection method using a cascading convolutional neural network (CNN) which consists of two different CNNs for real-time multi-tool detection. An hourglass network and a modified visual geometry group (VGG) network are applied to jointly predict the localisation. The former CNN outputs detection heatmaps representing the location of tool tip areas, and the latter performs bounding-box regression for tool tip areas on these heatmaps stacked with input RGB image frames. The authors’ method is tested on the publicly available EndoVis Challenge dataset and the ATLAS Dione dataset. The experimental results show that their method achieves better performance than mainstream detection methods in terms of detection accuracy and speed. |
topic |
object detection medical image processing image colour analysis medical robotics regression analysis surgery learning (artificial intelligence) convolutional neural nets robot vision convolutional neural network cascade robot-assisted surgery videos vision component single-tool detection cascading convolutional neural network cnn real-time multitool detection hourglass network modified vgg network detection heatmaps tool tip areas bounding-box regression authors mainstream detection methods rgb image frames frame-by-frame detection method deep learning methods endovis challenge dataset atlas dione dataset real-time surgical instrument detection real-time multi-tool detection |
url |
https://digital-library.theiet.org/content/journals/10.1049/htl.2019.0064 |
work_keys_str_mv |
AT zijianzhao realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade AT tongbiaocai realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade AT faliangchang realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade AT xiaolincheng realtimesurgicalinstrumentdetectioninrobotassistedsurgeryusingaconvolutionalneuralnetworkcascade |
_version_ |
1724172442026377216 |