Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images

There are currently different efforts to use Supervised Neural Networks (NN) to automatically label damages on images of above ground infrastructure (buildings made of concrete) taken after an earthquake. The goal of the supervised NN is to classify raw input data according to the patterns learned f...

Full description

Bibliographic Details
Main Author: Mulligan, Kyle John
Format: Others
Published: DigitalCommons@CalPoly 2018
Subjects:
Online Access:https://digitalcommons.calpoly.edu/theses/1934
https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=3213&context=theses
id ndltd-CALPOLY-oai-digitalcommons.calpoly.edu-theses-3213
record_format oai_dc
spelling ndltd-CALPOLY-oai-digitalcommons.calpoly.edu-theses-32132021-09-14T05:01:41Z Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images Mulligan, Kyle John There are currently different efforts to use Supervised Neural Networks (NN) to automatically label damages on images of above ground infrastructure (buildings made of concrete) taken after an earthquake. The goal of the supervised NN is to classify raw input data according to the patterns learned from an input training set. This input training data set is usually supplied by experts in the field, and in the case of this project, structural engineers carefully and mostly manually label these images for different types of damage. The level of expertise of the professionals labeling the training set varies widely, and some data sets contain pictures that different people have labeled in different ways when in reality the label should have been the same. Therefore, we need to get several experts to evaluate the same data set; the bigger the ground truth/training set the more accurate the NN classifier will be. To evaluate these variations among experts, which can be considered equal to the task of evaluating the quality of the expert, using probabilistic theory we first need to implement a tool able to compare different images classified by different experts and apply a certainty level to the experts tagged labels. This master's thesis implements this comparative tool. We also decided to implement the comparative tool using parallel programming paradigms since we foresee that it will be used to train multiple young engineering students/professionals or even novice citizen volunteers (“trainees”) during after-earthquake meetings and workshops. The implementation of this software tool involves selecting around 200 photographs tagged by an expert with proven accuracy (“ground truth”) and comparing them to files tagged by the trainees. The trainees are then provided with instantaneous feedback on the accuracy of their damage assessment. The aforementioned problem of evaluating trainee results against the expert is not as simple as comparing and finding differences between two sets of image files. We anticipate challenges in that each trainee will select a slightly different sized area for the same occurrence of damage, and some damage-structure pairs are more difficult to recognize and tag. Results show that we can compare 500 files in 1.5 seconds which is an improvement of 2x faster compared to sequential implementation. 2018-06-01T07:00:00Z text application/pdf https://digitalcommons.calpoly.edu/theses/1934 https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=3213&context=theses Master's Theses DigitalCommons@CalPoly Jaccard acceleration parallel earthquake image tagging Other Computer Sciences
collection NDLTD
format Others
sources NDLTD
topic Jaccard
acceleration
parallel
earthquake
image tagging
Other Computer Sciences
spellingShingle Jaccard
acceleration
parallel
earthquake
image tagging
Other Computer Sciences
Mulligan, Kyle John
Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
description There are currently different efforts to use Supervised Neural Networks (NN) to automatically label damages on images of above ground infrastructure (buildings made of concrete) taken after an earthquake. The goal of the supervised NN is to classify raw input data according to the patterns learned from an input training set. This input training data set is usually supplied by experts in the field, and in the case of this project, structural engineers carefully and mostly manually label these images for different types of damage. The level of expertise of the professionals labeling the training set varies widely, and some data sets contain pictures that different people have labeled in different ways when in reality the label should have been the same. Therefore, we need to get several experts to evaluate the same data set; the bigger the ground truth/training set the more accurate the NN classifier will be. To evaluate these variations among experts, which can be considered equal to the task of evaluating the quality of the expert, using probabilistic theory we first need to implement a tool able to compare different images classified by different experts and apply a certainty level to the experts tagged labels. This master's thesis implements this comparative tool. We also decided to implement the comparative tool using parallel programming paradigms since we foresee that it will be used to train multiple young engineering students/professionals or even novice citizen volunteers (“trainees”) during after-earthquake meetings and workshops. The implementation of this software tool involves selecting around 200 photographs tagged by an expert with proven accuracy (“ground truth”) and comparing them to files tagged by the trainees. The trainees are then provided with instantaneous feedback on the accuracy of their damage assessment. The aforementioned problem of evaluating trainee results against the expert is not as simple as comparing and finding differences between two sets of image files. We anticipate challenges in that each trainee will select a slightly different sized area for the same occurrence of damage, and some damage-structure pairs are more difficult to recognize and tag. Results show that we can compare 500 files in 1.5 seconds which is an improvement of 2x faster compared to sequential implementation.
author Mulligan, Kyle John
author_facet Mulligan, Kyle John
author_sort Mulligan, Kyle John
title Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_short Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_full Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_fullStr Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_full_unstemmed Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_sort acceleration of jaccard’s index algorithm for training to tag damage on post-earthquake images
publisher DigitalCommons@CalPoly
publishDate 2018
url https://digitalcommons.calpoly.edu/theses/1934
https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=3213&context=theses
work_keys_str_mv AT mulligankylejohn accelerationofjaccardsindexalgorithmfortrainingtotagdamageonpostearthquakeimages
_version_ 1719480512224952320