Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images

There are currently different efforts to use Supervised Neural Networks (NN) to automatically label damages on images of above ground infrastructure (buildings made of concrete) taken after an earthquake. The goal of the supervised NN is to classify raw input data according to the patterns learned f...

Full description

Bibliographic Details
Main Author:	Mulligan, Kyle John
Format:	Others
Published:	DigitalCommons@CalPoly 2018
Subjects:	Jaccard acceleration parallel earthquake image tagging Other Computer Sciences
Online Access:	https://digitalcommons.calpoly.edu/theses/1934 https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=3213&context=theses

id	ndltd-CALPOLY-oai-digitalcommons.calpoly.edu-theses-3213
record_format	oai_dc
spelling	ndltd-CALPOLY-oai-digitalcommons.calpoly.edu-theses-32132021-09-14T05:01:41Z Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images Mulligan, Kyle John There are currently different efforts to use Supervised Neural Networks (NN) to automatically label damages on images of above ground infrastructure (buildings made of concrete) taken after an earthquake. The goal of the supervised NN is to classify raw input data according to the patterns learned from an input training set. This input training data set is usually supplied by experts in the field, and in the case of this project, structural engineers carefully and mostly manually label these images for different types of damage. The level of expertise of the professionals labeling the training set varies widely, and some data sets contain pictures that different people have labeled in different ways when in reality the label should have been the same. Therefore, we need to get several experts to evaluate the same data set; the bigger the ground truth/training set the more accurate the NN classifier will be. To evaluate these variations among experts, which can be considered equal to the task of evaluating the quality of the expert, using probabilistic theory we first need to implement a tool able to compare different images classified by different experts and apply a certainty level to the experts tagged labels. This master's thesis implements this comparative tool. We also decided to implement the comparative tool using parallel programming paradigms since we foresee that it will be used to train multiple young engineering students/professionals or even novice citizen volunteers (“trainees”) during after-earthquake meetings and workshops. The implementation of this software tool involves selecting around 200 photographs tagged by an expert with proven accuracy (“ground truth”) and comparing them to files tagged by the trainees. The trainees are then provided with instantaneous feedback on the accuracy of their damage assessment. The aforementioned problem of evaluating trainee results against the expert is not as simple as comparing and finding differences between two sets of image files. We anticipate challenges in that each trainee will select a slightly different sized area for the same occurrence of damage, and some damage-structure pairs are more difficult to recognize and tag. Results show that we can compare 500 files in 1.5 seconds which is an improvement of 2x faster compared to sequential implementation. 2018-06-01T07:00:00Z text application/pdf https://digitalcommons.calpoly.edu/theses/1934 https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=3213&context=theses Master's Theses DigitalCommons@CalPoly Jaccard acceleration parallel earthquake image tagging Other Computer Sciences
collection	NDLTD
format	Others
sources	NDLTD
topic	Jaccard acceleration parallel earthquake image tagging Other Computer Sciences
spellingShingle	Jaccard acceleration parallel earthquake image tagging Other Computer Sciences Mulligan, Kyle John Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
description	There are currently different efforts to use Supervised Neural Networks (NN) to automatically label damages on images of above ground infrastructure (buildings made of concrete) taken after an earthquake. The goal of the supervised NN is to classify raw input data according to the patterns learned from an input training set. This input training data set is usually supplied by experts in the field, and in the case of this project, structural engineers carefully and mostly manually label these images for different types of damage. The level of expertise of the professionals labeling the training set varies widely, and some data sets contain pictures that different people have labeled in different ways when in reality the label should have been the same. Therefore, we need to get several experts to evaluate the same data set; the bigger the ground truth/training set the more accurate the NN classifier will be. To evaluate these variations among experts, which can be considered equal to the task of evaluating the quality of the expert, using probabilistic theory we first need to implement a tool able to compare different images classified by different experts and apply a certainty level to the experts tagged labels. This master's thesis implements this comparative tool. We also decided to implement the comparative tool using parallel programming paradigms since we foresee that it will be used to train multiple young engineering students/professionals or even novice citizen volunteers (“trainees”) during after-earthquake meetings and workshops. The implementation of this software tool involves selecting around 200 photographs tagged by an expert with proven accuracy (“ground truth”) and comparing them to files tagged by the trainees. The trainees are then provided with instantaneous feedback on the accuracy of their damage assessment. The aforementioned problem of evaluating trainee results against the expert is not as simple as comparing and finding differences between two sets of image files. We anticipate challenges in that each trainee will select a slightly different sized area for the same occurrence of damage, and some damage-structure pairs are more difficult to recognize and tag. Results show that we can compare 500 files in 1.5 seconds which is an improvement of 2x faster compared to sequential implementation.
author	Mulligan, Kyle John
author_facet	Mulligan, Kyle John
author_sort	Mulligan, Kyle John
title	Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_short	Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_full	Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_fullStr	Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_full_unstemmed	Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images
title_sort	acceleration of jaccard’s index algorithm for training to tag damage on post-earthquake images
publisher	DigitalCommons@CalPoly
publishDate	2018
url	https://digitalcommons.calpoly.edu/theses/1934 https://digitalcommons.calpoly.edu/cgi/viewcontent.cgi?article=3213&context=theses
work_keys_str_mv	AT mulligankylejohn accelerationofjaccardsindexalgorithmfortrainingtotagdamageonpostearthquakeimages
_version_	1719480512224952320

Acceleration of Jaccard’s Index Algorithm for Training to Tag Damage on Post-Earthquake Images

Similar Items