Empirical evaluation of semi-supervised naïve Bayes for active learning

This thesis describes an empirical evaluation of semi-supervised and active learning individually, and in combination for the naïve Bayes classifier. Active learning aims to minimise the amount of labelled data required to train the classifier by using the model to direct the labelling of the most...

Full description

Bibliographic Details
Main Author:	Saeed, Awat Abdulla
Published:	University of East Anglia 2018
Subjects:	004
Online Access:	https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.753892

id	ndltd-bl.uk-oai-ethos.bl.uk-753892
record_format	oai_dc
spelling	ndltd-bl.uk-oai-ethos.bl.uk-7538922019-03-05T15:44:01ZEmpirical evaluation of semi-supervised naïve Bayes for active learningSaeed, Awat Abdulla2018This thesis describes an empirical evaluation of semi-supervised and active learning individually, and in combination for the naïve Bayes classifier. Active learning aims to minimise the amount of labelled data required to train the classifier by using the model to direct the labelling of the most informative unlabelled examples. The key difficulty with active learning is that the initial model often gives a poor direction for labelling the unlabelled data in the early stages. However, using both labelled and unlabelled data with semi-supervised learning might be achieve a better initial model because the limited labelled data are augmented by the information in the unlabelled data. In this thesis, a suite of benchmark datasets is used to evaluate the benefit of semi-supervised learning and presents the learning curves for experiments to compare the performance of each approach. First, we will show that the semi-supervised naïve Bayes does not significantly improve the performance of the naïve Bayes classifier. Subsequently, a down-weighting technique is used to control the influence of the unlabelled data, but again this does not improve performance. In the next experiment, a novel algorithm is proposed by using a sigmoid transformation to recalibrate the overly confident naïve Bayes classifier. This algorithm does not significantly improve on the naïve Bayes classifier, but at least does improve the semi-supervised naïve Bayes classifier. In the final experiment we investigate the effectiveness of the combination of active and semi-supervised learning and empirically illustrate when the combination does work, and when does not.004University of East Angliahttps://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.753892https://ueaeprints.uea.ac.uk/67655/Electronic Thesis or Dissertation
collection	NDLTD
sources	NDLTD
topic	004
spellingShingle	004 Saeed, Awat Abdulla Empirical evaluation of semi-supervised naïve Bayes for active learning
description	This thesis describes an empirical evaluation of semi-supervised and active learning individually, and in combination for the naïve Bayes classifier. Active learning aims to minimise the amount of labelled data required to train the classifier by using the model to direct the labelling of the most informative unlabelled examples. The key difficulty with active learning is that the initial model often gives a poor direction for labelling the unlabelled data in the early stages. However, using both labelled and unlabelled data with semi-supervised learning might be achieve a better initial model because the limited labelled data are augmented by the information in the unlabelled data. In this thesis, a suite of benchmark datasets is used to evaluate the benefit of semi-supervised learning and presents the learning curves for experiments to compare the performance of each approach. First, we will show that the semi-supervised naïve Bayes does not significantly improve the performance of the naïve Bayes classifier. Subsequently, a down-weighting technique is used to control the influence of the unlabelled data, but again this does not improve performance. In the next experiment, a novel algorithm is proposed by using a sigmoid transformation to recalibrate the overly confident naïve Bayes classifier. This algorithm does not significantly improve on the naïve Bayes classifier, but at least does improve the semi-supervised naïve Bayes classifier. In the final experiment we investigate the effectiveness of the combination of active and semi-supervised learning and empirically illustrate when the combination does work, and when does not.
author	Saeed, Awat Abdulla
author_facet	Saeed, Awat Abdulla
author_sort	Saeed, Awat Abdulla
title	Empirical evaluation of semi-supervised naïve Bayes for active learning
title_short	Empirical evaluation of semi-supervised naïve Bayes for active learning
title_full	Empirical evaluation of semi-supervised naïve Bayes for active learning
title_fullStr	Empirical evaluation of semi-supervised naïve Bayes for active learning
title_full_unstemmed	Empirical evaluation of semi-supervised naïve Bayes for active learning
title_sort	empirical evaluation of semi-supervised naïve bayes for active learning
publisher	University of East Anglia
publishDate	2018
url	https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.753892
work_keys_str_mv	AT saeedawatabdulla empiricalevaluationofsemisupervisednaivebayesforactivelearning
_version_	1718996601894076416

Empirical evaluation of semi-supervised naïve Bayes for active learning

Similar Items