The file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor Wilgenbus

The increased use of digital media to store legal, as well as illegal data, has created the need for specialized tools that can monitor, control and even recover this data. An important task in computer forensics and security is to identify the true le type to which a computer le or computer le f...

Full description

Bibliographic Details
Main Author: Wilgenbus, Erich Feodor
Language:en
Published: North-West University 2014
Subjects:
Online Access:http://hdl.handle.net/10394/10215
id ndltd-NWUBOLOKA1-oai-dspace.nwu.ac.za-10394-10215
record_format oai_dc
spelling ndltd-NWUBOLOKA1-oai-dspace.nwu.ac.za-10394-102152014-09-30T04:06:27ZThe file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor WilgenbusWilgenbus, Erich FeodorFile type identifificationFile fragment type identificationMultilayer perceptron neural networkLinear programming-based discriminant analysisEnsemblesClassificationRekenaarlêerformaatidentifiseringRekenaarlêerfragmentformaatidentifiseringLêerfragmentformaatidentifiseringMultilaag-perseptron neurale netwerkLineêre programmeringgebaseerde diskriminantklassifiseerderKlassifikasieThe increased use of digital media to store legal, as well as illegal data, has created the need for specialized tools that can monitor, control and even recover this data. An important task in computer forensics and security is to identify the true le type to which a computer le or computer le fragment belongs. File type identi cation is traditionally done by means of metadata, such as le extensions and le header and footer signatures. As a result, traditional metadata-based le object type identi cation techniques work well in cases where the required metadata is available and unaltered. However, traditional approaches are not reliable when the integrity of metadata is not guaranteed or metadata is unavailable. As an alternative, any pattern in the content of a le object can be used to determine the associated le type. This is called content-based le object type identi cation. Supervised learning techniques can be used to infer a le object type classi er by exploiting some unique pattern that underlies a le type's common le structure. This study builds on existing literature regarding the use of supervised learning techniques for content-based le object type identi cation, and explores the combined use of multilayer perceptron neural network classi ers and linear programming-based discriminant classi ers as a solution to the multiple class le fragment type identi cation problem. The purpose of this study was to investigate and compare the use of a single multilayer perceptron neural network classi er, a single linear programming-based discriminant classi- er and a combined ensemble of these classi ers in the eld of le type identi cation. The ability of each individual classi er and the ensemble of these classi ers to accurately predict the le type to which a le fragment belongs were tested empirically. The study found that both a multilayer perceptron neural network and a linear programming- based discriminant classi er (used in a round robin) seemed to perform well in solving the multiple class le fragment type identi cation problem. The results of combining multilayer perceptron neural network classi ers and linear programming-based discriminant classi ers in an ensemble were not better than those of the single optimized classi ers.MSc (Computer Science), North-West University, Potchefstroom Campus, 2013North-West University2014-03-07T08:09:19Z2014-03-07T08:09:19Z2013Thesishttp://hdl.handle.net/10394/10215en
collection NDLTD
language en
sources NDLTD
topic File type identifification
File fragment type identification
Multilayer perceptron neural network
Linear programming-based discriminant analysis
Ensembles
Classification
Rekenaarlêerformaatidentifisering
Rekenaarlêerfragmentformaatidentifisering
Lêerfragmentformaatidentifisering
Multilaag-perseptron neurale netwerk
Lineêre programmeringgebaseerde diskriminantklassifiseerder
Klassifikasie
spellingShingle File type identifification
File fragment type identification
Multilayer perceptron neural network
Linear programming-based discriminant analysis
Ensembles
Classification
Rekenaarlêerformaatidentifisering
Rekenaarlêerfragmentformaatidentifisering
Lêerfragmentformaatidentifisering
Multilaag-perseptron neurale netwerk
Lineêre programmeringgebaseerde diskriminantklassifiseerder
Klassifikasie
Wilgenbus, Erich Feodor
The file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor Wilgenbus
description The increased use of digital media to store legal, as well as illegal data, has created the need for specialized tools that can monitor, control and even recover this data. An important task in computer forensics and security is to identify the true le type to which a computer le or computer le fragment belongs. File type identi cation is traditionally done by means of metadata, such as le extensions and le header and footer signatures. As a result, traditional metadata-based le object type identi cation techniques work well in cases where the required metadata is available and unaltered. However, traditional approaches are not reliable when the integrity of metadata is not guaranteed or metadata is unavailable. As an alternative, any pattern in the content of a le object can be used to determine the associated le type. This is called content-based le object type identi cation. Supervised learning techniques can be used to infer a le object type classi er by exploiting some unique pattern that underlies a le type's common le structure. This study builds on existing literature regarding the use of supervised learning techniques for content-based le object type identi cation, and explores the combined use of multilayer perceptron neural network classi ers and linear programming-based discriminant classi ers as a solution to the multiple class le fragment type identi cation problem. The purpose of this study was to investigate and compare the use of a single multilayer perceptron neural network classi er, a single linear programming-based discriminant classi- er and a combined ensemble of these classi ers in the eld of le type identi cation. The ability of each individual classi er and the ensemble of these classi ers to accurately predict the le type to which a le fragment belongs were tested empirically. The study found that both a multilayer perceptron neural network and a linear programming- based discriminant classi er (used in a round robin) seemed to perform well in solving the multiple class le fragment type identi cation problem. The results of combining multilayer perceptron neural network classi ers and linear programming-based discriminant classi ers in an ensemble were not better than those of the single optimized classi ers. === MSc (Computer Science), North-West University, Potchefstroom Campus, 2013
author Wilgenbus, Erich Feodor
author_facet Wilgenbus, Erich Feodor
author_sort Wilgenbus, Erich Feodor
title The file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor Wilgenbus
title_short The file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor Wilgenbus
title_full The file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor Wilgenbus
title_fullStr The file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor Wilgenbus
title_full_unstemmed The file fragment classification problem : a combined neural network and linear programming discriminant model approach / Erich Feodor Wilgenbus
title_sort file fragment classification problem : a combined neural network and linear programming discriminant model approach / erich feodor wilgenbus
publisher North-West University
publishDate 2014
url http://hdl.handle.net/10394/10215
work_keys_str_mv AT wilgenbuserichfeodor thefilefragmentclassificationproblemacombinedneuralnetworkandlinearprogrammingdiscriminantmodelapproacherichfeodorwilgenbus
AT wilgenbuserichfeodor filefragmentclassificationproblemacombinedneuralnetworkandlinearprogrammingdiscriminantmodelapproacherichfeodorwilgenbus
_version_ 1716715456680689664