Improving sector hash carving with rule-based and entropy-based non-probative block filters

Approved for public release; distribution is unlimited === Digital forensic investigators have traditionally used file hashes to identify known content on searched media. Recently, sector hashing has been proposed as an alternative identification method, in which files are broken up into blocks, whi...

Full description

Bibliographic Details
Main Author: Gutierrez-Villarreal, Francisco Javier
Other Authors: McCarrin, Michael R.
Published: Monterey, California: Naval Postgraduate School 2015
Online Access:http://hdl.handle.net/10945/45194
id ndltd-nps.edu-oai-calhoun.nps.edu-10945-45194
record_format oai_dc
spelling ndltd-nps.edu-oai-calhoun.nps.edu-10945-451942015-05-08T03:57:07Z Improving sector hash carving with rule-based and entropy-based non-probative block filters Gutierrez-Villarreal, Francisco Javier McCarrin, Michael R. Young Joel D. Computer Science Approved for public release; distribution is unlimited Digital forensic investigators have traditionally used file hashes to identify known content on searched media. Recently, sector hashing has been proposed as an alternative identification method, in which files are broken up into blocks, which are then compared to sectors on searched media. Since sectors are read sequentially without accessing the file system, sector hashing can be parallelized easily and is faster than traditional methods. In addition, sector hashing can identify partial files, and does not require an exact file match. In some cases, the presence of even a single block is sufficient to demonstrate with high probability that a file resides on a drive. However, non-probative blocks, common across many files, generate false positive matches; a problem that must be addressed before sector hashing can be adopted. We conduct 7 experiments in two phases to filter non-probative blocks. Our first phase uses rule-based and entropy-based non-probative block filters to improve matching against all file types. In the second phase, we restrict the problem to JPEG files. We find that for general hash-based carving, a rule-based approach outperforms a simple entropy threshold. When searching for JPEGs, we find that an entropy threshold of 10.9 gives a precision of 80% and an accuracy of 99%. 2015-05-06T19:17:40Z 2015-05-06T19:17:40Z 2015-03 Thesis http://hdl.handle.net/10945/45194 This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States. Monterey, California: Naval Postgraduate School
collection NDLTD
sources NDLTD
description Approved for public release; distribution is unlimited === Digital forensic investigators have traditionally used file hashes to identify known content on searched media. Recently, sector hashing has been proposed as an alternative identification method, in which files are broken up into blocks, which are then compared to sectors on searched media. Since sectors are read sequentially without accessing the file system, sector hashing can be parallelized easily and is faster than traditional methods. In addition, sector hashing can identify partial files, and does not require an exact file match. In some cases, the presence of even a single block is sufficient to demonstrate with high probability that a file resides on a drive. However, non-probative blocks, common across many files, generate false positive matches; a problem that must be addressed before sector hashing can be adopted. We conduct 7 experiments in two phases to filter non-probative blocks. Our first phase uses rule-based and entropy-based non-probative block filters to improve matching against all file types. In the second phase, we restrict the problem to JPEG files. We find that for general hash-based carving, a rule-based approach outperforms a simple entropy threshold. When searching for JPEGs, we find that an entropy threshold of 10.9 gives a precision of 80% and an accuracy of 99%.
author2 McCarrin, Michael R.
author_facet McCarrin, Michael R.
Gutierrez-Villarreal, Francisco Javier
author Gutierrez-Villarreal, Francisco Javier
spellingShingle Gutierrez-Villarreal, Francisco Javier
Improving sector hash carving with rule-based and entropy-based non-probative block filters
author_sort Gutierrez-Villarreal, Francisco Javier
title Improving sector hash carving with rule-based and entropy-based non-probative block filters
title_short Improving sector hash carving with rule-based and entropy-based non-probative block filters
title_full Improving sector hash carving with rule-based and entropy-based non-probative block filters
title_fullStr Improving sector hash carving with rule-based and entropy-based non-probative block filters
title_full_unstemmed Improving sector hash carving with rule-based and entropy-based non-probative block filters
title_sort improving sector hash carving with rule-based and entropy-based non-probative block filters
publisher Monterey, California: Naval Postgraduate School
publishDate 2015
url http://hdl.handle.net/10945/45194
work_keys_str_mv AT gutierrezvillarrealfranciscojavier improvingsectorhashcarvingwithrulebasedandentropybasednonprobativeblockfilters
_version_ 1716803412338671616