Improving sector hash carving with rule-based and entropy-based non-probative block filters
Approved for public release; distribution is unlimited === Digital forensic investigators have traditionally used file hashes to identify known content on searched media. Recently, sector hashing has been proposed as an alternative identification method, in which files are broken up into blocks, whi...
Main Author: | |
---|---|
Other Authors: | |
Published: |
Monterey, California: Naval Postgraduate School
2015
|
Online Access: | http://hdl.handle.net/10945/45194 |
id |
ndltd-nps.edu-oai-calhoun.nps.edu-10945-45194 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-nps.edu-oai-calhoun.nps.edu-10945-451942015-05-08T03:57:07Z Improving sector hash carving with rule-based and entropy-based non-probative block filters Gutierrez-Villarreal, Francisco Javier McCarrin, Michael R. Young Joel D. Computer Science Approved for public release; distribution is unlimited Digital forensic investigators have traditionally used file hashes to identify known content on searched media. Recently, sector hashing has been proposed as an alternative identification method, in which files are broken up into blocks, which are then compared to sectors on searched media. Since sectors are read sequentially without accessing the file system, sector hashing can be parallelized easily and is faster than traditional methods. In addition, sector hashing can identify partial files, and does not require an exact file match. In some cases, the presence of even a single block is sufficient to demonstrate with high probability that a file resides on a drive. However, non-probative blocks, common across many files, generate false positive matches; a problem that must be addressed before sector hashing can be adopted. We conduct 7 experiments in two phases to filter non-probative blocks. Our first phase uses rule-based and entropy-based non-probative block filters to improve matching against all file types. In the second phase, we restrict the problem to JPEG files. We find that for general hash-based carving, a rule-based approach outperforms a simple entropy threshold. When searching for JPEGs, we find that an entropy threshold of 10.9 gives a precision of 80% and an accuracy of 99%. 2015-05-06T19:17:40Z 2015-05-06T19:17:40Z 2015-03 Thesis http://hdl.handle.net/10945/45194 This publication is a work of the U.S. Government as defined in Title 17, United States Code, Section 101. Copyright protection is not available for this work in the United States. Monterey, California: Naval Postgraduate School |
collection |
NDLTD |
sources |
NDLTD |
description |
Approved for public release; distribution is unlimited === Digital forensic investigators have traditionally used file hashes to identify known content on searched media. Recently, sector hashing has been proposed as an alternative identification method, in which files are broken up into blocks, which are then compared to sectors on searched media. Since sectors are read sequentially without accessing the file system, sector hashing can be parallelized easily and is faster than traditional methods. In addition, sector hashing can identify partial files, and does not require an exact file match. In some cases, the presence of even a single block is sufficient to demonstrate with high probability that a file resides on a drive. However, non-probative blocks, common across many files, generate false positive matches; a problem that must be addressed before sector hashing can be adopted. We conduct 7 experiments in two phases to filter non-probative blocks. Our first phase uses rule-based and entropy-based non-probative block filters to improve matching against all file types. In the second phase, we restrict the problem to JPEG files. We find that for general hash-based carving, a rule-based approach outperforms a simple entropy threshold. When searching for JPEGs, we find that an entropy threshold of 10.9 gives a precision of 80% and an accuracy of 99%. |
author2 |
McCarrin, Michael R. |
author_facet |
McCarrin, Michael R. Gutierrez-Villarreal, Francisco Javier |
author |
Gutierrez-Villarreal, Francisco Javier |
spellingShingle |
Gutierrez-Villarreal, Francisco Javier Improving sector hash carving with rule-based and entropy-based non-probative block filters |
author_sort |
Gutierrez-Villarreal, Francisco Javier |
title |
Improving sector hash carving with rule-based and entropy-based non-probative block filters |
title_short |
Improving sector hash carving with rule-based and entropy-based non-probative block filters |
title_full |
Improving sector hash carving with rule-based and entropy-based non-probative block filters |
title_fullStr |
Improving sector hash carving with rule-based and entropy-based non-probative block filters |
title_full_unstemmed |
Improving sector hash carving with rule-based and entropy-based non-probative block filters |
title_sort |
improving sector hash carving with rule-based and entropy-based non-probative block filters |
publisher |
Monterey, California: Naval Postgraduate School |
publishDate |
2015 |
url |
http://hdl.handle.net/10945/45194 |
work_keys_str_mv |
AT gutierrezvillarrealfranciscojavier improvingsectorhashcarvingwithrulebasedandentropybasednonprobativeblockfilters |
_version_ |
1716803412338671616 |