Bandwidth Selection in Kernel Density Estimation

In kernel density estimation, the most crucial step is to select a proper bandwidth (smoothing parameter). There are two conceptually different approaches to this problem: a subjective and an objective approach. In this report, we only consider the objective approach, which is based upon minimizing...

Full description

Bibliographic Details
Main Author: Kile, Håkon
Format: Others
Language:English
Published: Norges teknisk-naturvitenskapelige universitet, Institutt for matematiske fag 2010
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10015
id ndltd-UPSALLA1-oai-DiVA.org-ntnu-10015
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ntnu-100152013-01-08T13:26:41ZBandwidth Selection in Kernel Density EstimationengKile, HåkonNorges teknisk-naturvitenskapelige universitet, Institutt for matematiske fagInstitutt for matematiske fag2010ntnudaimSIF3 fysikk og matematikkIndustriell matematikkIn kernel density estimation, the most crucial step is to select a proper bandwidth (smoothing parameter). There are two conceptually different approaches to this problem: a subjective and an objective approach. In this report, we only consider the objective approach, which is based upon minimizing an error, defined by an error criterion. The most common objective bandwidth selection method is to minimize some squared error expression, but this method is not without its critics. This approach is said to not perform satisfactory in the tail(s) of the density, and to put too much weight on observations close to the mode(s) of the density. An approach which minimizes an absolute error expression, is thought to be without these drawbacks. We will provide a new explicit formula for the mean integrated absolute error. The optimal mean integrated absolute error bandwidth will be compared to the optimal mean integrated squared error bandwidth. We will argue that these two bandwidths are essentially equal. In addition, we study data-driven bandwidth selection, and we will propose a new data-driven bandwidth selector. Our new bandwidth selector has promising behavior with respect to the visual error criterion, especially in the cases of limited sample sizes. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10015Local ntnudaim:5383application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic ntnudaim
SIF3 fysikk og matematikk
Industriell matematikk
spellingShingle ntnudaim
SIF3 fysikk og matematikk
Industriell matematikk
Kile, Håkon
Bandwidth Selection in Kernel Density Estimation
description In kernel density estimation, the most crucial step is to select a proper bandwidth (smoothing parameter). There are two conceptually different approaches to this problem: a subjective and an objective approach. In this report, we only consider the objective approach, which is based upon minimizing an error, defined by an error criterion. The most common objective bandwidth selection method is to minimize some squared error expression, but this method is not without its critics. This approach is said to not perform satisfactory in the tail(s) of the density, and to put too much weight on observations close to the mode(s) of the density. An approach which minimizes an absolute error expression, is thought to be without these drawbacks. We will provide a new explicit formula for the mean integrated absolute error. The optimal mean integrated absolute error bandwidth will be compared to the optimal mean integrated squared error bandwidth. We will argue that these two bandwidths are essentially equal. In addition, we study data-driven bandwidth selection, and we will propose a new data-driven bandwidth selector. Our new bandwidth selector has promising behavior with respect to the visual error criterion, especially in the cases of limited sample sizes.
author Kile, Håkon
author_facet Kile, Håkon
author_sort Kile, Håkon
title Bandwidth Selection in Kernel Density Estimation
title_short Bandwidth Selection in Kernel Density Estimation
title_full Bandwidth Selection in Kernel Density Estimation
title_fullStr Bandwidth Selection in Kernel Density Estimation
title_full_unstemmed Bandwidth Selection in Kernel Density Estimation
title_sort bandwidth selection in kernel density estimation
publisher Norges teknisk-naturvitenskapelige universitet, Institutt for matematiske fag
publishDate 2010
url http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10015
work_keys_str_mv AT kilehakon bandwidthselectioninkerneldensityestimation
_version_ 1716520309443526656