Bandwidth Selection in Kernel Density Estimation
In kernel density estimation, the most crucial step is to select a proper bandwidth (smoothing parameter). There are two conceptually different approaches to this problem: a subjective and an objective approach. In this report, we only consider the objective approach, which is based upon minimizing...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Norges teknisk-naturvitenskapelige universitet, Institutt for matematiske fag
2010
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10015 |
id |
ndltd-UPSALLA1-oai-DiVA.org-ntnu-10015 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-ntnu-100152013-01-08T13:26:41ZBandwidth Selection in Kernel Density EstimationengKile, HåkonNorges teknisk-naturvitenskapelige universitet, Institutt for matematiske fagInstitutt for matematiske fag2010ntnudaimSIF3 fysikk og matematikkIndustriell matematikkIn kernel density estimation, the most crucial step is to select a proper bandwidth (smoothing parameter). There are two conceptually different approaches to this problem: a subjective and an objective approach. In this report, we only consider the objective approach, which is based upon minimizing an error, defined by an error criterion. The most common objective bandwidth selection method is to minimize some squared error expression, but this method is not without its critics. This approach is said to not perform satisfactory in the tail(s) of the density, and to put too much weight on observations close to the mode(s) of the density. An approach which minimizes an absolute error expression, is thought to be without these drawbacks. We will provide a new explicit formula for the mean integrated absolute error. The optimal mean integrated absolute error bandwidth will be compared to the optimal mean integrated squared error bandwidth. We will argue that these two bandwidths are essentially equal. In addition, we study data-driven bandwidth selection, and we will propose a new data-driven bandwidth selector. Our new bandwidth selector has promising behavior with respect to the visual error criterion, especially in the cases of limited sample sizes. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10015Local ntnudaim:5383application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
ntnudaim SIF3 fysikk og matematikk Industriell matematikk |
spellingShingle |
ntnudaim SIF3 fysikk og matematikk Industriell matematikk Kile, Håkon Bandwidth Selection in Kernel Density Estimation |
description |
In kernel density estimation, the most crucial step is to select a proper bandwidth (smoothing parameter). There are two conceptually different approaches to this problem: a subjective and an objective approach. In this report, we only consider the objective approach, which is based upon minimizing an error, defined by an error criterion. The most common objective bandwidth selection method is to minimize some squared error expression, but this method is not without its critics. This approach is said to not perform satisfactory in the tail(s) of the density, and to put too much weight on observations close to the mode(s) of the density. An approach which minimizes an absolute error expression, is thought to be without these drawbacks. We will provide a new explicit formula for the mean integrated absolute error. The optimal mean integrated absolute error bandwidth will be compared to the optimal mean integrated squared error bandwidth. We will argue that these two bandwidths are essentially equal. In addition, we study data-driven bandwidth selection, and we will propose a new data-driven bandwidth selector. Our new bandwidth selector has promising behavior with respect to the visual error criterion, especially in the cases of limited sample sizes. |
author |
Kile, Håkon |
author_facet |
Kile, Håkon |
author_sort |
Kile, Håkon |
title |
Bandwidth Selection in Kernel Density Estimation |
title_short |
Bandwidth Selection in Kernel Density Estimation |
title_full |
Bandwidth Selection in Kernel Density Estimation |
title_fullStr |
Bandwidth Selection in Kernel Density Estimation |
title_full_unstemmed |
Bandwidth Selection in Kernel Density Estimation |
title_sort |
bandwidth selection in kernel density estimation |
publisher |
Norges teknisk-naturvitenskapelige universitet, Institutt for matematiske fag |
publishDate |
2010 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-10015 |
work_keys_str_mv |
AT kilehakon bandwidthselectioninkerneldensityestimation |
_version_ |
1716520309443526656 |