On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior

New methods for classifying tails of probability distributions based on data are proposed. Some methods apply the nonparametric theories of Rojo [35] and Schuster [36] and differ from classical extreme value theory and other well established methods. All the methods implement the extreme spacing of...

Full description

Bibliographic Details
Main Author: Ott, Richard Charles
Other Authors: Rojo, Javier
Format: Others
Language:English
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/1911/18792
id ndltd-RICE-oai-scholarship.rice.edu-1911-18792
record_format oai_dc
spelling ndltd-RICE-oai-scholarship.rice.edu-1911-187922013-10-23T04:14:26ZOn the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behaviorOtt, Richard CharlesStatisticsNew methods for classifying tails of probability distributions based on data are proposed. Some methods apply the nonparametric theories of Rojo [35] and Schuster [36] and differ from classical extreme value theory and other well established methods. All the methods implement the extreme spacing of the data, the difference of the largest and second largest values. The results are then compared based on power properties to the classical technique of a Points Over Threshold model based on the Generalized Pareto Distribution (GPD). The following topics are the foundation of this thesis: Chapter 1. Review of classical extreme value theory and discussion on the class of medium-tailed distributions. Chapter 2. Review of the tail classification schemes of Parzen, Schuster, and Rojo upon which the latter two suggest the usage of the Extreme Spacing (ES) as a possible classifying instrument. Additional subcategorizations are also provided for the schemes of Schuster and Rojo. Chapter 3. Review of estimation methods for the Points Over Threshold GPD parameters for classification purposes. A Monte Carlo study classifying tails of many common distributions using the GPD by way of maximum likelihood is also provided. Chapter 4. Three classification tests based on the ES are provided. The first is a test to decide whether a sample originates from a completely specified distribution such as Exp(1). The second classifies whether data originated from an exponential distribution with unknown parameter. The third classifies an underlying distribution as short-, medium-, or long-tailed. Also discussed, is the potential benefit of blocking the data before applying the above mentioned tests. Chapter 5. Classifying specific data sets by way of the new methods. Some of the new ES methods may be applicable to the data when classical methods are inapplicable, for example when the GPD maximum likelihood numerical algorithm does not converge to yield a shape parameter estimate or when the variance of the shape parameter cannot be estimated since the parameter estimate is close to a parameter space endpoint. Even when classical methods are applicable, these tests can give a more thorough understanding of the tail behavior of the underlying distribution.Rojo, Javier2009-06-04T08:44:40Z2009-06-04T08:44:40Z2005ThesisText135 p.application/pdfhttp://hdl.handle.net/1911/18792eng
collection NDLTD
language English
format Others
sources NDLTD
topic Statistics
spellingShingle Statistics
Ott, Richard Charles
On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior
description New methods for classifying tails of probability distributions based on data are proposed. Some methods apply the nonparametric theories of Rojo [35] and Schuster [36] and differ from classical extreme value theory and other well established methods. All the methods implement the extreme spacing of the data, the difference of the largest and second largest values. The results are then compared based on power properties to the classical technique of a Points Over Threshold model based on the Generalized Pareto Distribution (GPD). The following topics are the foundation of this thesis: Chapter 1. Review of classical extreme value theory and discussion on the class of medium-tailed distributions. Chapter 2. Review of the tail classification schemes of Parzen, Schuster, and Rojo upon which the latter two suggest the usage of the Extreme Spacing (ES) as a possible classifying instrument. Additional subcategorizations are also provided for the schemes of Schuster and Rojo. Chapter 3. Review of estimation methods for the Points Over Threshold GPD parameters for classification purposes. A Monte Carlo study classifying tails of many common distributions using the GPD by way of maximum likelihood is also provided. Chapter 4. Three classification tests based on the ES are provided. The first is a test to decide whether a sample originates from a completely specified distribution such as Exp(1). The second classifies whether data originated from an exponential distribution with unknown parameter. The third classifies an underlying distribution as short-, medium-, or long-tailed. Also discussed, is the potential benefit of blocking the data before applying the above mentioned tests. Chapter 5. Classifying specific data sets by way of the new methods. Some of the new ES methods may be applicable to the data when classical methods are inapplicable, for example when the GPD maximum likelihood numerical algorithm does not converge to yield a shape parameter estimate or when the variance of the shape parameter cannot be estimated since the parameter estimate is close to a parameter space endpoint. Even when classical methods are applicable, these tests can give a more thorough understanding of the tail behavior of the underlying distribution.
author2 Rojo, Javier
author_facet Rojo, Javier
Ott, Richard Charles
author Ott, Richard Charles
author_sort Ott, Richard Charles
title On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior
title_short On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior
title_full On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior
title_fullStr On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior
title_full_unstemmed On the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior
title_sort on the operating characteristics of some non-parametric methodologies for the classification of distributions by tail behavior
publishDate 2009
url http://hdl.handle.net/1911/18792
work_keys_str_mv AT ottrichardcharles ontheoperatingcharacteristicsofsomenonparametricmethodologiesfortheclassificationofdistributionsbytailbehavior
_version_ 1716610991808053248