Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data

碩士 === 東海大學 === 統計學系 === 104 === In medical applications, we need to invent diagnostic devices that can generate references to detect disease status as soon as possible. In addition to cost considerations, the diagnostic device must have a certain degree of accuracy for determining such reference, a...

Full description

Bibliographic Details
Main Authors: Wang, Shu-Han, 王舒涵
Other Authors: Huang, Yu-Min
Format: Others
Language:zh-TW
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/93548854602394269867
id ndltd-TW-104THU00337002
record_format oai_dc
spelling ndltd-TW-104THU003370022016-05-08T04:05:51Z http://ndltd.ncl.edu.tw/handle/93548854602394269867 Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data 非比例分層樣本下之ROC臨界值分析 Wang, Shu-Han 王舒涵 碩士 東海大學 統計學系 104 In medical applications, we need to invent diagnostic devices that can generate references to detect disease status as soon as possible. In addition to cost considerations, the diagnostic device must have a certain degree of accuracy for determining such reference, and therefore the accuracy of the diagnostic device is an important issue. Typically, the outcomes of diagnostic devices are binary, with positive/negative signs based on a critical value of the quantity of a continuous variable, usually a biomarker which originally measures the medical status. In this work, we perform ROC curve (Receiver Operating Characteristic Curve) analysis to obtain the optimum critical value that has a lowest misclassification rate. In real practices, we may conduct disproportionately stratified sampling to collect data for improving the accuracy of critical value analysis. With such disproportionately stratified samples, we first estimated the distribution of the biomarker with conditional maximum likelihood under some parametrical assumptions. Then, we used the estimated probabilities as adjustments to calculate the ROC curve. Simulation analysis showed that our method can provide good estimators, and ROC curve analysis can objectively provide the best critical value. We also present an empirical study on blood samples of a type of glucose (a biomarker), tested with a newly invented test kit. A preliminary investigation found that the population distribution of such blood data is moderately skewed to the right. Therefore, we suggest a gamma distribution for this biomarker. We performed our method to analyze this data and also analyze questionnaires related to the operations of the test kit. Huang, Yu-Min 黃愉閔 2015 學位論文 ; thesis 73 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 東海大學 === 統計學系 === 104 === In medical applications, we need to invent diagnostic devices that can generate references to detect disease status as soon as possible. In addition to cost considerations, the diagnostic device must have a certain degree of accuracy for determining such reference, and therefore the accuracy of the diagnostic device is an important issue. Typically, the outcomes of diagnostic devices are binary, with positive/negative signs based on a critical value of the quantity of a continuous variable, usually a biomarker which originally measures the medical status. In this work, we perform ROC curve (Receiver Operating Characteristic Curve) analysis to obtain the optimum critical value that has a lowest misclassification rate. In real practices, we may conduct disproportionately stratified sampling to collect data for improving the accuracy of critical value analysis. With such disproportionately stratified samples, we first estimated the distribution of the biomarker with conditional maximum likelihood under some parametrical assumptions. Then, we used the estimated probabilities as adjustments to calculate the ROC curve. Simulation analysis showed that our method can provide good estimators, and ROC curve analysis can objectively provide the best critical value. We also present an empirical study on blood samples of a type of glucose (a biomarker), tested with a newly invented test kit. A preliminary investigation found that the population distribution of such blood data is moderately skewed to the right. Therefore, we suggest a gamma distribution for this biomarker. We performed our method to analyze this data and also analyze questionnaires related to the operations of the test kit.
author2 Huang, Yu-Min
author_facet Huang, Yu-Min
Wang, Shu-Han
王舒涵
author Wang, Shu-Han
王舒涵
spellingShingle Wang, Shu-Han
王舒涵
Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data
author_sort Wang, Shu-Han
title Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data
title_short Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data
title_full Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data
title_fullStr Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data
title_full_unstemmed Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data
title_sort analysis of the optimal critical points on roc curves with disproportionately-sampled data
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/93548854602394269867
work_keys_str_mv AT wangshuhan analysisoftheoptimalcriticalpointsonroccurveswithdisproportionatelysampleddata
AT wángshūhán analysisoftheoptimalcriticalpointsonroccurveswithdisproportionatelysampleddata
AT wangshuhan fēibǐlìfēncéngyàngběnxiàzhīroclínjièzhífēnxī
AT wángshūhán fēibǐlìfēncéngyàngběnxiàzhīroclínjièzhífēnxī
_version_ 1718262412833980416