Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data
碩士 === 東海大學 === 統計學系 === 104 === In medical applications, we need to invent diagnostic devices that can generate references to detect disease status as soon as possible. In addition to cost considerations, the diagnostic device must have a certain degree of accuracy for determining such reference, a...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/93548854602394269867 |
id |
ndltd-TW-104THU00337002 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104THU003370022016-05-08T04:05:51Z http://ndltd.ncl.edu.tw/handle/93548854602394269867 Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data 非比例分層樣本下之ROC臨界值分析 Wang, Shu-Han 王舒涵 碩士 東海大學 統計學系 104 In medical applications, we need to invent diagnostic devices that can generate references to detect disease status as soon as possible. In addition to cost considerations, the diagnostic device must have a certain degree of accuracy for determining such reference, and therefore the accuracy of the diagnostic device is an important issue. Typically, the outcomes of diagnostic devices are binary, with positive/negative signs based on a critical value of the quantity of a continuous variable, usually a biomarker which originally measures the medical status. In this work, we perform ROC curve (Receiver Operating Characteristic Curve) analysis to obtain the optimum critical value that has a lowest misclassification rate. In real practices, we may conduct disproportionately stratified sampling to collect data for improving the accuracy of critical value analysis. With such disproportionately stratified samples, we first estimated the distribution of the biomarker with conditional maximum likelihood under some parametrical assumptions. Then, we used the estimated probabilities as adjustments to calculate the ROC curve. Simulation analysis showed that our method can provide good estimators, and ROC curve analysis can objectively provide the best critical value. We also present an empirical study on blood samples of a type of glucose (a biomarker), tested with a newly invented test kit. A preliminary investigation found that the population distribution of such blood data is moderately skewed to the right. Therefore, we suggest a gamma distribution for this biomarker. We performed our method to analyze this data and also analyze questionnaires related to the operations of the test kit. Huang, Yu-Min 黃愉閔 2015 學位論文 ; thesis 73 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 東海大學 === 統計學系 === 104 === In medical applications, we need to invent diagnostic devices that can generate references to detect disease status as soon as possible. In addition to cost considerations, the diagnostic device must have a certain degree of accuracy for determining such reference, and therefore the accuracy of the diagnostic device is an important issue. Typically, the outcomes of diagnostic devices are binary, with positive/negative signs based on a critical value of the quantity of a continuous variable, usually a biomarker which originally measures the medical status. In this work, we perform ROC curve (Receiver Operating Characteristic Curve) analysis to obtain the optimum critical value that has a lowest misclassification rate. In real practices, we may conduct disproportionately stratified sampling to collect data for improving the accuracy of critical value analysis. With such disproportionately stratified samples, we first estimated the distribution of the biomarker with conditional maximum likelihood under some parametrical assumptions. Then, we used the estimated probabilities as adjustments to calculate the ROC curve. Simulation analysis showed that our method can provide good estimators, and ROC curve analysis can objectively provide the best critical value. We also present an empirical study on blood samples of a type of glucose (a biomarker), tested with a newly invented test kit. A preliminary investigation found that the population distribution of such blood data is moderately skewed to the right. Therefore, we suggest a gamma distribution for this biomarker. We performed our method to analyze this data and also analyze questionnaires related to the operations of the test kit.
|
author2 |
Huang, Yu-Min |
author_facet |
Huang, Yu-Min Wang, Shu-Han 王舒涵 |
author |
Wang, Shu-Han 王舒涵 |
spellingShingle |
Wang, Shu-Han 王舒涵 Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data |
author_sort |
Wang, Shu-Han |
title |
Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data |
title_short |
Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data |
title_full |
Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data |
title_fullStr |
Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data |
title_full_unstemmed |
Analysis of the Optimal Critical Points on ROC Curves with Disproportionately-Sampled Data |
title_sort |
analysis of the optimal critical points on roc curves with disproportionately-sampled data |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/93548854602394269867 |
work_keys_str_mv |
AT wangshuhan analysisoftheoptimalcriticalpointsonroccurveswithdisproportionatelysampleddata AT wángshūhán analysisoftheoptimalcriticalpointsonroccurveswithdisproportionatelysampleddata AT wangshuhan fēibǐlìfēncéngyàngběnxiàzhīroclínjièzhífēnxī AT wángshūhán fēibǐlìfēncéngyàngběnxiàzhīroclínjièzhífēnxī |
_version_ |
1718262412833980416 |