Sparse Factor Analysis and Its Applications
博士 === 中興大學 === 應用數學系所 === 103 === Recent modern data set, such as genomic data and image data, often generate huge amount of information. A critical challenging component in analyzing high-dimensional data is how to reduce the dimension of data and how to extract relevant features. Hence we propose...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/13395304912843724581 |
id |
ndltd-TW-103NCHU5507003 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NCHU55070032016-02-05T04:16:47Z http://ndltd.ncl.edu.tw/handle/13395304912843724581 Sparse Factor Analysis and Its Applications 稀疏的因素分析模型及其應用 Po-Yu Huang 黃博煜 博士 中興大學 應用數學系所 103 Recent modern data set, such as genomic data and image data, often generate huge amount of information. A critical challenging component in analyzing high-dimensional data is how to reduce the dimension of data and how to extract relevant features. Hence we propose a simultaneously sparse factor analysis approach (SSFA) to tackle the problems by employing L1 penalty function to promote sparseness in factor loadings. For the application to clustering, we provide two clustering approaches based on SSFA: (1) Cutoff-split approach with excluding some non-separable patients via imposing another L1 penalty function in factor scores (cutoff-based clustering); (2) The mixture of SSFA (mixture-based clustering). Simulation results show that the SSFA yields a smaller bias and variance compared to other sparse approaches as well as lower classification error rate than other mixture model with dimension reduction methods. Application to a published gene signature in two unique lung cancer datasets demonstrates the utility of two types clustering approaches with SSFA in helping refine the gene signature and improve classification to better predict risk of cancer death and treatment benefit. 許英麟 2015 學位論文 ; thesis 75 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 中興大學 === 應用數學系所 === 103 === Recent modern data set, such as genomic data and image data, often generate huge amount of information. A critical challenging component in analyzing high-dimensional data is how to reduce the dimension of data and how to extract relevant features. Hence we propose a simultaneously sparse factor analysis approach (SSFA) to tackle the problems by employing L1 penalty function to promote sparseness in factor loadings. For the application to clustering, we provide two clustering approaches based on SSFA: (1) Cutoff-split approach with excluding some non-separable patients via imposing another L1 penalty function in factor scores (cutoff-based clustering); (2) The mixture of SSFA (mixture-based clustering).
Simulation results show that the SSFA yields a smaller bias and variance compared to other sparse approaches as well as lower classification error rate than other mixture model with dimension reduction methods. Application to a published gene signature in two unique lung cancer datasets demonstrates the utility of two types clustering approaches with SSFA in helping refine the gene signature and improve classification to better predict risk of cancer death and treatment benefit.
|
author2 |
許英麟 |
author_facet |
許英麟 Po-Yu Huang 黃博煜 |
author |
Po-Yu Huang 黃博煜 |
spellingShingle |
Po-Yu Huang 黃博煜 Sparse Factor Analysis and Its Applications |
author_sort |
Po-Yu Huang |
title |
Sparse Factor Analysis and Its Applications |
title_short |
Sparse Factor Analysis and Its Applications |
title_full |
Sparse Factor Analysis and Its Applications |
title_fullStr |
Sparse Factor Analysis and Its Applications |
title_full_unstemmed |
Sparse Factor Analysis and Its Applications |
title_sort |
sparse factor analysis and its applications |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/13395304912843724581 |
work_keys_str_mv |
AT poyuhuang sparsefactoranalysisanditsapplications AT huángbóyù sparsefactoranalysisanditsapplications AT poyuhuang xīshūdeyīnsùfēnxīmóxíngjíqíyīngyòng AT huángbóyù xīshūdeyīnsùfēnxīmóxíngjíqíyīngyòng |
_version_ |
1718181855297011712 |