Sparse Factor Analysis and Its Applications

博士 === 中興大學 === 應用數學系所 === 103 === Recent modern data set, such as genomic data and image data, often generate huge amount of information. A critical challenging component in analyzing high-dimensional data is how to reduce the dimension of data and how to extract relevant features. Hence we propose...

Full description

Bibliographic Details
Main Authors: Po-Yu Huang, 黃博煜
Other Authors: 許英麟
Format: Others
Language:en_US
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/13395304912843724581
id ndltd-TW-103NCHU5507003
record_format oai_dc
spelling ndltd-TW-103NCHU55070032016-02-05T04:16:47Z http://ndltd.ncl.edu.tw/handle/13395304912843724581 Sparse Factor Analysis and Its Applications 稀疏的因素分析模型及其應用 Po-Yu Huang 黃博煜 博士 中興大學 應用數學系所 103 Recent modern data set, such as genomic data and image data, often generate huge amount of information. A critical challenging component in analyzing high-dimensional data is how to reduce the dimension of data and how to extract relevant features. Hence we propose a simultaneously sparse factor analysis approach (SSFA) to tackle the problems by employing L1 penalty function to promote sparseness in factor loadings. For the application to clustering, we provide two clustering approaches based on SSFA: (1) Cutoff-split approach with excluding some non-separable patients via imposing another L1 penalty function in factor scores (cutoff-based clustering); (2) The mixture of SSFA (mixture-based clustering). Simulation results show that the SSFA yields a smaller bias and variance compared to other sparse approaches as well as lower classification error rate than other mixture model with dimension reduction methods. Application to a published gene signature in two unique lung cancer datasets demonstrates the utility of two types clustering approaches with SSFA in helping refine the gene signature and improve classification to better predict risk of cancer death and treatment benefit. 許英麟 2015 學位論文 ; thesis 75 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 中興大學 === 應用數學系所 === 103 === Recent modern data set, such as genomic data and image data, often generate huge amount of information. A critical challenging component in analyzing high-dimensional data is how to reduce the dimension of data and how to extract relevant features. Hence we propose a simultaneously sparse factor analysis approach (SSFA) to tackle the problems by employing L1 penalty function to promote sparseness in factor loadings. For the application to clustering, we provide two clustering approaches based on SSFA: (1) Cutoff-split approach with excluding some non-separable patients via imposing another L1 penalty function in factor scores (cutoff-based clustering); (2) The mixture of SSFA (mixture-based clustering). Simulation results show that the SSFA yields a smaller bias and variance compared to other sparse approaches as well as lower classification error rate than other mixture model with dimension reduction methods. Application to a published gene signature in two unique lung cancer datasets demonstrates the utility of two types clustering approaches with SSFA in helping refine the gene signature and improve classification to better predict risk of cancer death and treatment benefit.
author2 許英麟
author_facet 許英麟
Po-Yu Huang
黃博煜
author Po-Yu Huang
黃博煜
spellingShingle Po-Yu Huang
黃博煜
Sparse Factor Analysis and Its Applications
author_sort Po-Yu Huang
title Sparse Factor Analysis and Its Applications
title_short Sparse Factor Analysis and Its Applications
title_full Sparse Factor Analysis and Its Applications
title_fullStr Sparse Factor Analysis and Its Applications
title_full_unstemmed Sparse Factor Analysis and Its Applications
title_sort sparse factor analysis and its applications
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/13395304912843724581
work_keys_str_mv AT poyuhuang sparsefactoranalysisanditsapplications
AT huángbóyù sparsefactoranalysisanditsapplications
AT poyuhuang xīshūdeyīnsùfēnxīmóxíngjíqíyīngyòng
AT huángbóyù xīshūdeyīnsùfēnxīmóxíngjíqíyīngyòng
_version_ 1718181855297011712