IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS

碩士 === 國立臺灣大學 === 生醫電子與資訊學研究所 === 102 === Gas chromatography / time of flight mass spectrometer (GC/TOF-MS) has become an important technique for metabolomics. We developed IDMass, a novel algorithm that accurately and sensitively extract and identify the individual components in GC/TOF-MS samples i...

Full description

Bibliographic Details
Main Authors: Yu-Yen Chung, 鍾宇彥
Other Authors: Y. Jane Tseng
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/26300398732089206059
id ndltd-TW-102NTU05114003
record_format oai_dc
spelling ndltd-TW-102NTU051140032016-03-09T04:24:03Z http://ndltd.ncl.edu.tw/handle/26300398732089206059 IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS 去噪、成分分離及辨識於氣相層析串聯飛行時間質譜儀之工具套件 Yu-Yen Chung 鍾宇彥 碩士 國立臺灣大學 生醫電子與資訊學研究所 102 Gas chromatography / time of flight mass spectrometer (GC/TOF-MS) has become an important technique for metabolomics. We developed IDMass, a novel algorithm that accurately and sensitively extract and identify the individual components in GC/TOF-MS samples in this study. IDMass comprises five main steps: noise reduction, deconvolution window determination, chemical rank determination, component extraction and identification. First, by subtracting detector noise in mass dimension, resulting peaks generated by IDMass noise reduction step demonstrates to have better shapes and also improve the identification result. Second, IDMass detects peak regions by calculating a threshold of the baseline corrected total ion chromatogram (TIC) and refining the boundaries of the regions by local minimum nearby without manual specified parameters for evaluating threshold. Third, IDMass determines the chemical rank by a two-layer local maximum method with peak picking using continuous wavelet transform to better separate peaks from different components. The chemical rank determining method is able to detect different components with similar spectrum sensitively. Forth, IDMass uses optimal exponentially modified Gaussian (EMG) model with the particle swarm optimization (PSO) to extracts individual components without manual specify the initial value for evaluating the eluted shape. IDMass uses the peak shape information as a major constraint and it is able to extract purer components than multivariate curve resolution (MCR) approaches especially in the case that co-eluted compounds with similar spectrum. However, some eluted peaks with bad shape caused by saturation of the mass spectrometer detector limit performance of IDMass but can be resolved by sample dilution. Last, by identifying compounds sequentially, IDMass can integrate the result into a peak table automatically for further statistical analysis. The performance of IDMass was tested in a data set containing 76 standard mixtures; the recall, precision and F-score were 0.92, 0.81 and 0.86, respectively. IDMass was successfully used to quantify the identified compounds in the 76 standard mixtures. Y. Jane Tseng 曾宇鳳 2013 學位論文 ; thesis 61 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 生醫電子與資訊學研究所 === 102 === Gas chromatography / time of flight mass spectrometer (GC/TOF-MS) has become an important technique for metabolomics. We developed IDMass, a novel algorithm that accurately and sensitively extract and identify the individual components in GC/TOF-MS samples in this study. IDMass comprises five main steps: noise reduction, deconvolution window determination, chemical rank determination, component extraction and identification. First, by subtracting detector noise in mass dimension, resulting peaks generated by IDMass noise reduction step demonstrates to have better shapes and also improve the identification result. Second, IDMass detects peak regions by calculating a threshold of the baseline corrected total ion chromatogram (TIC) and refining the boundaries of the regions by local minimum nearby without manual specified parameters for evaluating threshold. Third, IDMass determines the chemical rank by a two-layer local maximum method with peak picking using continuous wavelet transform to better separate peaks from different components. The chemical rank determining method is able to detect different components with similar spectrum sensitively. Forth, IDMass uses optimal exponentially modified Gaussian (EMG) model with the particle swarm optimization (PSO) to extracts individual components without manual specify the initial value for evaluating the eluted shape. IDMass uses the peak shape information as a major constraint and it is able to extract purer components than multivariate curve resolution (MCR) approaches especially in the case that co-eluted compounds with similar spectrum. However, some eluted peaks with bad shape caused by saturation of the mass spectrometer detector limit performance of IDMass but can be resolved by sample dilution. Last, by identifying compounds sequentially, IDMass can integrate the result into a peak table automatically for further statistical analysis. The performance of IDMass was tested in a data set containing 76 standard mixtures; the recall, precision and F-score were 0.92, 0.81 and 0.86, respectively. IDMass was successfully used to quantify the identified compounds in the 76 standard mixtures.
author2 Y. Jane Tseng
author_facet Y. Jane Tseng
Yu-Yen Chung
鍾宇彥
author Yu-Yen Chung
鍾宇彥
spellingShingle Yu-Yen Chung
鍾宇彥
IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS
author_sort Yu-Yen Chung
title IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS
title_short IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS
title_full IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS
title_fullStr IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS
title_full_unstemmed IDMass: Noise Reduction, Component Extraction, and Identification Processing Toolkit for GC/TOF-MS
title_sort idmass: noise reduction, component extraction, and identification processing toolkit for gc/tof-ms
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/26300398732089206059
work_keys_str_mv AT yuyenchung idmassnoisereductioncomponentextractionandidentificationprocessingtoolkitforgctofms
AT zhōngyǔyàn idmassnoisereductioncomponentextractionandidentificationprocessingtoolkitforgctofms
AT yuyenchung qùzàochéngfēnfēnlíjíbiànshíyúqìxiāngcéngxīchuànliánfēixíngshíjiānzhìpǔyízhīgōngjùtàojiàn
AT zhōngyǔyàn qùzàochéngfēnfēnlíjíbiànshíyúqìxiāngcéngxīchuànliánfēixíngshíjiānzhìpǔyízhīgōngjùtàojiàn
_version_ 1718200016451928064