Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage

Summary: Data-independent acquisition mass spectrometry (DIA-MS) is a powerful technique that enables relatively deep proteomic profiling with superior quantification reproducibility. DIA data mining predominantly relies on a spectral library of sufficient proteome coverage that, in most cases, is b...

Full description

Bibliographic Details
Main Authors: Ronghui Lou, Pan Tang, Kang Ding, Shanshan Li, Cuiping Tian, Yunxia Li, Suwen Zhao, Yaoyang Zhang, Wenqing Shui
Format: Article
Language:English
Published: Elsevier 2020-03-01
Series:iScience
Online Access:http://www.sciencedirect.com/science/article/pii/S2589004220300870
id doaj-70294ed79e2148b7900943eb9b484b68
record_format Article
spelling doaj-70294ed79e2148b7900943eb9b484b682020-11-25T02:28:23ZengElsevieriScience2589-00422020-03-01233Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome CoverageRonghui Lou0Pan Tang1Kang Ding2Shanshan Li3Cuiping Tian4Yunxia Li5Suwen Zhao6Yaoyang Zhang7Wenqing Shui8iHuman Institute, ShanghaiTech University, Shanghai 201210, China; School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; University of Chinese Academy of Sciences, Beijing 100049, ChinaiHuman Institute, ShanghaiTech University, Shanghai 201210, China; School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; University of Chinese Academy of Sciences, Beijing 100049, ChinaiHuman Institute, ShanghaiTech University, Shanghai 201210, China; School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; University of Chinese Academy of Sciences, Beijing 100049, ChinaiHuman Institute, ShanghaiTech University, Shanghai 201210, ChinaiHuman Institute, ShanghaiTech University, Shanghai 201210, ChinaInterdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai 201210, ChinaiHuman Institute, ShanghaiTech University, Shanghai 201210, China; School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; Corresponding authorInterdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai 201210, China; Corresponding authoriHuman Institute, ShanghaiTech University, Shanghai 201210, China; School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; Corresponding authorSummary: Data-independent acquisition mass spectrometry (DIA-MS) is a powerful technique that enables relatively deep proteomic profiling with superior quantification reproducibility. DIA data mining predominantly relies on a spectral library of sufficient proteome coverage that, in most cases, is built on data-dependent acquisition-based analysis of the same sample. To expand the proteome coverage for a pre-determined protein family, we report herein on the construction of a hybrid spectral library that supplements a DIA experiment-derived library with a protein family-targeted virtual library predicted by deep learning. Leveraging this DIA hybrid library substantially deepens the coverage of three transmembrane protein families (G protein-coupled receptors, ion channels, and transporters) in mouse brain tissues with increases in protein identification of 37%–87% and peptide identification of 58%–161%. Moreover, of the 412 novel GPCR peptides exclusively identified with the DIA hybrid library strategy, 53.6% were validated as present in mouse brain tissues based on orthogonal experimental measurement. : Analytical Chemistry; Biological Sciences; Classification of Proteins; Proteomics Subject Areas: Analytical Chemistry, Biological Sciences, Classification of Proteins, Proteomicshttp://www.sciencedirect.com/science/article/pii/S2589004220300870
collection DOAJ
language English
format Article
sources DOAJ
author Ronghui Lou
Pan Tang
Kang Ding
Shanshan Li
Cuiping Tian
Yunxia Li
Suwen Zhao
Yaoyang Zhang
Wenqing Shui
spellingShingle Ronghui Lou
Pan Tang
Kang Ding
Shanshan Li
Cuiping Tian
Yunxia Li
Suwen Zhao
Yaoyang Zhang
Wenqing Shui
Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage
iScience
author_facet Ronghui Lou
Pan Tang
Kang Ding
Shanshan Li
Cuiping Tian
Yunxia Li
Suwen Zhao
Yaoyang Zhang
Wenqing Shui
author_sort Ronghui Lou
title Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage
title_short Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage
title_full Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage
title_fullStr Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage
title_full_unstemmed Hybrid Spectral Library Combining DIA-MS Data and a Targeted Virtual Library Substantially Deepens the Proteome Coverage
title_sort hybrid spectral library combining dia-ms data and a targeted virtual library substantially deepens the proteome coverage
publisher Elsevier
series iScience
issn 2589-0042
publishDate 2020-03-01
description Summary: Data-independent acquisition mass spectrometry (DIA-MS) is a powerful technique that enables relatively deep proteomic profiling with superior quantification reproducibility. DIA data mining predominantly relies on a spectral library of sufficient proteome coverage that, in most cases, is built on data-dependent acquisition-based analysis of the same sample. To expand the proteome coverage for a pre-determined protein family, we report herein on the construction of a hybrid spectral library that supplements a DIA experiment-derived library with a protein family-targeted virtual library predicted by deep learning. Leveraging this DIA hybrid library substantially deepens the coverage of three transmembrane protein families (G protein-coupled receptors, ion channels, and transporters) in mouse brain tissues with increases in protein identification of 37%–87% and peptide identification of 58%–161%. Moreover, of the 412 novel GPCR peptides exclusively identified with the DIA hybrid library strategy, 53.6% were validated as present in mouse brain tissues based on orthogonal experimental measurement. : Analytical Chemistry; Biological Sciences; Classification of Proteins; Proteomics Subject Areas: Analytical Chemistry, Biological Sciences, Classification of Proteins, Proteomics
url http://www.sciencedirect.com/science/article/pii/S2589004220300870
work_keys_str_mv AT ronghuilou hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT pantang hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT kangding hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT shanshanli hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT cuipingtian hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT yunxiali hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT suwenzhao hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT yaoyangzhang hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
AT wenqingshui hybridspectrallibrarycombiningdiamsdataandatargetedvirtuallibrarysubstantiallydeepenstheproteomecoverage
_version_ 1724838441018982400