用户名: 密码: 验证码:
Implications of Strain- and Species-Level Sequence Divergence for Community and Isolate Shotgun Proteomic Analysis
详细信息    查看全文
文摘
The recent surge in microbial genomic sequencing, combined with the development of high-throughputliquid chromatography-mass-spectrometry-based (LC/LC-MS/MS) proteomics, has raised the questionof the extent to which genomic information of one strain or environmental sample can be used toprofile proteomes of related strains or samples. Even with decreasing sequencing costs, it remainsimpractical to obtain genomic sequence for every strain or sample analyzed. Here, we evaluate howshotgun proteomics is affected by amino acid divergence between the sample and the genomic databaseusing a probability-based model and a random mutation simulation model constrained by experimentaldata. To assess the effects of nonrandom distribution of mutations, we also evaluated identificationlevels using in silico peptide data from sequenced isolates with average amino acid identities (AAI)varying between 76 and 98%. We compared the predictions to experimental protein identification levelsfor a sample that was evaluated using a database that included genomic information for the dominantorganism and for a closely related variant (95% AAI). The range of models set the boundaries atwhich half of the proteins in a proteomic experiment can be identified to be 77-92% AAI betweenorthologs in the sample and database. Consistent with this prediction, experimental data indicatedloss of half the identifiable proteins at 90% AAI. Additional analysis indicated a 6.4% reduction of theinitial protein coverage per 1% amino acid divergence and total identification loss at 86% AAI.Consequently, shotgun proteomics is capable of cross-strain identifications but avoids most cross-species false positives.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700