Cloud Parallel Processing of Tandem Mass Spectrometry Based Proteomics Data

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

Cloud Parallel Processing of Tandem Mass Spectrometry Based Proteomics Data

详细信息查看全文

作者：Yassene Mohammed ; Ekaterina Mostovenko ; Alex A. Henneman ; Rob J. Marissen ; Andr茅 M. Deelder ; Magnus Palmblad
刊名：Journal of Proteome Research
出版年：2012
出版时间：October 5, 2012
年：2012
卷：11
期：10
页码：5101-5108
全文大小：465K
年卷期：v.11,no.10(October 5, 2012)
ISSN：1535-3907

文摘

Data analysis in mass spectrometry based proteomics struggles to keep pace with the advances in instrumentation and the increasing rate of data acquisition. Analyzing this data involves multiple steps requiring diverse software, using different algorithms and data formats. Speed and performance of the mass spectral search engines are continuously improving, although not necessarily as needed to face the challenges of acquired big data. Improving and parallelizing the search algorithms is one possibility; data decomposition presents another, simpler strategy for introducing parallelism. We describe a general method for parallelizing identification of tandem mass spectra using data decomposition that keeps the search engine intact and wraps the parallelization around it. We introduce two algorithms for decomposing mzXML files and recomposing resulting pepXML files. This makes the approach applicable to different search engines, including those relying on sequence databases and those searching spectral libraries. We use cloud computing to deliver the computational power and scientific workflow engines to interface and automate the different processing steps. We show how to leverage these technologies to achieve faster data analysis in proteomics and present three scientific workflows for parallel database as well as spectral library search using our data decomposition programs, X!Tandem and SpectraST.

Keywords:

proteomics; mass spectrometry; scientific workflow; data decomposition

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700