Foreword

MSnbase is under active development; current functionality is evolving and new features will be added. This software is free and open-source software. If you use it, please support the project by citing it in publications:

Laurent Gatto and Kathryn S. Lilley. MSnbase - an R/Bioconductor package for isobaric tagged mass spectrometry data visualization, processing and quantitation. Bioinformatics 28, 288-289 (2011).

Questions and bugs

For bugs, typos, suggestions or other questions, please file an issue in our tracking system (https://github.com/lgatto/MSnbase/issues) providing as much information as possible, a reproducible example and the output of sessionInfo().

If you don’t have a GitHub account or wish to reach a broader audience for general questions about proteomics analysis using R, you may want to use the Bioconductor support site: https://support.bioconductor.org/.

1 Overview

MSnbase’s aims are to facilitate the reproducible analysis of mass spectrometry data within the R environment, from raw data import and processing, feature quantification, quantification and statistical analysis of the results (Gatto and Lilley 2012). Data import functions for several formats are provided and intermediate or final results can also be saved or exported. These capabilities are presented below.

2 Data input

Raw data

Data stored in one of the published XML-based formats. i.e. mzXML (Pedrioli et al. 2004), mzData (Orchard et al. 2007) or mzML (Martens et al. 2010), can be imported with the readMSData method, which makes use of the mzR package to create MSnExp objects. The files can be in profile or centroided mode. See ?readMSData for details.

Peak lists

Peak lists in the mgf format1 http://www.matrixscience.com/help/data_file_help.html can be imported using the readMgfData. In this case, the peak data has generally been pre-processed by other software. See ?readMgfData for details.

Quantitation data

Third party software can be used to generate quantitative data and exported as a spreadsheet (generally comma or tab separated format). This data as well as any additional meta-data can be imported with the readMSnSet function. See ?readMSnSet for details.

MSnbase also supports the mzTab format2 https://github.com/HUPO-PSI/mzTab, a light-weight, tab-delimited file format for proteomics data developed within the Proteomics Standards Initiative (PSI). mzTab files can be read into R with readMzTabData to create and MSnSet instance.