What is mzXML good for?

Published

Journal Article (Review)

mzXML (extensible markup language) is one of the pioneering data formats for mass spectrometry-based proteomics data collection. It is an open data format that has benefited and evolved as a result of the input of many groups, and it continues to evolve. Due to its dynamic history, its structure, purpose and applicability have all changed with time, meaning that groups that have looked at the standard at different points during its evolution have differing impressions of the usefulness of mzXML. In discussing mzXML, it is important to understand what mzXML is not. First, mzXML does not capture the raw data. Second, mzXML is not sufficient for regulatory submission. Third, mzXML is not optimized for computation and, finally, mzXML does not capture the experiment design. In general, it is the authors' opinion that XML is not a panacea for bioinformatics or a substitute for good data representation, and groups that want to use mzXML (or other XML-based representations) directly for data storage or computation will encounter performance and scalability problems. With these limitations in mind, the authors conclude that mzXML is, nonetheless, an indispensable data exchange format for proteomics.

Full Text

Duke Authors

Cited Authors

  • Lin, SM; Zhu, L; Winter, AQ; Sasinowski, M; Kibbe, WA

Published Date

  • December 2005

Published In

Volume / Issue

  • 2 / 6

Start / End Page

  • 839 - 845

PubMed ID

  • 16307524

Pubmed Central ID

  • 16307524

Electronic International Standard Serial Number (EISSN)

  • 1744-8387

Digital Object Identifier (DOI)

  • 10.1586/14789450.2.6.839

Language

  • eng

Conference Location

  • England