Skip to main content
Journal cover image

NeXML: rich, extensible, and verifiable representation of comparative data and metadata.

Publication ,  Journal Article
Vos, RA; Balhoff, JP; Caravas, JA; Holder, MT; Lapp, H; Maddison, WP; Midford, PE; Priyam, A; Sukumaran, J; Xia, X; Stoltzfus, A
Published in: Systematic biology
July 2012

In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveying reusable data are particularly acute in regard to evolutionary comparative analysis, which comprises an ever-expanding list of data types, methods, research aims, and subdisciplines. To facilitate interoperability in evolutionary comparative analysis, we present NeXML, an XML standard (inspired by the current standard, NEXUS) that supports exchange of richly annotated comparative data. NeXML defines syntax for operational taxonomic units, character-state matrices, and phylogenetic trees and networks. Documents can be validated unambiguously. Importantly, any data element can be annotated, to an arbitrary degree of richness, using a system that is both flexible and rigorous. We describe how the use of NeXML by the TreeBASE and Phenoscape projects satisfies user needs that cannot be satisfied with other available file formats. By relying on XML Schema Definition, the design of NeXML facilitates the development and deployment of software for processing, transforming, and querying documents. The adoption of NeXML for practical use is facilitated by the availability of (1) an online manual with code samples and a reference to all defined elements and attributes, (2) programming toolkits in most of the languages used commonly in evolutionary informatics, and (3) input-output support in several widely used software applications. An active, open, community-based development process enables future revision and expansion of NeXML.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Systematic biology

DOI

EISSN

1076-836X

ISSN

1063-5157

Publication Date

July 2012

Volume

61

Issue

4

Start / End Page

675 / 689

Related Subject Headings

  • Software
  • Programming Languages
  • Phylogeny
  • Models, Biological
  • Informatics
  • Evolutionary Biology
  • Computational Biology
  • Classification
  • Biological Evolution
  • Biodiversity
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Vos, R. A., Balhoff, J. P., Caravas, J. A., Holder, M. T., Lapp, H., Maddison, W. P., … Stoltzfus, A. (2012). NeXML: rich, extensible, and verifiable representation of comparative data and metadata. Systematic Biology, 61(4), 675–689. https://doi.org/10.1093/sysbio/sys025
Vos, Rutger A., James P. Balhoff, Jason A. Caravas, Mark T. Holder, Hilmar Lapp, Wayne P. Maddison, Peter E. Midford, et al. “NeXML: rich, extensible, and verifiable representation of comparative data and metadata.Systematic Biology 61, no. 4 (July 2012): 675–89. https://doi.org/10.1093/sysbio/sys025.
Vos RA, Balhoff JP, Caravas JA, Holder MT, Lapp H, Maddison WP, et al. NeXML: rich, extensible, and verifiable representation of comparative data and metadata. Systematic biology. 2012 Jul;61(4):675–89.
Vos, Rutger A., et al. “NeXML: rich, extensible, and verifiable representation of comparative data and metadata.Systematic Biology, vol. 61, no. 4, July 2012, pp. 675–89. Epmc, doi:10.1093/sysbio/sys025.
Vos RA, Balhoff JP, Caravas JA, Holder MT, Lapp H, Maddison WP, Midford PE, Priyam A, Sukumaran J, Xia X, Stoltzfus A. NeXML: rich, extensible, and verifiable representation of comparative data and metadata. Systematic biology. 2012 Jul;61(4):675–689.
Journal cover image

Published In

Systematic biology

DOI

EISSN

1076-836X

ISSN

1063-5157

Publication Date

July 2012

Volume

61

Issue

4

Start / End Page

675 / 689

Related Subject Headings

  • Software
  • Programming Languages
  • Phylogeny
  • Models, Biological
  • Informatics
  • Evolutionary Biology
  • Computational Biology
  • Classification
  • Biological Evolution
  • Biodiversity