Annotation of phenotypic diversity: decoupling data curation and ontology curation using Phenex.

Journal Article (Journal Article)

Background

Phenex (http://phenex.phenoscape.org/) is a desktop application for semantically annotating the phenotypic character matrix datasets common in evolutionary biology. Since its initial publication, we have added new features that address several major bottlenecks in the efficiency of the phenotype curation process: allowing curators during the data curation phase to provisionally request terms that are not yet available from a relevant ontology; supporting quality control against annotation guidelines to reduce later manual review and revision; and enabling the sharing of files for collaboration among curators.

Results

We decoupled data annotation from ontology development by creating an Ontology Request Broker (ORB) within Phenex. Curators can use the ORB to request a provisional term for use in data annotation; the provisional term can be automatically replaced with a permanent identifier once the term is added to an ontology. We added a set of annotation consistency checks to prevent common curation errors, reducing the need for later correction. We facilitated collaborative editing by improving the reliability of Phenex when used with online folder sharing services, via file change monitoring and continual autosave.

Conclusions

With the addition of these new features, and in particular the Ontology Request Broker, Phenex users have been able to focus more effectively on data annotation. Phenoscape curators using Phenex have reported a smoother annotation workflow, with much reduced interruptions from ontology maintenance and file management issues.

Full Text

Duke Authors

Cited Authors

  • Balhoff, JP; Dahdul, WM; Dececchi, TA; Lapp, H; Mabee, PM; Vision, TJ

Published Date

  • January 2014

Published In

Volume / Issue

  • 5 / 1

Start / End Page

  • 45 -

PubMed ID

  • 25411634

Pubmed Central ID

  • PMC4236444

Electronic International Standard Serial Number (EISSN)

  • 2041-1480

International Standard Serial Number (ISSN)

  • 2041-1480

Digital Object Identifier (DOI)

  • 10.1186/2041-1480-5-45

Language

  • eng