What are data? The many kinds of data and their implications for data re-use


Journal Article

One key feature of e-science is to encourage archiving and release of data so that they are available in digitally-processable forms for re-use almost from the point of collection. This assumes particular processes of translation by which data can be made visible in transportable and intelligible forms. It also requires mechanisms by which data quality and provenance can be trusted once "disconnected" from their producers. By analyzing the "life stages" of data in four academic projects, we show that these requirements create difficulties for disciplines where tacit knowledge and craft-like methods are deeply embedded in researchers, as well as for disciplines producing non-digital heterogeneous data or data derived from people rather than from material phenomena. While craft practices and tacit knowledges are a feature of most scientific endeavors, some disciplines currently appear more inclined to attempt to formalize or at least record these knowledges. We discuss the implications this has for the e-science objective of widespread data re-use. © 2007 International Communication Association.

Full Text

Duke Authors

Cited Authors

  • Carlson, S; Anderson, B

Published Date

  • January 1, 2007

Published In

Volume / Issue

  • 12 / 2

Start / End Page

  • 635 - 651

Electronic International Standard Serial Number (EISSN)

  • 1083-6101

International Standard Serial Number (ISSN)

  • 1083-6101

Digital Object Identifier (DOI)

  • 10.1111/j.1083-6101.2007.00342.x

Citation Source

  • Scopus