Estimating infection prevalence: Best practices and their theoretical underpinnings.

Published

Journal Article

Accurately estimating infection prevalence is fundamental to the study of population health, disease dynamics, and infection risk factors. Prevalence is estimated as the proportion of infected individuals ("individual-based estimation"), but is also estimated as the proportion of samples in which evidence of infection is detected ("anonymous estimation"). The latter method is often used when researchers lack information on individual host identity, which can occur during noninvasive sampling of wild populations or when the individual that produced a fecal sample is unknown. The goal of this study was to investigate biases in individual-based versus anonymous prevalence estimation theoretically and to test whether mathematically derived predictions are evident in a comparative dataset of gastrointestinal helminth infections in nonhuman primates. Using a mathematical model, we predict that anonymous estimates of prevalence will be lower than individual-based estimates when (a) samples from infected individuals do not always contain evidence of infection and/or (b) false negatives occur. The mathematical model further predicts that no difference in bias should exist between anonymous estimation and individual-based estimation when one sample is collected from each individual. Using data on helminth parasites of primates, we find that anonymous estimates of prevalence are significantly and substantially (12.17%) lower than individual-based estimates of prevalence. We also observed that individual-based estimates of prevalence from studies employing single sampling are on average 6.4% higher than anonymous estimates, suggesting a bias toward sampling infected individuals. We recommend that researchers use individual-based study designs with repeated sampling of individuals to obtain the most accurate estimate of infection prevalence. Moreover, to ensure accurate interpretation of their results and to allow for prevalence estimates to be compared among studies, it is essential that authors explicitly describe their sampling designs and prevalence calculations in publications.
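
The two theoretical predictions in the abstract can be illustrated with a minimal sketch. This is not the authors' model, which the abstract does not spell out. It assumes a true prevalence `p`, a single per-sample detection probability `d` for infected hosts (combining intermittent shedding and test sensitivity), `n` independent samples per individual, and no false positives. All function and parameter names are hypothetical.

```python
def expected_anonymous(p: float, d: float) -> float:
    """Expected anonymous estimate: the fraction of samples that test positive.

    Assumption: every sample comes from an infected host with probability p
    and, if so, shows evidence of infection with probability d.
    """
    return p * d


def expected_individual(p: float, d: float, n: int) -> float:
    """Expected individual-based estimate with n samples per individual.

    Assumption: an individual is scored infected if at least one of its n
    independent samples is positive, which happens with probability
    1 - (1 - d)**n for an infected host.
    """
    return p * (1.0 - (1.0 - d) ** n)


if __name__ == "__main__":
    p, d = 0.4, 0.5  # illustrative values only, not taken from the study
    print("anonymous:", expected_anonymous(p, d))
    for n in (1, 3, 10):
        print(f"individual-based, n={n}:", expected_individual(p, d, n))
```

Under these assumptions the two estimators coincide when `n = 1`, and the individual-based estimate approaches `p` as `n` grows while the anonymous estimate stays at `p * d`. That matches the abstract's predictions of no difference under single sampling and a lower anonymous estimate when detection is imperfect.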

Cited Authors

  • Miller, IF; Schneider-Crease, I; Nunn, CL; Muehlenbein, MP

Published Date

  • July 2018

Volume / Issue

  • 8 / 13

Start / End Page

  • 6738 - 6747

PubMed ID

  • 30038770

Pubmed Central ID

  • 30038770

Electronic International Standard Serial Number (EISSN)

  • 2045-7758

International Standard Serial Number (ISSN)

  • 2045-7758

Digital Object Identifier (DOI)

  • 10.1002/ece3.4179

Language

  • eng