Scholars@Duke publication: Testing a point null hypothesis: The irreconcilability of P values and evidence

Testing a point null hypothesis: The irreconcilability of P values and evidence

Publication , Journal Article

Berger, JO; Sellke, T

Published in: Journal of the American Statistical Association

January 1, 1987

The problem of testing a point null hypothesis (or a “small interval” null hypothesis) is considered. of interest is the relationship between the P value (or observed significance level) and conditional and Bayesian measures of evidence against the null hypothesis. Although one might presume that a small P value indicates the presence of strong evidence against the null, such is not necessarily the case. Expanding on earlier work [especially Edwards, Lindman, and Savage (1963) and Dickey (1977)], it is shown that actual evidence against a null (as measured, say, by posterior probability or comparative likelihood) can differ by an order of magnitude from the P value. For instance, data that yield a P value of .05, when testing a normal mean, result in a posterior probability of the null of at least .30 for any objective prior distribution. (“Objective” here means that equal prior weight is given the two hypotheses and that the prior is symmetric and nonincreasing away from the null; other definitions of “objective” will be seen to yield qualitatively similar results.) The overall conclusion is that P values can be highly misleading measures of the evidence provided by the data against the null hypothesis. © 1976 Taylor & Francis Group, LLC.

Duke Scholars

Author James O. Berger Statistical Science

Published In

Journal of the American Statistical Association

DOI

10.1080/01621459.1987.10478397

EISSN

1537-274X

ISSN

0162-1459

Publication Date

January 1, 1987

Volume

Issue

397

Start / End Page

112 / 122

Related Subject Headings

Statistics & Probability
4905 Statistics
3802 Econometrics
1603 Demography
1403 Econometrics
0104 Statistics

Citation

APA

Chicago

ICMJE

MLA

NLM

Berger, J. O., & Sellke, T. (1987). Testing a point null hypothesis: The irreconcilability of P values and evidence. Journal of the American Statistical Association, 82(397), 112–122. https://doi.org/10.1080/01621459.1987.10478397

Berger, J. O., and T. Sellke. “Testing a point null hypothesis: The irreconcilability of P values and evidence.” Journal of the American Statistical Association 82, no. 397 (January 1, 1987): 112–22. https://doi.org/10.1080/01621459.1987.10478397.

Berger JO, Sellke T. Testing a point null hypothesis: The irreconcilability of P values and evidence. Journal of the American Statistical Association. 1987 Jan 1;82(397):112–22.

Berger, J. O., and T. Sellke. “Testing a point null hypothesis: The irreconcilability of P values and evidence.” Journal of the American Statistical Association, vol. 82, no. 397, Jan. 1987, pp. 112–22. Scopus, doi:10.1080/01621459.1987.10478397.

Berger JO, Sellke T. Testing a point null hypothesis: The irreconcilability of P values and evidence. Journal of the American Statistical Association. 1987 Jan 1;82(397):112–122.

Published In

Journal of the American Statistical Association

DOI

10.1080/01621459.1987.10478397

EISSN

1537-274X

ISSN

0162-1459

Publication Date

January 1, 1987

Volume

Issue

397

Start / End Page

112 / 122

Related Subject Headings

Statistics & Probability
4905 Statistics
3802 Econometrics
1603 Demography
1403 Econometrics
0104 Statistics