Prediction intervals for overdispersed binomial data with application to historical controls

Max Menssen; Frank Schaarschmidt

doi:10.1002/sim.8124

Details

Original language	English
Pages (from-to)	2652-2663
Number of pages	12
Journal	Statistics in medicine
Volume	38
Issue number	14
Early online date	5 Mar 2019
Publication status	Published - 30 Jun 2019

Abstract

Bioassays are highly standardized trials for assessing the impact of a chemical compound on a model organism. In that context, it is standard to compare several treatment groups with an untreated control. If the same type of bioassay is carried out several times, the amount of information about the historical controls rises with every new study. This information can be applied to predict the outcome of one future control using a prediction interval. Since the observations are counts of success out of a given sample size, like mortality or histopathological findings, the data can be assumed to be binomial but may exhibit overdispersion caused by the variability between historical studies. We describe two approaches that account for overdispersion: asymptotic prediction intervals using the quasi-binomial assumption and prediction intervals based on the quantiles of the beta-binomial distribution. Both interval types were α-calibrated using bootstrap methods. For an assessment of the intervals coverage probabilities, a simulation study based on various numbers of historical studies and sample sizes as well as different binomial proportions and varying levels of overdispersion was run. It could be shown that α-calibration can improve the coverage probabilities of both interval types. The coverage probability of the calibrated intervals, calculated based on at least 10 historical studies, was satisfactory close to the nominal 95%. In a last step, the intervals were computed based on a real data set from the NTP homepage, using historical controls from bioassays with the mice strain B6C3F1.

Keywords

alpha-calibration bootstrap, beta-binomial, bioassay, extra binomial variation, quasi-binomial

ASJC Scopus subject areas

Medicine(all)
Epidemiology
Mathematics(all)
Statistics and Probability

Cite this

Prediction intervals for overdispersed binomial data with application to historical controls. / Menssen, Max ; Schaarschmidt, Frank.
In: Statistics in medicine, Vol. 38, No. 14, 30.06.2019, p. 2652-2663.

Research output: Contribution to journal › Article › Research › peer review

Menssen, M & Schaarschmidt, F 2019, 'Prediction intervals for overdispersed binomial data with application to historical controls', Statistics in medicine, vol. 38, no. 14, pp. 2652-2663. https://doi.org/10.1002/sim.8124

Menssen, M., & Schaarschmidt, F. (2019). Prediction intervals for overdispersed binomial data with application to historical controls. Statistics in medicine, 38(14), 2652-2663. https://doi.org/10.1002/sim.8124

Menssen M , Schaarschmidt F. Prediction intervals for overdispersed binomial data with application to historical controls. Statistics in medicine. 2019 Jun 30;38(14):2652-2663. Epub 2019 Mar 5. doi: 10.1002/sim.8124

Menssen, Max ; Schaarschmidt, Frank. / Prediction intervals for overdispersed binomial data with application to historical controls. In: Statistics in medicine. 2019 ; Vol. 38, No. 14. pp. 2652-2663.

Download

@article{697b4e4eedd8484ea3ef390345851a2e,

title = "Prediction intervals for overdispersed binomial data with application to historical controls",

abstract = "Bioassays are highly standardized trials for assessing the impact of a chemical compound on a model organism. In that context, it is standard to compare several treatment groups with an untreated control. If the same type of bioassay is carried out several times, the amount of information about the historical controls rises with every new study. This information can be applied to predict the outcome of one future control using a prediction interval. Since the observations are counts of success out of a given sample size, like mortality or histopathological findings, the data can be assumed to be binomial but may exhibit overdispersion caused by the variability between historical studies. We describe two approaches that account for overdispersion: asymptotic prediction intervals using the quasi-binomial assumption and prediction intervals based on the quantiles of the beta-binomial distribution. Both interval types were α-calibrated using bootstrap methods. For an assessment of the intervals coverage probabilities, a simulation study based on various numbers of historical studies and sample sizes as well as different binomial proportions and varying levels of overdispersion was run. It could be shown that α-calibration can improve the coverage probabilities of both interval types. The coverage probability of the calibrated intervals, calculated based on at least 10 historical studies, was satisfactory close to the nominal 95%. In a last step, the intervals were computed based on a real data set from the NTP homepage, using historical controls from bioassays with the mice strain B6C3F1.",

keywords = "alpha-calibration bootstrap, beta-binomial, bioassay, extra binomial variation, quasi-binomial",

author = "Max Menssen and Frank Schaarschmidt",

note = "Funding information: We want to thank Prof Dr Ludwig Hothorn for giving helpful suggestions and Clemens Buczilowski for his technicalsupport. Furthermore, we want to thank the reviewers for reading the manuscript and for their helpful comments.",

year = "2019",

month = jun,

day = "30",

doi = "10.1002/sim.8124",

language = "English",

volume = "38",

pages = "2652--2663",

journal = "Statistics in medicine",

issn = "0277-6715",

publisher = "John Wiley and Sons Ltd",

number = "14",

}

Download

TY - JOUR

T1 - Prediction intervals for overdispersed binomial data with application to historical controls

AU - Menssen, Max

AU - Schaarschmidt, Frank

N1 - Funding information: We want to thank Prof Dr Ludwig Hothorn for giving helpful suggestions and Clemens Buczilowski for his technicalsupport. Furthermore, we want to thank the reviewers for reading the manuscript and for their helpful comments.

PY - 2019/6/30

Y1 - 2019/6/30

N2 - Bioassays are highly standardized trials for assessing the impact of a chemical compound on a model organism. In that context, it is standard to compare several treatment groups with an untreated control. If the same type of bioassay is carried out several times, the amount of information about the historical controls rises with every new study. This information can be applied to predict the outcome of one future control using a prediction interval. Since the observations are counts of success out of a given sample size, like mortality or histopathological findings, the data can be assumed to be binomial but may exhibit overdispersion caused by the variability between historical studies. We describe two approaches that account for overdispersion: asymptotic prediction intervals using the quasi-binomial assumption and prediction intervals based on the quantiles of the beta-binomial distribution. Both interval types were α-calibrated using bootstrap methods. For an assessment of the intervals coverage probabilities, a simulation study based on various numbers of historical studies and sample sizes as well as different binomial proportions and varying levels of overdispersion was run. It could be shown that α-calibration can improve the coverage probabilities of both interval types. The coverage probability of the calibrated intervals, calculated based on at least 10 historical studies, was satisfactory close to the nominal 95%. In a last step, the intervals were computed based on a real data set from the NTP homepage, using historical controls from bioassays with the mice strain B6C3F1.

AB - Bioassays are highly standardized trials for assessing the impact of a chemical compound on a model organism. In that context, it is standard to compare several treatment groups with an untreated control. If the same type of bioassay is carried out several times, the amount of information about the historical controls rises with every new study. This information can be applied to predict the outcome of one future control using a prediction interval. Since the observations are counts of success out of a given sample size, like mortality or histopathological findings, the data can be assumed to be binomial but may exhibit overdispersion caused by the variability between historical studies. We describe two approaches that account for overdispersion: asymptotic prediction intervals using the quasi-binomial assumption and prediction intervals based on the quantiles of the beta-binomial distribution. Both interval types were α-calibrated using bootstrap methods. For an assessment of the intervals coverage probabilities, a simulation study based on various numbers of historical studies and sample sizes as well as different binomial proportions and varying levels of overdispersion was run. It could be shown that α-calibration can improve the coverage probabilities of both interval types. The coverage probability of the calibrated intervals, calculated based on at least 10 historical studies, was satisfactory close to the nominal 95%. In a last step, the intervals were computed based on a real data set from the NTP homepage, using historical controls from bioassays with the mice strain B6C3F1.

KW - alpha-calibration bootstrap

KW - beta-binomial

KW - bioassay

KW - extra binomial variation

KW - quasi-binomial

UR - http://www.scopus.com/inward/record.url?scp=85062514015&partnerID=8YFLogxK

U2 - 10.1002/sim.8124

DO - 10.1002/sim.8124

M3 - Article

C2 - 30835886

AN - SCOPUS:85062514015

VL - 38

SP - 2652

EP - 2663

JO - Statistics in medicine

JF - Statistics in medicine

SN - 0277-6715

IS - 14

ER -

Research@Leibniz University

Prediction intervals for overdispersed binomial data with application to historical controls

Authors

Research Organisations

Details

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

Cytotoxic and proliferation-inhibitory activity of natural and synthetic fungal tropolone sesquiterpenoids in various cell lines

A Multifunctional Nanostructured Hydrogel as a Platform for Deciphering Niche Interactions of Hematopoietic Stem and Progenitor Cells

Three Arabidopsis UMP kinases have different roles in pyrimidine nucleotide biosynthesis and (deoxy)CMP salvage

Prediction Intervals for Overdispersed Poisson Data and Their Application in Medical and Pre-Clinical Quality Control

Simultaneous Inference Using Multiple Marginal Models

Cytotoxic and proliferation-inhibitory activity of natural and synthetic fungal tropolone sesquiterpenoids in various cell lines

A Multifunctional Nanostructured Hydrogel as a Platform for Deciphering Niche Interactions of Hematopoietic Stem and Progenitor Cells

Three Arabidopsis UMP kinases have different roles in pyrimidine nucleotide biosynthesis and (deoxy)CMP salvage

Prediction Intervals for Overdispersed Poisson Data and Their Application in Medical and Pre-Clinical Quality Control

Simultaneous Inference Using Multiple Marginal Models

Cytotoxic and proliferation-inhibitory activity of natural and synthetic fungal tropolone sesquiterpenoids in various cell lines