Bayesian measures of model complexity and fit

David J. Spiegelhalter, Nicola G. Best, Bradley P. Carlin, Angelika Van Der Linde

Research output: Contribution to journalArticlepeer-review

7616 Scopus citations

Abstract

We consider the problem of comparing complex hierarchical models in which the number of parameters is not clearly defined. Using an information theoretic argument we derive a measure pD for the effective number of parameters in a model as the difference between the posterior mean of the deviance and the deviance at the posterior means of the parameters of interest. In general pD approximately corresponds to the trace of the product of Fisher's information and the posterior covariance, which in normal models is the trace of the 'hat' matrix projecting observations onto fitted values. Its properties in exponential families are explored. The posterior mean deviance is suggested as a Bayesian measure of fit or adequacy, and the contributions of individual observations to the fit and complexity can give rise to a diagnostic plot of deviance residuals against leverages. Adding pD to the posterior mean deviance gives a deviance information criterion for comparing models, which is related to other information criteria and has an approximate decision theoretic justification. The procedure is illustrated in some examples, and comparisons are drawn with alternative Bayesian and classical proposals. Throughout it is emphasized that the quantities required are trivial to compute in a Markov chain Monte Carlo analysis.

Original languageEnglish (US)
Pages (from-to)583-616
Number of pages34
JournalJournal of the Royal Statistical Society. Series B: Statistical Methodology
Volume64
Issue number4
DOIs
StatePublished - 2002

Keywords

  • Bayesian model comparison
  • Decision theory
  • Deviance information criterion
  • Effective number of parameters
  • Hierarchical models
  • Information theory
  • Leverage
  • Markov chain Monte Carlo methods
  • Model dimension

Fingerprint Dive into the research topics of 'Bayesian measures of model complexity and fit'. Together they form a unique fingerprint.

Cite this