Often in clinical dental research, clinical attachment level (CAL) is recorded at several sites throughout the mouth to assess the extent of periodontal disease (PD). One might be interested to quantify PD at the tooth-level via the proportion of diseased sites per tooth type (say, incisors, canines, pre-molars and molars) per subject. However, these studies might consist of relatively diseasefree and highly diseased subjects leading to the proportion responses distributed in the interval [0, 1]. While beta regression (BR) is often the model of choice to assess covariate effects for proportion data, the presence (and/or abundance) of zeros and/or ones makes it inapplicable here because the beta support is defined in the interval (0, 1). Avoiding ad hoc data transformation, we explore the potential of the augmented BR framework which augments the beta density with non-zero masses at zero and one while accounting for the clustering induced. Our classical estimation framework using maximum likelihood utilizes the potential of the SAS R Proc NLMIXED procedure. We explore our methodology via simulation studies and application to a real cross-sectional dataset on PD, and we assess the gain in model fit and parameter estimation over other ad hoc alternatives. This reveals newer insights into risk quantification on clustered proportion responses. Our methods can be implemented using standard SAS software routines. The augmented BR model results in a better fit to clustered periodontal proportion data over the standard beta model. We recommend using it as a parametric alternative for fitting proportion data, and avoid ad hoc data transformation.
|Original language||English (US)|
|Number of pages||18|
|Journal||Journal of Applied Probability and Statistics|
|State||Published - 2017|
Bibliographical noteFunding Information:
The authors thank the Center for Oral Health Research at the Medical University of South Carolina for providing the motivation and the context of this work. They also thank an anonymous reviewer whose constructive comments led to an improved presentation of the manuscript. The work is partially funded by US National Institutes of Health grants R03DE021762, R03DE023372 and R01DE024984.
- Augmented distribution
- Beta regression
- Likelihood functions
- Periodontal disease
- Proc NLMIXED
- Proportion data