The objective of this study was to assess the potential of combining graph learning methods with latent variable estimation methods for mining clinically useful information from observational clinical data sets. Materials and Methods: The data set contained self-reported measures of psychopathology symptoms from a clinical sample receiving treatment for alcohol use disorder. We used the traditional graph learning methods: Graphical Least Absolute Shrinkage and Selection Operator, and Friedman's hill climbing algorithm; traditional latent variable estimation method factor analysis; recently developed graph learning method Greedy Fast Causal Inference; and recently developed latent variable estimation method Find One Factor Clusters. Methods were assessed qualitatively by the content of their findings. Results: Recently developed graphical methods identified potential latent variables (ie, not represented in the model) influencing particular scores. Recently developed latent effect estimation methods identified plausible cross-score loadings that were not found with factor analysis. A graphical analysis of individual items identified a mistake in wording on 1 questionnaire and provided further evidence that certain scores are not reflective of indirectly measured common causes. Discussion and Conclusion: Our findings suggest that a combination of Greedy Fast Causal Inference and Find One Factor Clusters can enhance the evidence-based information yield from psychopathological constructs and questionnaires. Traditional methods provided some of the same information but missed other important findings. These conclusions point the way toward more informative interrogations of existing and future data sets than are commonly employed at present.
|Original language||English (US)|
|Number of pages||10|
|Journal||Journal of the American Medical Informatics Association|
|State||Published - Jun 24 2019|
PubMed: MeSH publication types
- Journal Article
- Research Support, N.I.H., Extramural