Recent developments in model-based clustering with applications

Volodymyr Melnykov, Semhar Michael, Igor Melnykov

Research output: Chapter in Book/Report/Conference proceedingChapter

5 Scopus citations

Abstract

Model–based clustering is a popular technique relying on the notion of finite mixture models that proved to be efficient in modeling heterogeneity in data. The underlying idea is to model each data group by a particular mixture component. This relationship between mixed distributions and clusters forms an attractive interpretation of groups: each cluster is assumed to be a sample from the corresponding distribution. In practice, however, there are many issues that have to be accounted for by the researcher. The area of model–based clustering is very dynamic and rapidly developing, with many questions yet to be answered. In this paper, we review and discuss the latest developments in model–based clustering including semi–supervised clustering, non–parametric mixture modeling, choice of initialization strategies, merging mixture components for clustering, handling spurious solutions, and assessing variability of obtained partitions. We also demonstrate the utility of model–based clustering by considering several challenging applications to real–life problems.

Original languageEnglish (US)
Title of host publicationPartitional Clustering Algorithms
PublisherSpringer International Publishing
Pages1-39
Number of pages39
ISBN (Electronic)9783319092591
ISBN (Print)9783319092584
DOIs
StatePublished - Jan 1 2015
Externally publishedYes

Keywords

  • Finite mixture model
  • Initialization strategy
  • Merging mixture components
  • Model-based clustering
  • Semi-supervised clustering
  • Spurious solutions
  • Variable selection

Cite this

Melnykov, V., Michael, S., & Melnykov, I. (2015). Recent developments in model-based clustering with applications. In Partitional Clustering Algorithms (pp. 1-39). Springer International Publishing. https://doi.org/10.1007/978-3-319-09259-1_1