Skip to main navigation Skip to search Skip to main content

Exploring and Comparing Unsupervised Clustering Algorithms

  • Marc Lavielle
  • , Philip D. Waggoner

Research output: Contribution to journalArticlepeer-review

Abstract

One of the most widely used approaches to explore and understand non-random structure in data in a largely assumption-free manner is clustering. In this paper, we detail two original Shiny apps written in R, openly developed at Github, and archived at Zenodo, for exploring and comparing major unsupervised algorithms for clustering applications: k-means and Gaussian mixture models via Expectation-Maximization. The first app leverages simulated data and the second uses Fisher's Iris data set to visually and numerically compare the clustering algorithms using data familiar to many applied researchers. In addition to being valuable tools for comparing these clustering techniques, the open source architecture of our Shiny apps allows for wide engagement and extension by the broader open science community, such as including different data sets and algorithms.

Original languageEnglish
Pages (from-to)xx-xx
JournalJournal of Open Research Software
Volume8
Issue number1
DOIs
Publication statusPublished - 1 Jan 2020
Externally publishedYes

Keywords

  • EM
  • Gaussian mixture models
  • R
  • Shiny
  • k-means
  • unsupervised clustering

Fingerprint

Dive into the research topics of 'Exploring and Comparing Unsupervised Clustering Algorithms'. Together they form a unique fingerprint.

Cite this