Abstract
The R package sharp (Stability-enHanced Approaches using Resampling Procedures) provides an integrated framework for stability-enhanced variable selection, graphical modeling and clustering. In stability selection, a feature selection algorithm is combined with a resampling technique to estimate feature selection probabilities. Features with selection proportions above a threshold are considered stably selected. Similarly, a clustering algorithm is applied on multiple subsamples of items to compute co-membership proportions in consensus clustering. The consensus clusters are obtained by clustering using co-membership proportions as a measure of similarity. We calibrate the hyper-parameters of stability selection (or consensus clustering) jointly by maximizing a consensus score calculated under the null hypothesis of equiprobability of selection (or co-membership), which characterizes instability. The package offers flexibility in the modeling, includes diagnostic and visualization tools, and allows for parallelization.
| Original language | English |
|---|---|
| Pages (from-to) | 1-27 |
| Number of pages | 27 |
| Journal | Journal of Statistical Software |
| Volume | 112 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - 1 Jan 2025 |
| Externally published | Yes |
Keywords
- R
- calibration
- consensus clustering
- graphical modeling
- regularization
- stability selection
- structural equation modeling
- variable selection
Fingerprint
Dive into the research topics of 'Stability Selection and Consensus Clustering in R: The R Package sharp'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver