High-dimensional Bayesian clustering with variable selection: the R package bclust
|
Authors
|
Nia V.P., Davison A.C.
|
Source
|
Journal of Statistical Software - 2012 - Volume: 47 - Issue: 0
|
Abstract
|
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a parametric spike-and-slab Bayesian model to downweight the effect of noise variables and to quantify the importance of each variable in agglomerative clustering. We take advantage of the existence of closed-form marginal distributions to estimate the model hyper-parameters using empirical Bayes, thereby yielding a fully automatic method. We discuss computational problems arising in implementation of the procedure and illustrate the usefulness of the package through examples.
|
Keywords
|
Agglomerative clustering; Bayesian clustering; Bayesian variable selection; Dendrogram; Hierarchical clustering; R; Spike-and-slab model
|
Address
|
Department of Mathematics and Industrial Engineering, Ecole Polytechnique de Montréal, 2900 Edouard-Montpetit, H3T 1J4 Montréal, Canada; Ecole Polytechnique Fédérale de Lausanne, EPFL-FSB-MATHAA-STAT, Station 8, 1015 Lausanne, Switzerland