public class DENCLUE extends PartitionClustering<double[]>
Clearly, DENCLUE doesn't work on data with uniform distribution. In high dimensional space, the data always look like uniformly distributed because of the curse of dimensionality. Therefore, DENCLUDE doesn't work well on high-dimensional data in general.
k, size, y
OUTLIER
Constructor and Description |
---|
DENCLUE(double[][] data,
double sigma,
int m)
Constructor.
|
Modifier and Type | Method and Description |
---|---|
double[][] |
getDensityAttractors()
Returns the density attractors of cluster.
|
double |
getSigma()
Returns the smooth (standard deviation) parameter in the Gaussian kernel.
|
int |
predict(double[] x)
Cluster a new instance.
|
java.lang.String |
toString() |
getClusterLabel, getClusterSize, getNumClusters, seed, seed
public DENCLUE(double[][] data, double sigma, int m)
data
- the dataset for clustering.sigma
- the smooth parameter in the Gaussian kernel. The user can
choose sigma such that number of density attractors is constant for a
long interval of sigma.m
- the number of selected samples used in the iteration.
This number should be much smaller than the number of data points
to speed up the algorithm. It should also be large enough to capture
the sufficient information of underlying distribution.public double getSigma()
public double[][] getDensityAttractors()
public int predict(double[] x)
Clustering
x
- a new instance.public java.lang.String toString()
toString
in class java.lang.Object