Class/Object

org.clustering4ever.clustering.kcenters.rdd

KMeans

Related Docs: object KMeans | package rdd

Permalink

final case class KMeans[D <: ContinuousDistance](k: Int, metric: D, minShift: Double, maxIterations: Int, persistanceLVL: StorageLevel = StorageLevel.MEMORY_ONLY, customCenters: HashMap[Int, ScalarVector] = ...)(implicit ctV: ClassTag[ScalarVector]) extends KCentersAncestor[ScalarVector, D, KMeansModel[D]] with Product with Serializable

The famous K-Means using a user-defined dissmilarity measure.

k

: number of clusters

metric

: a defined dissimilarity measure

minShift

: minimal threshold under which we consider a centroid has converged

maxIterations

: maximal number of iteration

Linear Supertypes
Product, Equals, KCentersAncestor[ScalarVector, D, KMeansModel[D]], ClusteringAlgorithmDistributed[ScalarVector, KMeansModel[D]], ClusteringAlgorithm, KCommonsSpark[ScalarVector, D], KCommons[ScalarVector, D], ClusteringSharedTypes, KCommonsArgs[ScalarVector, D], MetricArgs[ScalarVector, D], KArgs, MaxIterationsArgs, MinShiftArgs, AlgorithmsArguments, Serializable, Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. KMeans
  2. Product
  3. Equals
  4. KCentersAncestor
  5. ClusteringAlgorithmDistributed
  6. ClusteringAlgorithm
  7. KCommonsSpark
  8. KCommons
  9. ClusteringSharedTypes
  10. KCommonsArgs
  11. MetricArgs
  12. KArgs
  13. MaxIterationsArgs
  14. MinShiftArgs
  15. AlgorithmsArguments
  16. Serializable
  17. Serializable
  18. AnyRef
  19. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new KMeans(k: Int, metric: D, minShift: Double, maxIterations: Int, persistanceLVL: StorageLevel = StorageLevel.MEMORY_ONLY, customCenters: HashMap[Int, ScalarVector] = ...)(implicit ctV: ClassTag[ScalarVector])

    Permalink

    k

    : number of clusters

    metric

    : a defined dissimilarity measure

    minShift

    : minimal threshold under which we consider a centroid has converged

    maxIterations

    : maximal number of iteration

Type Members

  1. final type ClusterID = Int

    Permalink
    Definition Classes
    ClusteringSharedTypes

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final val algorithmID: extensibleAlgorithmNature.KMeans.type

    Permalink
    Definition Classes
    KMeans → ClusteringAlgorithm
  5. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. implicit final val ctV: ClassTag[ScalarVector]

    Permalink

    Attributes
    protected
    Definition Classes
    KMeansKCommonsSpark
  8. final val customCenters: HashMap[Int, ScalarVector]

    Permalink
    Definition Classes
    KMeans → KCommons
  9. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  10. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  11. final def fit[O, Cz[Y, Z <: GVector[Z]] <: Clusterizable[Y, Z, Cz]](data: RDD[Cz[O, ScalarVector]])(implicit ct: ClassTag[Cz[O, ScalarVector]]): KMeansModel[D]

    Permalink

    Execute the corresponding clustering algorithm

    Execute the corresponding clustering algorithm

    returns

    ClusteringModel

    Definition Classes
    KMeansClusteringAlgorithmDistributed
  12. implicit val getCenter: (ScalarVector, Long) ⇒ ScalarVector

    Permalink
    Definition Classes
    KMeansKCentersAncestor
  13. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  14. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  15. final val k: Int

    Permalink

    : number of clusters

    : number of clusters

    Definition Classes
    KMeans → KCommonsArgs → KArgs
  16. final def kmppInitializationRDD(vectorizedDataset: RDD[ScalarVector], k: Int, metric: D): HashMap[Int, ScalarVector]

    Permalink

    To upgrade Kmeans++ initialization

    To upgrade Kmeans++ initialization

    References

    • Tapas Kanungo, David M. Mount, Nathan S. Netanyahu, Christine D. Piatko, Ruth Silverman, and Angela Y. Wu. An Efficient k-Means Clustering Algorithm: Analysis and Implementation. IEEE TRANS. PAMI, 2002.
    • D. Arthur and S. Vassilvitskii. "K-means++: the advantages of careful seeding". ACM-SIAM symposium on Discrete algorithms, 1027-1035, 2007.
    • Anna D. Peterson, Arka P. Ghosh and Ranjan Maitra. A systematic evaluation of different methods for initializing the K-means clustering algorithm. 2010.
    Attributes
    protected
    Definition Classes
    KCommonsSpark
  17. final val maxIterations: Int

    Permalink

    : maximal number of iteration

    : maximal number of iteration

    Definition Classes
    KMeans → MaxIterationsArgs
  18. final val metric: D

    Permalink

    : a defined dissimilarity measure

    : a defined dissimilarity measure

    Definition Classes
    KMeans → MetricArgs
  19. final val minShift: Double

    Permalink

    : minimal threshold under which we consider a centroid has converged

    : minimal threshold under which we consider a centroid has converged

    Definition Classes
    KMeans → KCommonsArgs → MinShiftArgs
  20. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  21. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  22. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  23. final def obtainCenters[O, Cz[Y, Z <: GVector[Z]] <: Clusterizable[Y, Z, Cz]](data: RDD[Cz[O, ScalarVector]])(implicit ct: ClassTag[Cz[O, ScalarVector]]): HashMap[Int, ScalarVector]

    Permalink

    Attributes
    protected
    Definition Classes
    KCentersAncestor
  24. final def parallelKmPPInitialization(vectorizedDataset: RDD[ScalarVector], k: Int, metric: D, l: Int = 10, numIter: Int = 5): HashMap[Int, ScalarVector]

    Permalink

    Definition Classes
    KCommonsSpark
  25. final val persistanceLVL: StorageLevel

    Permalink

    Definition Classes
    KMeansKCommonsSpark
  26. final def randomSelectedInitializationRDD(vectorizedDataset: RDD[ScalarVector], k: Int): HashMap[Int, ScalarVector]

    Permalink

    Select randomly k points which will becomes k centers itinialization.

    Select randomly k points which will becomes k centers itinialization.

    Attributes
    protected
    Definition Classes
    KCommonsSpark
  27. implicit val sumVector: (ScalarVector, ScalarVector) ⇒ ScalarVector

    Permalink
    Definition Classes
    KMeansKCentersAncestor
  28. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  29. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  30. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  31. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Product

Inherited from Equals

Inherited from KCentersAncestor[ScalarVector, D, KMeansModel[D]]

Inherited from ClusteringAlgorithmDistributed[ScalarVector, KMeansModel[D]]

Inherited from ClusteringAlgorithm

Inherited from KCommonsSpark[ScalarVector, D]

Inherited from KCommons[ScalarVector, D]

Inherited from ClusteringSharedTypes

Inherited from KCommonsArgs[ScalarVector, D]

Inherited from MetricArgs[ScalarVector, D]

Inherited from KArgs

Inherited from MaxIterationsArgs

Inherited from MinShiftArgs

Inherited from AlgorithmsArguments

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped