package org.bdgenomics.utils.minhash


Type Members

  1. case class MinHashSignature extends Product with Serializable

  2. trait MinHashable extends AnyRef

Value Members

  1. object MinHash extends Serializable

    This object presents several methods for determining approximate pair-wise Jaccard similarity through the use of MinHash signatures. A description of this algorithm can be found in chapter 3 of:

    Rajaraman, Anand, and Jeffrey David Ullman. Mining of Massive Datasets. Cambridge University Press, 2011.

    This chapter may be freely (and legally) downloaded from:

    http://infolab.stanford.edu/~ullman/mmds/ch3.pdf

  2. object MinHashRDD extends Serializable

  3. object MinHashSignature extends Serializable
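
The MinHash technique referenced in the description of object MinHash can be illustrated with a minimal, self-contained sketch. This is not the API of this package: the object name `MinHashSketch`, its method names, and the use of seeded MurmurHash3 as a stand-in for a family of random hash functions are all assumptions made for illustration. A signature keeps, for each hash function, the minimum hash value over the elements of a set, and the fraction of positions at which two signatures agree estimates the Jaccard similarity of the underlying sets.

```scala
import scala.util.hashing.MurmurHash3

// Hypothetical illustration only; these names are NOT part of
// org.bdgenomics.utils.minhash.
object MinHashSketch {

  // Builds a signature with one entry per hash function. Each "hash function"
  // is MurmurHash3 with a different seed (an assumption of this sketch), and
  // each entry is the minimum hash value over all elements of the set.
  def signature(items: Set[String], numHashes: Int): Array[Int] = {
    require(numHashes > 0, "need at least one hash function")
    require(items.nonEmpty, "cannot build a signature for an empty set")
    Array.tabulate(numHashes) { seed =>
      items.iterator.map(item => MurmurHash3.stringHash(item, seed)).min
    }
  }

  // Estimates Jaccard similarity as the fraction of signature positions
  // at which the two signatures hold the same minimum hash value.
  def estimateJaccard(a: Array[Int], b: Array[Int]): Double = {
    require(a.length == b.length && a.nonEmpty, "signatures must be non-empty and of equal length")
    a.zip(b).count { case (x, y) => x == y }.toDouble / a.length
  }

  // Exact Jaccard similarity |A ∩ B| / |A ∪ B|, for comparison with the estimate.
  def exactJaccard(a: Set[String], b: Set[String]): Double = {
    require(a.nonEmpty || b.nonEmpty, "at least one set must be non-empty")
    (a intersect b).size.toDouble / (a union b).size
  }
}
```

Under these assumptions, calling `MinHashSketch.estimateJaccard` on the signatures of two sets gives a value that approaches `MinHashSketch.exactJaccard` of those sets as `numHashes` grows; identical sets always produce identical signatures and therefore an estimate of 1.0.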
