Class

com.twitter.algebird

MinHasher32

class MinHasher32 extends MinHasher[Int]

Linear Supertypes
MinHasher[Int], Monoid[MinHashSignature], AdditiveMonoid[MinHashSignature], cats.kernel.Monoid[MinHashSignature], Semigroup[MinHashSignature], AdditiveSemigroup[MinHashSignature], cats.kernel.Semigroup[MinHashSignature], Serializable, Serializable, AnyRef, Any

Instance Constructors

  1. new MinHasher32(targetThreshold: Double, maxBytes: Int)

  2. new MinHasher32(numHashes: Int, numBands: Int)
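
The numHashes and numBands parameters follow the usual locality-sensitive-hashing banding scheme: the hash values are split into numBands bands of numRows rows each. The following is a minimal sketch of the standard banding formulas from the LSH literature, not a copy of the library's code. The object name BandingSketch and its method names are hypothetical, and it is only an assumption that estimatedThreshold and probabilityOfInclusion correspond to these formulas.

```scala
// Illustrative only: standard LSH banding formulas, not algebird's source.
object BandingSketch {
  // Rows per band when numHashes values are split evenly into numBands bands.
  def rowsPerBand(numHashes: Int, numBands: Int): Int = numHashes / numBands

  // Similarity at which a pair becomes likely to share at least one bucket:
  // (1 / bands) ^ (1 / rows).
  def threshold(numBands: Int, numRows: Int): Double =
    math.pow(1.0 / numBands, 1.0 / numRows)

  // Probability that two sets with Jaccard similarity `sim` agree on every
  // row of at least one band: 1 - (1 - sim^rows)^bands.
  def inclusionProbability(sim: Double, numBands: Int, numRows: Int): Double =
    1.0 - math.pow(1.0 - math.pow(sim, numRows), numBands)
}
```

For example, BandingSketch.threshold(50, BandingSketch.rowsPerBand(200, 50)) gives a threshold of roughly 0.38. More rows per band raise the threshold, and more bands lower it. With 4-byte hashes, 200 hash values would occupy 800 bytes per signature.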


Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. def additive: algebra.Monoid[MinHashSignature]

    These are from algebra.Monoid

    Definition Classes
    Monoid → AdditiveMonoid → Semigroup → AdditiveSemigroup
  5. def approxCount(sig: Array[Byte]): Long

    Experimental approximate count for a signature. Seems to work, but is not generic yet (it is defined only on MinHasher32).

  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def assertNotZero(v: MinHashSignature): Unit

    Definition Classes
    Monoid
  8. def buckets(sig: MinHashSignature): List[Long]

    Bucket keys to use for quickly finding other similar items via locality sensitive hashing

    Definition Classes
    MinHasher
  9. def buildArray(left: Array[Byte], right: Array[Byte])(fn: (Int, Int) ⇒ Int): Array[Byte]

    Decode two signatures into hash values, combine them pairwise using fn, and produce a new array

    Attributes
    protected
    Definition Classes
    MinHasher32 → MinHasher
  10. def buildArray(fn: ⇒ Int): Array[Byte]

    Initialize a byte array by generating hash values

    Attributes
    protected
    Definition Classes
    MinHasher32 → MinHasher
  11. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  12. def combine(l: MinHashSignature, r: MinHashSignature): MinHashSignature

    Definition Classes
    Semigroup → Semigroup
  13. def combineAll(t: TraversableOnce[MinHashSignature]): MinHashSignature

    Definition Classes
    Monoid → Monoid
  14. def combineAllOption(as: TraversableOnce[MinHashSignature]): Option[MinHashSignature]

    Definition Classes
    Monoid → Semigroup
  15. def combineN(a: MinHashSignature, n: Int): MinHashSignature

    Definition Classes
    Monoid → Semigroup
  16. def empty: MinHashSignature

    Definition Classes
    Monoid → Monoid
  17. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  18. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  19. val estimatedThreshold: Double

    Useful for understanding the effects of numBands and numRows

    Definition Classes
    MinHasher
  20. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  21. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  22. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  23. def hashSize: Int

    The number of bytes used for each hash in the signature

    Definition Classes
    MinHasher32 → MinHasher
  24. def init(fn: (MurmurHash128) ⇒ (Long, Long)): MinHashSignature

    Create a signature for an arbitrary value

    Definition Classes
    MinHasher
  25. def init(value: String): MinHashSignature

    Create a signature for a single String value

    Definition Classes
    MinHasher
  26. def init(value: Long): MinHashSignature

    Create a signature for a single Long value

    Definition Classes
    MinHasher
  27. def isEmpty(a: MinHashSignature)(implicit ev: Eq[MinHashSignature]): Boolean

    Definition Classes
    Monoid
  28. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  29. def isNonZero(v: MinHashSignature): Boolean

    Definition Classes
    Monoid
  30. def isZero(a: MinHashSignature)(implicit ev: Eq[MinHashSignature]): Boolean

    Definition Classes
    AdditiveMonoid
  31. def maxHash: Int

    Maximum value the hash can take on (not 2^(8*hashSize) - 1, because the hash type is signed)

    Definition Classes
    MinHasher32 → MinHasher
  32. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  33. def nonZeroOption(v: MinHashSignature): Option[MinHashSignature]

    Definition Classes
    Monoid
  34. final def notify(): Unit

    Definition Classes
    AnyRef
  35. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  36. val numBands: Int

    Definition Classes
    MinHasher
  37. val numBytes: Int

    For an explanation of "bands" and "rows", see Rajaraman and Ullman, Mining of Massive Datasets

    Definition Classes
    MinHasher
  38. val numHashes: Int

    Definition Classes
    MinHasher
  39. val numRows: Int

    Definition Classes
    MinHasher
  40. def plus(left: MinHashSignature, right: MinHashSignature): MinHashSignature

    Set union

    Definition Classes
    MinHasher → AdditiveSemigroup
  41. def positiveSumN(a: MinHashSignature, n: Int): MinHashSignature

    Attributes
    protected[this]
    Definition Classes
    AdditiveSemigroup
  42. def probabilityOfInclusion(sim: Double): Double

    Useful for understanding the effects of numBands and numRows

    Definition Classes
    MinHasher
  43. def repeatedCombineN(a: MinHashSignature, n: Int): MinHashSignature

    Attributes
    protected[this]
    Definition Classes
    Semigroup
  44. def similarity(left: MinHashSignature, right: MinHashSignature): Double

    Estimate Jaccard similarity (size of intersection / size of union)

    Definition Classes
    MinHasher
  45. def sum(vs: TraversableOnce[MinHashSignature]): MinHashSignature

    Definition Classes
    Monoid → AdditiveMonoid
  46. def sumN(a: MinHashSignature, n: Int): MinHashSignature

    Definition Classes
    AdditiveMonoid → AdditiveSemigroup
  47. def sumOption(iter: TraversableOnce[MinHashSignature]): Option[MinHashSignature]

    Returns an instance of T calculated by summing all instances in iter in one pass. Returns None if iter is empty, else Some[T].

    iter
        instances of T to be combined
    returns
        None if iter is empty, else an option value containing the summed T

    Definition Classes
    Semigroup
    Note
    Override if there is a faster way to compute this sum than iter.reduceLeftOption using plus.
  48. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  49. def toString(): String

    Definition Classes
    AnyRef → Any
  50. def trySum(as: TraversableOnce[MinHashSignature]): Option[MinHashSignature]

    Definition Classes
    AdditiveMonoid → AdditiveSemigroup
  51. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  52. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  53. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  54. val zero: MinHashSignature

    Signature for empty set, needed to be a proper Monoid

    Definition Classes
    MinHasher → AdditiveMonoid
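
The members zero, init, plus, and similarity together describe the MinHash monoid: the empty set's signature is the identity, plus is set union, and similarity estimates the Jaccard index. The sketch below illustrates that idea under loud assumptions. Plain Array[Int] values stand in for MinHashSignature, the object MinHashSketch is hypothetical, and the seeded use of scala.util.hashing.MurmurHash3 is only a stand-in hash family, not the hashing the library performs.

```scala
import scala.util.hashing.MurmurHash3

// Illustrative only: plain Int arrays stand in for MinHashSignature.
object MinHashSketch {
  val numHashes = 128

  // Signature of the empty set: every slot holds the largest possible hash,
  // so it acts as the identity for element-wise minimum.
  val zero: Array[Int] = Array.fill(numHashes)(Int.MaxValue)

  // Signature of a single element: one hash per seed (hypothetical family).
  def init(value: String): Array[Int] =
    Array.tabulate(numHashes)(seed => MurmurHash3.stringHash(value, seed))

  // Set union: keep the smaller hash in each slot.
  def plus(l: Array[Int], r: Array[Int]): Array[Int] =
    l.zip(r).map { case (a, b) => math.min(a, b) }

  // Signature of a whole set, folded up from the empty-set signature.
  def signature(items: Set[String]): Array[Int] =
    items.foldLeft(zero)((acc, item) => plus(acc, init(item)))

  // Estimated Jaccard similarity: fraction of slots where the minima agree.
  def similarity(l: Array[Int], r: Array[Int]): Double =
    l.zip(r).count { case (a, b) => a == b }.toDouble / numHashes
}
```

Because element-wise minimum is associative and commutative, MinHashSketch.signature(a ++ b) equals MinHashSketch.plus(MinHashSketch.signature(a), MinHashSketch.signature(b)), which is what lets signatures be combined independently and merged later. The similarity estimate tightens as numHashes grows.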
