com.twitter.algebird

MinHasher32

class MinHasher32 extends MinHasher[Int]

Linear Supertypes
MinHasher[Int], Monoid[Array[Byte]], Semigroup[Array[Byte]], Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. MinHasher32
  2. MinHasher
  3. Monoid
  4. Semigroup
  5. Serializable
  6. AnyRef
  7. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new MinHasher32(t: Double, n: Int)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def approxCount(sig: Array[Byte]): Long

    seems to work, but experimental and not generic yet

  7. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  8. def assertNotZero(v: Array[Byte]): Unit

    Definition Classes
    Monoid
  9. def buckets(sig: Array[Byte]): List[Long]

    Bucket keys to use for quickly finding other similar items via locality sensitive hashing

    Bucket keys to use for quickly finding other similar items via locality sensitive hashing

    Definition Classes
    MinHasher
  10. def buildArray(left: Array[Byte], right: Array[Byte])(fn: (Int, Int) ⇒ Int): Array[Byte]

    Decode two signatures into hash values, combine them somehow, and produce a new array

    Decode two signatures into hash values, combine them somehow, and produce a new array

    Definition Classes
    MinHasher32MinHasher
  11. def buildArray(fn: ⇒ Int): Array[Byte]

    Initialize a byte array by generating hash values

    Initialize a byte array by generating hash values

    Definition Classes
    MinHasher32MinHasher
  12. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws()
  13. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  14. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  15. val estimatedThreshold: Double

    useful for understanding the effects of numBands and numRows

    useful for understanding the effects of numBands and numRows

    Definition Classes
    MinHasher
  16. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws()
  17. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  18. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  19. val hashFunctions: IndexedSeq[MurmurHash128]

    We always use a 128 bit hash function, so the number of hash functions is different (and usually smaller) than the number of hashes in the signature.

    We always use a 128 bit hash function, so the number of hash functions is different (and usually smaller) than the number of hashes in the signature.

    Definition Classes
    MinHasher
  20. def hashSize: Int

    the number of bytes used for each hash in the signature

    the number of bytes used for each hash in the signature

    Definition Classes
    MinHasher32MinHasher
  21. def init(fn: (MurmurHash128) ⇒ (Long, Long)): Array[Byte]

    Create a signature for an arbitrary value

    Create a signature for an arbitrary value

    Definition Classes
    MinHasher
  22. def init(value: String): Array[Byte]

    Create a signature for a single String value

    Create a signature for a single String value

    Definition Classes
    MinHasher
  23. def init(value: Long): Array[Byte]

    Create a signature for a single Long value

    Create a signature for a single Long value

    Definition Classes
    MinHasher
  24. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  25. def isNonZero(v: Array[Byte]): Boolean

    Definition Classes
    MonoidSemigroup
  26. def maxHash: Int

    Maximum value the hash can take on (not 2*hashSize because of signed types)

    Maximum value the hash can take on (not 2*hashSize because of signed types)

    Definition Classes
    MinHasher32MinHasher
  27. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  28. def nonZeroOption(v: Array[Byte]): Option[Array[Byte]]

    Definition Classes
    Monoid
  29. final def notify(): Unit

    Definition Classes
    AnyRef
  30. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  31. val numBands: Int

    For explanation of the "bands" and "rows" see Ullman and Rajaraman

    For explanation of the "bands" and "rows" see Ullman and Rajaraman

    Definition Classes
    MinHasher
  32. val numBytes: Int

    Definition Classes
    MinHasher
  33. val numHashes: Int

    Definition Classes
    MinHasher
  34. val numRows: Int

    Definition Classes
    MinHasher
  35. def pickBands(threshold: Double, hashes: Int): Int

    numerically solve the inverse of estimatedThreshold, given numBands*numRows

    numerically solve the inverse of estimatedThreshold, given numBands*numRows

    Definition Classes
    MinHasher
  36. def plus(left: Array[Byte], right: Array[Byte]): Array[Byte]

    Set union

    Set union

    Definition Classes
    MinHasherSemigroup
  37. def probabilityOfInclusion(sim: Double): Double

    useful for understanding the effects of numBands and numRows

    useful for understanding the effects of numBands and numRows

    Definition Classes
    MinHasher
  38. val seed: Int

    This seed could be anything

    This seed could be anything

    Definition Classes
    MinHasher
  39. def similarity(left: Array[Byte], right: Array[Byte]): Double

    Esimate jaccard similarity (size of union / size of intersection)

    Esimate jaccard similarity (size of union / size of intersection)

    Definition Classes
    MinHasher
  40. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  41. def toString(): String

    Definition Classes
    AnyRef → Any
  42. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws()
  43. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws()
  44. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws()
  45. val zero: Array[Byte]

    Signature for empty set, needed to be a proper Monoid

    Signature for empty set, needed to be a proper Monoid

    Definition Classes
    MinHasherMonoid

Deprecated Value Members

  1. def sum(vs: TraversableOnce[Array[Byte]]): Array[Byte]

    Definition Classes
    Monoid
    Annotations
    @deprecated
    Deprecated

    Just use Monoid.sum

Inherited from MinHasher[Int]

Inherited from Monoid[Array[Byte]]

Inherited from Semigroup[Array[Byte]]

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped