Implements the stable bloom filter from the paper by F.
A builder interface for creating StableBloomFilter instances.
A builder interface for creating StableBloomFilter instances.
the type of item inserted into the filter
a guava funnel
number of cells in the filter
number of bits per cell in the filter
desired maximum false positive rate of the filter
Implements the stable bloom filter from the paper by F. Deng and D. Rafiei. Approximately detecting duplicates for streaming data using stable bloom filters. In SIGMOD, pages 25–36, 2006.
We use heap storage (an array of Longs). This implies
M=m*d
can be set as high as about 125 giga-bits.the type of funnel used
a Guava funnel for taking input
number of cells (see the paper,
m
is aLong
butm/floor(63/d)
must fit in a 32-bitInt
)bits per cell (see the paper, should lie in [1,63] but often set to 1, 2 or 3)
expected false positive rate (should lie in (0,1))