Sampler for Avro files.
Sampler for BigQuery tables.
Only head mode is supported.
Trait for a data sampler.
Sample wrapper function for Avro GenericRecord
Record Type
The input SCollection to be sampled
The sample rate
Fields to construct hash over for determinism
Seed used to salt the deterministic hash
Desired output sample distribution
Fields to construct distribution over (each stratum is one unique combination of these fields' values)
Approximate or Exact precision
Maximum allowed size per key (can be tweaked for very large data sets)
Determines how bytes are encoded prior to hashing.
SCollection containing the sampled population
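
The sample rate, hash fields, and seed described above can be combined into a deterministic sampler. The following is a minimal sketch in plain Scala collections, not the library's actual implementation: the `Record` alias, the `bucket` and `sample` helpers, and the choice of MurmurHash3 are all assumptions made for illustration. Each record's hash fields are hashed with the seed, the hash is mapped to a value in [0, 1), and the record is kept when that value is below the sample rate.

```scala
import scala.util.hashing.MurmurHash3

object DeterministicSampleSketch {
  // Hypothetical record type: a flat map from field name to value.
  type Record = Map[String, Any]

  /** Maps the chosen fields of a record to a stable value in [0, 1). */
  def bucket(record: Record, fields: Seq[String], seed: Int): Double = {
    // Join the field values in a fixed order; a missing field contributes "".
    val key = fields
      .map(f => record.get(f).map(_.toString).getOrElse(""))
      .mkString("\u0000")
    val hash = MurmurHash3.stringHash(key, seed)
    // Treat the signed 32-bit hash as unsigned, then scale it into [0, 1).
    (hash.toLong & 0xffffffffL).toDouble / (1L << 32).toDouble
  }

  /** Keeps a record when its bucket value falls below the sample rate. */
  def sample(records: Seq[Record], rate: Double, fields: Seq[String], seed: Int): Seq[Record] =
    records.filter(r => bucket(r, fields, seed) < rate)
}
```

Because the decision depends only on the field values and the seed, re-running the sketch on the same input selects the same records, and records that share the same hash field values are always kept or dropped together. Changing the seed selects a different subset.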
Sample wrapper function for Avro GenericRecord
Record Type
The input SCollection to be sampled
The sample rate
Fields to construct hash over for determinism
Seed used to salt the deterministic hash
Desired output sample distribution
Fields to construct distribution over (each stratum is one unique combination of these fields' values)
Approximate or Exact precision
Maximum allowed size per key (can be tweaked for very large data sets)
Determines how bytes are encoded prior to hashing.
SCollection containing the sampled population
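
To illustrate what sampling over distribution fields means, here is a hedged sketch of a stratified sample in plain Scala collections. It is not the library's implementation: the `Record` alias, the helper names, and the rounding rule are assumptions for illustration, and the per-key size limit and approximate precision mode are not modelled. Records are grouped into strata by the values of the distribution fields, and the same fraction is taken from each stratum so the output keeps the input's proportions.

```scala
import scala.util.hashing.MurmurHash3

object StratifiedSampleSketch {
  // Hypothetical record type: a flat map from field name to value.
  type Record = Map[String, Any]

  /** A stratum is identified by the values of the distribution fields. */
  def stratumOf(record: Record, distributionFields: Seq[String]): Seq[String] =
    distributionFields.map(f => record.get(f).map(_.toString).getOrElse(""))

  /** Takes round(rate * size) records from every stratum. */
  def sample(
      records: Seq[Record],
      rate: Double,
      distributionFields: Seq[String],
      seed: Int
  ): Seq[Record] =
    records
      .groupBy(stratumOf(_, distributionFields))
      .values
      .flatMap { stratum =>
        val target = math.round(stratum.size * rate).toInt
        // Order by a seeded hash of the whole record so the choice is repeatable.
        stratum
          .sortBy(r => MurmurHash3.stringHash(r.toSeq.map(_.toString).sorted.mkString("|"), seed))
          .take(target)
      }
      .toSeq
}
```

With 800 records in one stratum and 200 in another, a rate of 0.1 yields 80 and 20 records respectively under this sketch, which corresponds to an exact rather than approximate result per stratum.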
Sample wrapper function for Avro GenericRecord
The input SCollection to be sampled
The sample rate
Fields to construct hash over for determinism
Seed used to salt the deterministic hash
Desired output sample distribution
Fields to construct distribution over (each stratum is one unique combination of these fields' values)
Approximate or Exact precision
Maximum allowed size per key (can be tweaked for very large data sets)
Determines how bytes are encoded prior to hashing.
SCollection containing the sampled population