Metadata describing Cassandra table partition processed by a single Spark task.
RDD created by repartitionByCassandraReplica with preferred locations mapping to the CassandraReplicas each partition was created for.
Creates CassandraPartitions for given Cassandra table
Stores a CQL WHERE
predicate matching a range of tokens.
Estimates amount of data in the Cassandra table.
Fast token range splitter assuming that data are spread out evenly in the whole range.
Looks up listen address of a cluster node given its RPC address.
Fast token range splitter assuming that data are spread out evenly in the whole range.
The replica partitioner will work on an RDD which is keyed on sets of InetAddresses representing Cassandra Hosts .
Divides a set of token ranges into groups containing not more than maxRowCountPerGroup
rows
and not more than maxGroupSize
token ranges.
Splits a token range into smaller sub-ranges, each with the desired approximate number of rows.
Provides components for partitioning a Cassandra table into smaller parts of appropriate size. Each partition can be processed locally on at least one cluster node.