org.hammerlab.magic.rdd

keyed

package keyed

Visibility
  1. Public
  2. All

Type Members

  1. class CappedGroupByKeyRDD[K, V] extends AnyRef

    Wrap an RDD and expose a cappedGroupByKey method, which behaves like org.apache.spark.rdd.PairRDDFunctions.groupByKey but with a cap on the number of values that will be accumulated for each key.

  2. class KeySamples[V] extends Serializable

  3. class ReduceByKeyRDD[K, V] extends AnyRef

    Adds maxByKey and minByKey helpers to an RDD.

  4. class SampleByKeyRDD[K, V] extends AnyRef

  5. class SlicePartitionsRDD[T] extends PartitionPruningRDD[T]

  6. class SplitByKeyRDD[K, V] extends AnyRef

    Add splitByKey method to any paired RDD: returns a Map from each key (type K) to an RDD[V] with all the values that had that key in the original RDD (in arbitrary order).

Value Members

  1. object CappedGroupByKeyRDD

  2. object KeySamples extends Serializable

  3. object ReduceByKeyRDD

  4. object SampleByKeyRDD

  5. object SplitByKeyRDD

Ungrouped