org.hammerlab.magic.rdd

keyed

package keyed

Visibility

Public
All

Type Members

class CappedGroupByKeyRDD[K, V] extends AnyRef

Wrap an RDD and expose a cappedGroupByKey method, which behaves like org.apache.spark.rdd.PairRDDFunctions.groupByKey but with a cap on the number of values that will be accumulated for each key.
class KeySamples[V] extends Serializable
class ReduceByKeyRDD[K, V] extends AnyRef

Adds maxByKey and minByKey helpers to an RDD.
class SampleByKeyRDD[K, V] extends AnyRef
class SlicePartitionsRDD[T] extends PartitionPruningRDD[T]
class SplitByKeyRDD[K, V] extends AnyRef

Add splitByKey method to any paired RDD: returns a Map from each key (type K) to an RDD[V] with all the values that had that key in the original RDD (in arbitrary order).

Value Members

object CappedGroupByKeyRDD
object KeySamples extends Serializable
object ReduceByKeyRDD
object SampleByKeyRDD
object SplitByKeyRDD

Ungrouped