Creates an RDD having each of required objects distributed to all required destinations
Creates an RDD having each of required objects distributed to all required destinations
If the data
and/or mapping
are partitioned with the same partitioner, no shuffle is required.
Mapping of keys to destinations. Duplicate destinations from different partitions are coalesced.
Partitioner to use. Operation is more efficient if data is partitioned with the same partitioner
Function to apply to values. It's called exactly once for each value in data
Utility used to group data by key within RDD partitions by a key, assuming that RDD is already partitioned and sorted by key.
Utility used to group data by key within RDD partitions by a key, assuming that RDD is already partitioned and sorted by key. The RDD data are processed sequentialy without shuffling and materializing them in memory.
RDD with key -> values.
Similar to org.apache.spark.rdd.PairRDDFunctions.join, but ensures that all keys are unique in both RDDs.
Similar to org.apache.spark.rdd.PairRDDFunctions.join, but ensures that all keys are unique in both RDDs.
Throws java.lang.IllegalArgumentException if encounters duplicate
Similar to org.apache.spark.rdd.PairRDDFunctions.join, but ensures that all keys are unique in both RDDs.
Similar to org.apache.spark.rdd.PairRDDFunctions.join, but ensures that all keys are unique in both RDDs.
Throws java.lang.IllegalArgumentException if encounters duplicate
Similar to org.apache.spark.rdd.PairRDDFunctions.join, but ensures that all keys are unique in both RDDs.
Similar to org.apache.spark.rdd.PairRDDFunctions.join, but ensures that all keys are unique in both RDDs.
Throws java.lang.IllegalArgumentException if encounters duplicate
Similar to org.apache.spark.rdd.PairRDDFunctions.leftOuterJoin, but ensures that all keys are unique in both RDDs.
Similar to org.apache.spark.rdd.PairRDDFunctions.leftOuterJoin, but ensures that all keys are unique in both RDDs.
Throws java.lang.IllegalArgumentException if encounters duplicate
Similar to org.apache.spark.rdd.PairRDDFunctions.leftOuterJoin, but ensures that all keys are unique in both RDDs.
Similar to org.apache.spark.rdd.PairRDDFunctions.leftOuterJoin, but ensures that all keys are unique in both RDDs.
Throws java.lang.IllegalArgumentException if encounters duplicate
Similar to org.apache.spark.rdd.PairRDDFunctions.leftOuterJoin, but ensures that all keys are unique in both RDDs.
Similar to org.apache.spark.rdd.PairRDDFunctions.leftOuterJoin, but ensures that all keys are unique in both RDDs.
Throws java.lang.IllegalArgumentException if encounters duplicate