Given an outer join of two RDDs, expose statistics about how many identical elements they have at identical positions.
Summable data type for counting the number of elements that are the same vs. different in two RDDs.
Given an outer join of two RDDs with the presence or absence of values for a key replaced with a Boolean for each RDD, expose statistics about how many elements exist in either or both RDDs.
Summable data type recording the number of elements that are present in either or both of two RDDs.
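As a rough illustration of what such a summable presence record could look like, here is a minimal sketch. It assumes nothing about the library's actual classes: `PresenceStats`, `fromPresence`, and `PresenceDemo` are hypothetical names, and plain Scala `Set`s stand in for the keys of the two RDDs.

```scala
// Hypothetical sketch; these names are illustrative and not the library's API.
// Counts of keys present in both inputs, only in the first, or only in the second.
final case class PresenceStats(both: Long, onlyA: Long, onlyB: Long) {
  // Field-wise sum, so per-key records can be combined in any order.
  def +(o: PresenceStats): PresenceStats =
    PresenceStats(both + o.both, onlyA + o.onlyA, onlyB + o.onlyB)
}

object PresenceStats {
  val empty: PresenceStats = PresenceStats(0, 0, 0)

  // One key's contribution, given a Boolean per side saying whether a value existed.
  def fromPresence(inA: Boolean, inB: Boolean): PresenceStats =
    (inA, inB) match {
      case (true, true)   => PresenceStats(1, 0, 0)
      case (true, false)  => PresenceStats(0, 1, 0)
      case (false, true)  => PresenceStats(0, 0, 1)
      case (false, false) => empty
    }
}

object PresenceDemo {
  // Local stand-in for the outer join: visit every key seen on either side.
  def stats[K](a: Set[K], b: Set[K]): PresenceStats =
    (a union b).toSeq
      .map(k => PresenceStats.fromPresence(a(k), b(k)))
      .foldLeft(PresenceStats.empty)(_ + _)
}
```

Because `+` is associative and `empty` is its identity, the same per-key records could in principle be reduced across partitions rather than folded locally.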
Wrap an RDD and expose compare, compareElements, and isEqual methods for testing its equality to other RDDs.
Wrap an RDD and expose methods for comparing its elements to those of other RDDs, disregarding the order in which they appear in each.
Summable data-type for counting the number of elements that are the same vs. different in two RDDs, or that exist in just one or the other.
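The four-way count described above (same, different, only in one side, only in the other) can be sketched as follows. This is an assumption-laden illustration rather than the library's implementation: `CmpStats`, `of`, and `CmpDemo.compare` are hypothetical names, and local `Map`s keyed by position or any other key stand in for the outer-joined RDDs.

```scala
// Hypothetical sketch; these names are illustrative and not the library's API.
final case class CmpStats(equal: Long, notEqual: Long, onlyA: Long, onlyB: Long) {
  // Field-wise sum, so per-key records can be combined in any order.
  def +(o: CmpStats): CmpStats =
    CmpStats(equal + o.equal, notEqual + o.notEqual, onlyA + o.onlyA, onlyB + o.onlyB)
}

object CmpStats {
  val empty: CmpStats = CmpStats(0, 0, 0, 0)

  // One joined row: each side is an Option, as an outer join would produce.
  def of[V](a: Option[V], b: Option[V]): CmpStats =
    (a, b) match {
      case (Some(x), Some(y)) if x == y => CmpStats(1, 0, 0, 0)
      case (Some(_), Some(_))           => CmpStats(0, 1, 0, 0)
      case (Some(_), None)              => CmpStats(0, 0, 1, 0)
      case (None, Some(_))              => CmpStats(0, 0, 0, 1)
      case (None, None)                 => empty
    }
}

object CmpDemo {
  // Local stand-in for the outer join: look up every key seen on either side.
  def compare[K, V](a: Map[K, V], b: Map[K, V]): CmpStats =
    (a.keySet union b.keySet).toSeq
      .map(k => CmpStats.of(a.get(k), b.get(k)))
      .foldLeft(CmpStats.empty)(_ + _)
}
```

For example, comparing `Map("x" -> 1, "y" -> 2, "z" -> 3)` with `Map("x" -> 1, "y" -> 20, "w" -> 4)` counts one equal key (`x`), one differing key (`y`), and one key unique to each side (`z` and `w`).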