Performs an inner hash join of two child relations.
Performs an inner hash join of two child relations. When the output RDD of this operator is being constructed, a Spark job is asynchronously started to calculate the values for the broadcasted relation. This data is then placed in a Spark broadcast variable. The streamed relation is not shuffled.
Performs a outer hash join for two child relations.
Performs a outer hash join for two child relations. When the output RDD of this operator is being constructed, a Spark job is asynchronously started to calculate the values for the broadcasted relation. This data is then placed in a Spark broadcast variable. The streamed relation is not shuffled.
Build the right table's join keys into a HashSet, and iteratively go through the left table, to find the if join keys are in the Hash set.
Using BroadcastNestedLoopJoin to calculate left semi join result when there's no join keys for hash join.
Build the right table's join keys into a HashSet, and iteratively go through the left table, to find the if join keys are in the Hash set.
Performs an sort merge join of two child relations.
Performs an sort merge outer join of two child relations.
Physical execution operators for join operations.