A relation produced by applying func
to each element of the child
, concatenating the
resulting columns at the end of the input row.
An optimized version of AppendColumns, that can be executed on deserialized object directly.
A logical plan node with a left and right child.
A hint for the optimizer that we should broadcast the child
if used in a join operator.
A relation produced by applying func
to each grouping key and associated values from left and
right children.
A logical node that represents a non-query command to be executed by the system.
Takes the input row from child and turns it into object using the given deserializer expression.
Returns a new logical plan that dedups input rows.
Apply a number of projections to every input row, hence we will get multiple output rows for an input row.
Applies a Generator to a stream of input rows, combining the output of each into a new stream of rows.
A GROUP BY clause with GROUPING SETS can generate a result set equivalent to generated by a UNION ALL of multiple simple GROUP BY clauses.
A logical plan node with no children.
A relation produced by applying func
to each element of the child
.
Applies func to each unique group in child
, based on the evaluation of groupingAttributes
.
A relation produced by applying func
to each partition of the child
.
A relation produced by applying a serialized R function func
to each partition of the child
.
A trait for logical operators that consumes domain objects as input.
A trait for logical operators that produces domain objects as output.
Performs a physical redistribution of the data.
Returns a new RDD that has exactly numPartitions
partitions.
This method repartitions data using Expressions into numPartitions
, and receives
information about the number of partitions during execution.
When planning take() or collect() operations, this special node that is inserted at the top of the logical plan before invoking the query planner.
Sample the dataset.
Input and output properties when passing data to a script.
Transforms the input by forking and running the specified script.
Takes the input object from child and turns it into unsafe row using the given serializer expression.
A logical plan node with single child.
A container for holding named common table expressions (CTEs) and a query plan.
Factory for constructing new AppendColumn
nodes.
Factory for constructing new CoGroup
nodes.
Factory for constructing new FlatMapGroupsInR
nodes.
Factory for constructing new MapGroups
nodes.
A relation with one row.
Factory for constructing new Range
nodes.
Factory for constructing new Union
nodes.