if true, aggregation is done partially on local data, without shuffling to ensure that all values where groupingExpressions are equal are present.
expressions that are evaluated to determine grouping.
expressions that are computed for each group.
the input data source.
An aggregate that needs to be computed for each row in a group.
Overridden by concrete implementations of SparkPlan.
Returns the result of this query as an RDD[Row] by delegating to doExecute after adding query plan information to created RDDs for visualization. Concrete implementations of SparkPlan should override doExecute instead.
Runs this query returning the result as an array.
Runs this query returning the first n rows as an array.
This is modeled after RDD.take but never runs any job locally on the driver.
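The incremental scan that a take-style operation performs can be sketched in plain Scala. This is only an illustration under stated assumptions, not Spark's implementation: `TakeSketch`, the in-memory `partitions` model, and the growth factor of 4 are all hypothetical. The idea shown is to read a few partitions at a time and stop as soon as n rows have been collected, rather than materializing the whole result.

```scala
import scala.collection.mutable.ArrayBuffer

object TakeSketch {
  // Hypothetical model: each inner Seq stands in for one partition of a distributed dataset.
  def take[T](partitions: IndexedSeq[Seq[T]], n: Int): Seq[T] = {
    val collected = ArrayBuffer.empty[T]
    var scanned = 0 // number of partitions already read
    var batch = 1   // number of partitions to read in the next round
    while (collected.size < n && scanned < partitions.length) {
      val end = math.min(scanned + batch, partitions.length)
      val stillNeeded = n - collected.size
      // Each partition in this round contributes at most the rows still needed.
      val results = partitions.slice(scanned, end).map(_.take(stillNeeded))
      // Append round results in partition order, never exceeding n rows in total.
      results.foreach(rows => collected ++= rows.take(n - collected.size))
      scanned = end
      batch *= 4 // assumed growth factor; a real engine may estimate this differently
    }
    collected.toSeq
  }
}
```

For example, `TakeSketch.take(IndexedSeq(Seq(1), Seq(), Seq(2, 3), Seq(4)), 3)` reads the first partition alone, then the remaining three in a second round, and returns the first three rows in partition order.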
Overridden makeCopy also propagates sqlContext to the copied plan.
Specifies how data is ordered in each partition.
Specifies how data is partitioned across different nodes in the cluster.
Specifies any partition requirements on the input data for this operator.
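As a rough illustration of how an aggregation operator might state its partition requirements, the sketch below picks a requirement from a partial flag and a list of grouping keys. Everything here is an assumption made for illustration: the `Distribution` types are simplified local stand-ins rather than Spark's own classes, and `requiredDistribution` is a hypothetical helper. The reasoning it encodes is that a partial aggregation can run on data wherever it already is, a global aggregation with no grouping keys needs all rows together, and a grouped aggregation needs rows with equal keys co-located.

```scala
object DistributionSketch {
  // Hypothetical, simplified stand-ins for distribution requirements.
  sealed trait Distribution
  case object UnspecifiedDistribution extends Distribution // no requirement on data placement
  case object AllTuples extends Distribution               // every row in a single partition
  final case class ClusteredDistribution(keys: Seq[String]) extends Distribution // equal keys co-located

  // Chooses the requirement on the input data for an aggregation step.
  def requiredDistribution(partial: Boolean, groupingKeys: Seq[String]): Distribution =
    if (partial) UnspecifiedDistribution       // local pre-aggregation: no shuffle needed
    else if (groupingKeys.isEmpty) AllTuples   // one global group: gather everything
    else ClusteredDistribution(groupingKeys)   // grouped: cluster rows by the grouping keys
}
```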
Specifies the sort order required of each partition of the input data for this operator.
A handle to the SQL Context that was used to create this plan. Since many operators need access to the sqlContext for RDD operations or configuration, this field is automatically populated by the query planning infrastructure.
:: DeveloperApi ::
Groups input data by groupingExpressions and computes the aggregateExpressions for each group.
if true, aggregation is done partially on local data, without shuffling to ensure that all values where groupingExpressions are equal are present.
expressions that are evaluated to determine grouping.
expressions that are computed for each group.
the input data source.
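The difference between partial and complete aggregation can be shown with ordinary Scala collections. This is a minimal sketch under stated assumptions, not the operator's code: `PartialAggregationSketch`, the `Partition` type, and the choice of a sum as the aggregate are all hypothetical. The partial step groups and sums within each partition without moving data, so the same key may still appear in several partial results; the final step merges those results so that each key has exactly one total.

```scala
object PartialAggregationSketch {
  // Hypothetical stand-in for partitioned input: one inner Seq of (key, value) rows per partition.
  type Partition = Seq[(String, Int)]

  // Partial step: sum values per key inside a single partition, with no data movement.
  def partialSum(partition: Partition): Map[String, Int] =
    partition.groupBy(_._1).map { case (key, rows) => key -> rows.map(_._2).sum }

  // Final step: bring partial sums for the same key together and merge them.
  def finalSum(partials: Seq[Map[String, Int]]): Map[String, Int] =
    partials.flatMap(_.toSeq).groupBy(_._1).map { case (key, sums) => key -> sums.map(_._2).sum }

  // Two-phase aggregation: partial sums per partition, then one merge across partitions.
  def aggregate(partitions: Seq[Partition]): Map[String, Int] =
    finalSum(partitions.map(partialSum))
}
```

With two partitions `Seq("a" -> 1, "b" -> 2)` and `Seq("a" -> 3)`, the partial step yields one entry for "a" in each partition, and only the final merge produces the single total for "a".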