logical

Type Members

case class Aggregate(groupingExpressions: Seq[Expression], aggregateExpressions: Seq[NamedExpression], child: LogicalPlan) extends UnaryNode with Product with Serializable
abstract class BinaryNode extends LogicalPlan with trees.BinaryNode[LogicalPlan]

A logical plan node with a left and right child.
trait Command extends AnyRef

A logical node that represents a non-query command to be executed by the system.
A logical node that represents a non-query command to be executed by the system. For example, commands can be used by parsers to represent DDL operations. Commands, unlike queries, are eagerly executed.
case class Cube(groupByExprs: Seq[Expression], child: LogicalPlan, aggregations: Seq[NamedExpression], gid: AttributeReference = VirtualColumn.newGroupingId) extends UnaryNode with GroupingAnalytics with Product with Serializable

Cube is a syntactic sugar for GROUPING SETS, and will be transformed to GroupingSets, and eventually will be transformed to Aggregate(.., Expand) in Analyzer
Cube is a syntactic sugar for GROUPING SETS, and will be transformed to GroupingSets, and eventually will be transformed to Aggregate(.., Expand) in Analyzer
groupByExprs
The Group By expressions candidates.
child
Child operator
aggregations
The Aggregation expressions, those non selected group by expressions will be considered as constant null if it appears in the expressions
gid
The attribute represents the virtual column GROUPINGID, and it's also the bitmask indicates the selected GroupBy Expressions for each aggregating output row.
case class Distinct(child: LogicalPlan) extends UnaryNode with Product with Serializable
case class Except(left: LogicalPlan, right: LogicalPlan) extends BinaryNode with Product with Serializable
case class Expand(projections: Seq[GroupExpression], output: Seq[Attribute], child: LogicalPlan) extends UnaryNode with Product with Serializable

Apply the all of the GroupExpressions to every input row, hence we will get multiple output rows for a input row.
Apply the all of the GroupExpressions to every input row, hence we will get multiple output rows for a input row.
projections
The group of expressions, all of the group expressions should output the same schema specified by the parameter output
output
The output Schema
child
Child operator
case class Filter(condition: Expression, child: LogicalPlan) extends UnaryNode with Product with Serializable
case class Generate(generator: Generator, join: Boolean, outer: Boolean, qualifier: Option[String], generatorOutput: Seq[Attribute], child: LogicalPlan) extends UnaryNode with Product with Serializable

Applies a Generator to a stream of input rows, combining the output of each into a new stream of rows.
Applies a Generator to a stream of input rows, combining the output of each into a new stream of rows. This operation is similar to a flatMap in functional programming with one important additional feature, which allows the input rows to be joined with their output.
generator
the generator expression
join
when true, each output row is implicitly joined with the input tuple that produced it.
outer
when true, each input row will be output at least once, even if the output of the given generator is empty. outer has no effect when join is false.
qualifier
Qualifier for the attributes of generator(UDTF)
generatorOutput
The output schema of the Generator.
child
Children logical plan node
trait GroupingAnalytics extends UnaryNode
case class GroupingSets(bitmasks: Seq[Int], groupByExprs: Seq[Expression], child: LogicalPlan, aggregations: Seq[NamedExpression], gid: AttributeReference = VirtualColumn.newGroupingId) extends UnaryNode with GroupingAnalytics with Product with Serializable

A GROUP BY clause with GROUPING SETS can generate a result set equivalent to generated by a UNION ALL of multiple simple GROUP BY clauses.
A GROUP BY clause with GROUPING SETS can generate a result set equivalent to generated by a UNION ALL of multiple simple GROUP BY clauses.
We will transform GROUPING SETS into logical plan Aggregate(.., Expand) in Analyzer
bitmasks
A list of bitmasks, each of the bitmask indicates the selected GroupBy expressions
groupByExprs
The Group By expressions candidates, take effective only if the associated bit in the bitmask set to 1.
child
Child operator
aggregations
The Aggregation expressions, those non selected group by expressions will be considered as constant null if it appears in the expressions
gid
The attribute represents the virtual column GROUPINGID, and it's also the bitmask indicates the selected GroupBy Expressions for each aggregating output row. The associated output will be one of the value in bitmasks
case class InsertIntoTable(table: LogicalPlan, partition: Map[String, Option[String]], child: LogicalPlan, overwrite: Boolean, ifNotExists: Boolean) extends LogicalPlan with Product with Serializable
case class Intersect(left: LogicalPlan, right: LogicalPlan) extends BinaryNode with Product with Serializable
case class Join(left: LogicalPlan, right: LogicalPlan, joinType: JoinType, condition: Option[Expression]) extends BinaryNode with Product with Serializable
abstract class LeafNode extends LogicalPlan with trees.LeafNode[LogicalPlan]

A logical plan node with no children.
case class Limit(limitExpr: Expression, child: LogicalPlan) extends UnaryNode with Product with Serializable
case class LocalRelation(output: Seq[Attribute], data: Seq[Row] = Nil) extends LeafNode with MultiInstanceRelation with Product with Serializable
abstract class LogicalPlan extends QueryPlan[LogicalPlan] with Logging
case class Project(projectList: Seq[NamedExpression], child: LogicalPlan) extends UnaryNode with Product with Serializable
abstract class RedistributeData extends UnaryNode

Performs a physical redistribution of the data.
Performs a physical redistribution of the data. Used when the consumer of the query result have expectations about the distribution and ordering of partitioned input data.
case class Repartition(numPartitions: Int, shuffle: Boolean, child: LogicalPlan) extends UnaryNode with Product with Serializable

Return a new RDD that has exactly numPartitions partitions.
Return a new RDD that has exactly numPartitions partitions. Differs from RepartitionByExpression as this method is called directly by DataFrame's, because the user asked for coalesce or repartition. RepartitionByExpression is used when the consumer of the output requires some specific ordering or distribution of the data.
case class RepartitionByExpression(partitionExpressions: Seq[Expression], child: LogicalPlan) extends RedistributeData with Product with Serializable

This method repartitions data using Expressions, and receives information about the number of partitions during execution.
This method repartitions data using Expressions, and receives information about the number of partitions during execution. Used when a specific ordering or distribution is expected by the consumer of the query result. Use Repartition for RDD-like coalesce and repartition.
case class Rollup(groupByExprs: Seq[Expression], child: LogicalPlan, aggregations: Seq[NamedExpression], gid: AttributeReference = VirtualColumn.newGroupingId) extends UnaryNode with GroupingAnalytics with Product with Serializable

Rollup is a syntactic sugar for GROUPING SETS, and will be transformed to GroupingSets, and eventually will be transformed to Aggregate(.., Expand) in Analyzer
Rollup is a syntactic sugar for GROUPING SETS, and will be transformed to GroupingSets, and eventually will be transformed to Aggregate(.., Expand) in Analyzer
groupByExprs
The Group By expressions candidates, take effective only if the associated bit in the bitmask set to 1.
child
Child operator
aggregations
The Aggregation expressions, those non selected group by expressions will be considered as constant null if it appears in the expressions
gid
The attribute represents the virtual column GROUPINGID, and it's also the bitmask indicates the selected GroupBy Expressions for each aggregating output row.
case class Sample(lowerBound: Double, upperBound: Double, withReplacement: Boolean, seed: Long, child: LogicalPlan) extends UnaryNode with Product with Serializable

Sample the dataset.
Sample the dataset.
lowerBound
Lower-bound of the sampling probability (usually 0.0)
upperBound
Upper-bound of the sampling probability. The expected fraction sampled will be ub - lb.
withReplacement
Whether to sample with replacement.
seed
the random seed
child
the LogicalPlan
trait ScriptInputOutputSchema extends AnyRef

A placeholder for implementation specific input and output properties when passing data to a script.
A placeholder for implementation specific input and output properties when passing data to a script. For example, in Hive this would specify which SerDes to use.
case class ScriptTransformation(input: Seq[Expression], script: String, output: Seq[Attribute], child: LogicalPlan, ioschema: ScriptInputOutputSchema) extends UnaryNode with Product with Serializable

Transforms the input by forking and running the specified script.
Transforms the input by forking and running the specified script.
input
the set of expression that should be passed to the script.
script
the command that should be executed.
output
the attributes that are produced by the script.
ioschema
the input and output schema applied in the execution of the script.
case class Sort(order: Seq[SortOrder], global: Boolean, child: LogicalPlan) extends UnaryNode with Product with Serializable

order
The ordering expressions
global
True means global sorting apply for entire data set, False means sorting only apply within the partition.
child
Child logical plan
case class SortPartitions(sortExpressions: Seq[SortOrder], child: LogicalPlan) extends RedistributeData with Product with Serializable
case class Subquery(alias: String, child: LogicalPlan) extends UnaryNode with Product with Serializable
abstract class UnaryNode extends LogicalPlan with trees.UnaryNode[LogicalPlan]

A logical plan node with single child.
case class Union(left: LogicalPlan, right: LogicalPlan) extends BinaryNode with Product with Serializable
case class Window(projectList: Seq[Attribute], windowExpressions: Seq[NamedExpression], windowSpec: WindowSpecDefinition, child: LogicalPlan) extends UnaryNode with Product with Serializable
case class With(child: LogicalPlan, cteRelations: Map[String, Subquery]) extends UnaryNode with Product with Serializable

A container for holding named common table expressions (CTEs) and a query plan.
A container for holding named common table expressions (CTEs) and a query plan. This operator will be removed during analysis and the relations will be substituted into child.
child
The final query of this CTE.
cteRelations
Queries that this CTE defined, key is the alias of the CTE definition, value is the CTE definition.
case class WithWindowDefinition(windowDefinitions: Map[String, WindowSpecDefinition], child: LogicalPlan) extends UnaryNode with Product with Serializable
case class WriteToFile(path: String, child: LogicalPlan) extends UnaryNode with Product with Serializable

Value Members

object LocalRelation extends Serializable
object OneRowRelation extends LeafNode with Product with Serializable

A relation with one row.
A relation with one row. This is used in "SELECT ..." without a from clause.

package logical

Type Members

case class Aggregate(groupingExpressions: Seq[Expression], aggregateExpressions: Seq[NamedExpression], child: LogicalPlan) extends UnaryNode with Product with Serializable

abstract class BinaryNode extends LogicalPlan with trees.BinaryNode[LogicalPlan]

trait Command extends AnyRef

case class Cube(groupByExprs: Seq[Expression], child: LogicalPlan, aggregations: Seq[NamedExpression], gid: AttributeReference = VirtualColumn.newGroupingId) extends UnaryNode with GroupingAnalytics with Product with Serializable

case class Distinct(child: LogicalPlan) extends UnaryNode with Product with Serializable

case class Except(left: LogicalPlan, right: LogicalPlan) extends BinaryNode with Product with Serializable

case class Expand(projections: Seq[GroupExpression], output: Seq[Attribute], child: LogicalPlan) extends UnaryNode with Product with Serializable

case class Filter(condition: Expression, child: LogicalPlan) extends UnaryNode with Product with Serializable

case class Generate(generator: Generator, join: Boolean, outer: Boolean, qualifier: Option[String], generatorOutput: Seq[Attribute], child: LogicalPlan) extends UnaryNode with Product with Serializable

trait GroupingAnalytics extends UnaryNode

case class GroupingSets(bitmasks: Seq[Int], groupByExprs: Seq[Expression], child: LogicalPlan, aggregations: Seq[NamedExpression], gid: AttributeReference = VirtualColumn.newGroupingId) extends UnaryNode with GroupingAnalytics with Product with Serializable

case class InsertIntoTable(table: LogicalPlan, partition: Map[String, Option[String]], child: LogicalPlan, overwrite: Boolean, ifNotExists: Boolean) extends LogicalPlan with Product with Serializable

case class Intersect(left: LogicalPlan, right: LogicalPlan) extends BinaryNode with Product with Serializable

case class Join(left: LogicalPlan, right: LogicalPlan, joinType: JoinType, condition: Option[Expression]) extends BinaryNode with Product with Serializable

abstract class LeafNode extends LogicalPlan with trees.LeafNode[LogicalPlan]

case class Limit(limitExpr: Expression, child: LogicalPlan) extends UnaryNode with Product with Serializable

case class LocalRelation(output: Seq[Attribute], data: Seq[Row] = Nil) extends LeafNode with MultiInstanceRelation with Product with Serializable

abstract class LogicalPlan extends QueryPlan[LogicalPlan] with Logging

case class Project(projectList: Seq[NamedExpression], child: LogicalPlan) extends UnaryNode with Product with Serializable

abstract class RedistributeData extends UnaryNode

case class Repartition(numPartitions: Int, shuffle: Boolean, child: LogicalPlan) extends UnaryNode with Product with Serializable

case class RepartitionByExpression(partitionExpressions: Seq[Expression], child: LogicalPlan) extends RedistributeData with Product with Serializable

case class Rollup(groupByExprs: Seq[Expression], child: LogicalPlan, aggregations: Seq[NamedExpression], gid: AttributeReference = VirtualColumn.newGroupingId) extends UnaryNode with GroupingAnalytics with Product with Serializable

case class Sample(lowerBound: Double, upperBound: Double, withReplacement: Boolean, seed: Long, child: LogicalPlan) extends UnaryNode with Product with Serializable

trait ScriptInputOutputSchema extends AnyRef

case class ScriptTransformation(input: Seq[Expression], script: String, output: Seq[Attribute], child: LogicalPlan, ioschema: ScriptInputOutputSchema) extends UnaryNode with Product with Serializable

case class Sort(order: Seq[SortOrder], global: Boolean, child: LogicalPlan) extends UnaryNode with Product with Serializable

case class SortPartitions(sortExpressions: Seq[SortOrder], child: LogicalPlan) extends RedistributeData with Product with Serializable

case class Subquery(alias: String, child: LogicalPlan) extends UnaryNode with Product with Serializable

abstract class UnaryNode extends LogicalPlan with trees.UnaryNode[LogicalPlan]

case class Union(left: LogicalPlan, right: LogicalPlan) extends BinaryNode with Product with Serializable

case class Window(projectList: Seq[Attribute], windowExpressions: Seq[NamedExpression], windowSpec: WindowSpecDefinition, child: LogicalPlan) extends UnaryNode with Product with Serializable

case class With(child: LogicalPlan, cteRelations: Map[String, Subquery]) extends UnaryNode with Product with Serializable

case class WithWindowDefinition(windowDefinitions: Map[String, WindowSpecDefinition], child: LogicalPlan) extends UnaryNode with Product with Serializable

case class WriteToFile(path: String, child: LogicalPlan) extends UnaryNode with Product with Serializable

Value Members

object LocalRelation extends Serializable

object OneRowRelation extends LeafNode with Product with Serializable

Ungrouped