org.apache.spark.sql.execution

OptimizeMetadataOnlyQuery

case class OptimizeMetadataOnlyQuery(catalog: SessionCatalog) extends Rule[LogicalPlan] with Product with Serializable

This rule optimizes the execution of queries that can be answered by looking only at partition-level metadata. It applies when all the columns scanned are partition columns and the query has an aggregate operator that satisfies one of the following conditions:

  1. The aggregate expressions are partition columns, e.g. SELECT col FROM tbl GROUP BY col.
  2. An aggregate function on partition columns uses DISTINCT, e.g. SELECT col1, count(DISTINCT col2) FROM tbl GROUP BY col1.
  3. An aggregate function on partition columns returns the same result with or without the DISTINCT keyword, e.g. SELECT col1, Max(col2) FROM tbl GROUP BY col1.
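The three query shapes above can be tried out with a sketch along the following lines. The table name events, its columns (id, part, day) and the object name are hypothetical and chosen only for illustration. The example assumes Spark SQL is on the classpath and that the rule is gated by the spark.sql.optimizer.metadataOnly setting, whose default may differ between Spark versions.

```scala
import org.apache.spark.sql.SparkSession

object MetadataOnlyQueryExample {
  // One query per condition in the description above.
  // "events", "part" and "day" are hypothetical names.
  val eligibleQueries: Seq[String] = Seq(
    "SELECT part FROM events GROUP BY part",
    "SELECT part, count(DISTINCT day) FROM events GROUP BY part",
    "SELECT part, max(day) FROM events GROUP BY part"
  )

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("metadata-only-sketch")
      // Assumption: this setting enables the rule in the Spark version in use.
      .config("spark.sql.optimizer.metadataOnly", "true")
      .getOrCreate()
    import spark.implicits._

    // Write a small table partitioned by the two columns the queries scan.
    Seq((1, "a", "2020-01-01"), (2, "b", "2020-01-02"))
      .toDF("id", "part", "day")
      .write.mode("overwrite").partitionBy("part", "day").saveAsTable("events")

    // Print the parsed, analyzed, optimized and physical plans for each query.
    eligibleQueries.foreach(q => spark.sql(q).explain(true))

    spark.stop()
  }
}
```

Every query here scans only the partition columns part and day. If the rule applies, the optimized plan printed by explain would be expected to read partition values from metadata instead of scanning data files, though the exact plan output depends on the Spark version and table format.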

Linear Supertypes
Serializable, Serializable, Product, Equals, Rule[LogicalPlan], Logging, AnyRef, Any

Instance Constructors

  1. new OptimizeMetadataOnlyQuery(catalog: SessionCatalog)

Value Members

  1. def apply(plan: LogicalPlan): LogicalPlan
    Definition Classes
    OptimizeMetadataOnlyQuery → Rule
  2. val catalog: SessionCatalog
  3. val ruleName: String
    Definition Classes
    Rule
  4. object PartitionedRelation extends PredicateHelper

    A pattern that finds the partitioned table relation node inside the given plan, and returns a pair of the partition attributes and the table relation node.