aggregate

Type Members

class AggregationBufferEntry extends AnyRef
abstract class AggregationIterator extends Iterator[UnsafeRow] with Logging

The base class of SortBasedAggregationIterator and TungstenAggregationIterator.
sealed trait BufferSetterGetterUtils extends AnyRef

A helper trait used to create specialized setter and getter for types supported by org.apache.spark.sql.execution.UnsafeFixedWidthAggregationMap's buffer.
case class ComplexTypedAggregateExpression(aggregator: expressions.Aggregator[Any, Any, Any], inputDeserializer: Option[Expression], inputClass: Option[Class[_]], inputSchema: Option[StructType], bufferSerializer: Seq[NamedExpression], bufferDeserializer: Expression, outputSerializer: Seq[Expression], dataType: DataType, nullable: Boolean, mutableAggBufferOffset: Int = 0, inputAggBufferOffset: Int = 0) extends TypedImperativeAggregate[Any] with TypedAggregateExpression with NonSQLExpression with Product with Serializable
case class HashAggregateExec(requiredChildDistributionExpressions: Option[Seq[Expression]], groupingExpressions: Seq[NamedExpression], aggregateExpressions: Seq[AggregateExpression], aggregateAttributes: Seq[Attribute], initialInputBufferOffset: Int, resultExpressions: Seq[NamedExpression], child: SparkPlan) extends SparkPlan with UnaryExecNode with CodegenSupport with Product with Serializable

Hash-based aggregate operator that can also fallback to sorting when data exceeds memory size.
abstract class HashMapGenerator extends AnyRef

This is a helper class to generate an append-only row-based hash map that can act as a 'cache' for extremely fast key-value lookups while evaluating aggregates (and fall back to the BytesToBytesMap if a given key isn't found).
class ObjectAggregationIterator extends AggregationIterator with Logging
class ObjectAggregationMap extends AnyRef

An aggregation map that supports using safe SpecificInternalRows aggregation buffers, so that we can support storing arbitrary Java objects as aggregate function states in the aggregation buffers.
case class ObjectHashAggregateExec(requiredChildDistributionExpressions: Option[Seq[Expression]], groupingExpressions: Seq[NamedExpression], aggregateExpressions: Seq[AggregateExpression], aggregateAttributes: Seq[Attribute], initialInputBufferOffset: Int, resultExpressions: Seq[NamedExpression], child: SparkPlan) extends SparkPlan with UnaryExecNode with Product with Serializable

A hash-based aggregate operator that supports TypedImperativeAggregate functions that may use arbitrary JVM objects as aggregation states.
class RowBasedHashMapGenerator extends HashMapGenerator

This is a helper class to generate an append-only row-based hash map that can act as a 'cache' for extremely fast key-value lookups while evaluating aggregates (and fall back to the BytesToBytesMap if a given key isn't found).
case class ScalaUDAF(children: Seq[Expression], udaf: UserDefinedAggregateFunction, mutableAggBufferOffset: Int = 0, inputAggBufferOffset: Int = 0) extends ImperativeAggregate with NonSQLExpression with Logging with ImplicitCastInputTypes with Product with Serializable

The internal wrapper used to hook a UserDefinedAggregateFunction udaf in the internal aggregation code path.
case class SimpleTypedAggregateExpression(aggregator: expressions.Aggregator[Any, Any, Any], inputDeserializer: Option[Expression], inputClass: Option[Class[_]], inputSchema: Option[StructType], bufferSerializer: Seq[NamedExpression], bufferDeserializer: Expression, outputSerializer: Seq[Expression], outputExternalType: DataType, dataType: DataType, nullable: Boolean) extends DeclarativeAggregate with TypedAggregateExpression with NonSQLExpression with Product with Serializable
case class SortAggregateExec(requiredChildDistributionExpressions: Option[Seq[Expression]], groupingExpressions: Seq[NamedExpression], aggregateExpressions: Seq[AggregateExpression], aggregateAttributes: Seq[Attribute], initialInputBufferOffset: Int, resultExpressions: Seq[NamedExpression], child: SparkPlan) extends SparkPlan with UnaryExecNode with Product with Serializable

Sort-based aggregate operator.
class SortBasedAggregationIterator extends AggregationIterator

An iterator used to evaluate AggregateFunction.
class SortBasedAggregator extends AnyRef

A class used to handle sort-based aggregation, used together with ObjectHashAggregateExec.
class TungstenAggregationIterator extends AggregationIterator with Logging

An iterator used to evaluate aggregate functions.
trait TypedAggregateExpression extends AggregateFunction

A helper class to hook Aggregator into the aggregation system.
class TypedAverage[IN] extends expressions.Aggregator[IN, (Double, Long), Double]
class TypedCount[IN] extends expressions.Aggregator[IN, Long, Long]
class TypedSumDouble[IN] extends expressions.Aggregator[IN, Double, Double]
class TypedSumLong[IN] extends expressions.Aggregator[IN, Long, Long]
class VectorizedHashMapGenerator extends HashMapGenerator

This is a helper class to generate an append-only vectorized hash map that can act as a 'cache' for extremely fast key-value lookups while evaluating aggregates (and fall back to the BytesToBytesMap if a given key isn't found).

Value Members

object AggUtils

Utility functions used by the query planner to convert our plan to new aggregation code path.
object HashAggregateExec extends Serializable
object ObjectHashAggregateExec extends Serializable
object TypedAggregateExpression

package aggregate

Type Members

class AggregationBufferEntry extends AnyRef

abstract class AggregationIterator extends Iterator[UnsafeRow] with Logging

sealed trait BufferSetterGetterUtils extends AnyRef

abstract class HashMapGenerator extends AnyRef

class ObjectAggregationIterator extends AggregationIterator with Logging

class ObjectAggregationMap extends AnyRef

class RowBasedHashMapGenerator extends HashMapGenerator

case class ScalaUDAF(children: Seq[Expression], udaf: UserDefinedAggregateFunction, mutableAggBufferOffset: Int = 0, inputAggBufferOffset: Int = 0) extends ImperativeAggregate with NonSQLExpression with Logging with ImplicitCastInputTypes with Product with Serializable

class SortBasedAggregationIterator extends AggregationIterator

class SortBasedAggregator extends AnyRef

class TungstenAggregationIterator extends AggregationIterator with Logging

trait TypedAggregateExpression extends AggregateFunction

class TypedAverage[IN] extends expressions.Aggregator[IN, (Double, Long), Double]

class TypedCount[IN] extends expressions.Aggregator[IN, Long, Long]

class TypedSumDouble[IN] extends expressions.Aggregator[IN, Double, Double]

class TypedSumLong[IN] extends expressions.Aggregator[IN, Long, Long]

class VectorizedHashMapGenerator extends HashMapGenerator

Value Members

object AggUtils

object HashAggregateExec extends Serializable

object ObjectHashAggregateExec extends Serializable

object TypedAggregateExpression

Ungrouped