Class

org.apache.spark.sql.execution.aggregate

TungstenAggregationIterator

Related Doc: package aggregate

Permalink

class TungstenAggregationIterator extends Iterator[UnsafeRow] with Logging

An iterator used to evaluate aggregate functions. It operates on UnsafeRows.

This iterator first uses hash-based aggregation to process input rows. It uses a hash map to store groups and their corresponding aggregation buffers. If we this map cannot allocate memory from org.apache.spark.shuffle.ShuffleMemoryManager, it switches to sort-based aggregation. The process of the switch has the following step:

The code of this class is organized as follows:

Linear Supertypes
Logging, Iterator[UnsafeRow], TraversableOnce[UnsafeRow], GenTraversableOnce[UnsafeRow], AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. TungstenAggregationIterator
  2. Logging
  3. Iterator
  4. TraversableOnce
  5. GenTraversableOnce
  6. AnyRef
  7. Any
  1. Hide All
  2. Show all
Visibility
  1. Public
  2. All

Instance Constructors

  1. new TungstenAggregationIterator(groupingExpressions: Seq[NamedExpression], nonCompleteAggregateExpressions: Seq[AggregateExpression2], completeAggregateExpressions: Seq[AggregateExpression2], initialInputBufferOffset: Int, resultExpressions: Seq[NamedExpression], newMutableProjection: (Seq[Expression], Seq[Attribute]) ⇒ () ⇒ MutableProjection, originalInputAttributes: Seq[Attribute], testFallbackStartsAt: Option[Int], numInputRows: LongSQLMetric, numOutputRows: LongSQLMetric)

    Permalink

    groupingExpressions

    expressions for grouping keys

    nonCompleteAggregateExpressions

    AggregateExpression2 containing AggregateFunction2s with mode Partial, PartialMerge, or Final.

    completeAggregateExpressions

    AggregateExpression2 containing AggregateFunction2s with mode Complete.

    initialInputBufferOffset

    If this iterator is used to handle functions with mode PartialMerge or Final. The input rows have the format of grouping keys + aggregation buffer. This offset indicates the starting position of aggregation buffer in a input row.

    resultExpressions

    expressions for generating output rows.

    newMutableProjection

    the function used to create mutable projections.

    originalInputAttributes

    attributes of representing input rows from inputIter.

Type Members

  1. class GroupedIterator[B >: A] extends AbstractIterator[Seq[B]] with Iterator[Seq[B]]

    Permalink
    Definition Classes
    Iterator

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. def ++[B >: UnsafeRow](that: ⇒ GenTraversableOnce[B]): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  4. def /:[B](z: B)(op: (B, UnsafeRow) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  5. def :\[B](z: B)(op: (UnsafeRow, B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  6. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  7. def addString(b: StringBuilder): StringBuilder

    Permalink
    Definition Classes
    TraversableOnce
  8. def addString(b: StringBuilder, sep: String): StringBuilder

    Permalink
    Definition Classes
    TraversableOnce
  9. def addString(b: StringBuilder, start: String, sep: String, end: String): StringBuilder

    Permalink
    Definition Classes
    TraversableOnce
  10. def aggregate[B](z: ⇒ B)(seqop: (B, UnsafeRow) ⇒ B, combop: (B, B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  11. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  12. def buffered: BufferedIterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  13. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  14. def collect[B](pf: PartialFunction[UnsafeRow, B]): Iterator[B]

    Permalink
    Definition Classes
    Iterator
    Annotations
    @migration
    Migration

    (Changed in version 2.8.0) collect has changed. The previous behavior can be reproduced with toSeq.

  15. def collectFirst[B](pf: PartialFunction[UnsafeRow, B]): Option[B]

    Permalink
    Definition Classes
    TraversableOnce
  16. def contains(elem: Any): Boolean

    Permalink
    Definition Classes
    Iterator
  17. def copyToArray[B >: UnsafeRow](xs: Array[B], start: Int, len: Int): Unit

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  18. def copyToArray[B >: UnsafeRow](xs: Array[B]): Unit

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  19. def copyToArray[B >: UnsafeRow](xs: Array[B], start: Int): Unit

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  20. def copyToBuffer[B >: UnsafeRow](dest: Buffer[B]): Unit

    Permalink
    Definition Classes
    TraversableOnce
  21. def corresponds[B](that: GenTraversableOnce[B])(p: (UnsafeRow, B) ⇒ Boolean): Boolean

    Permalink
    Definition Classes
    Iterator
  22. def count(p: (UnsafeRow) ⇒ Boolean): Int

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  23. def drop(n: Int): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  24. def dropWhile(p: (UnsafeRow) ⇒ Boolean): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  25. def duplicate: (Iterator[UnsafeRow], Iterator[UnsafeRow])

    Permalink
    Definition Classes
    Iterator
  26. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  27. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  28. def exists(p: (UnsafeRow) ⇒ Boolean): Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  29. def filter(p: (UnsafeRow) ⇒ Boolean): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  30. def filterNot(p: (UnsafeRow) ⇒ Boolean): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  31. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  32. def find(p: (UnsafeRow) ⇒ Boolean): Option[UnsafeRow]

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  33. def flatMap[B](f: (UnsafeRow) ⇒ GenTraversableOnce[B]): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  34. def fold[A1 >: UnsafeRow](z: A1)(op: (A1, A1) ⇒ A1): A1

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  35. def foldLeft[B](z: B)(op: (B, UnsafeRow) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  36. def foldRight[B](z: B)(op: (UnsafeRow, B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  37. def forall(p: (UnsafeRow) ⇒ Boolean): Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  38. def foreach[U](f: (UnsafeRow) ⇒ U): Unit

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  39. def free(): Unit

    Permalink

    Free memory used in the underlying map.

  40. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  41. def grouped[B >: UnsafeRow](size: Int): GroupedIterator[B]

    Permalink
    Definition Classes
    Iterator
  42. def hasDefiniteSize: Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  43. final def hasNext: Boolean

    Permalink
    Definition Classes
    TungstenAggregationIterator → Iterator
  44. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  45. def indexOf[B >: UnsafeRow](elem: B): Int

    Permalink
    Definition Classes
    Iterator
  46. def indexWhere(p: (UnsafeRow) ⇒ Boolean): Int

    Permalink
    Definition Classes
    Iterator
  47. def isEmpty: Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  48. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  49. def isTraceEnabled(): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  50. def isTraversableAgain: Boolean

    Permalink
    Definition Classes
    Iterator → GenTraversableOnce
  51. def length: Int

    Permalink
    Definition Classes
    Iterator
  52. def log: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  53. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  54. def logDebug(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  55. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  56. def logError(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  57. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  58. def logInfo(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  59. def logName: String

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  60. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  61. def logTrace(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  62. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  63. def logWarning(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  64. def map[B](f: (UnsafeRow) ⇒ B): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  65. def max[B >: UnsafeRow](implicit cmp: Ordering[B]): UnsafeRow

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  66. def maxBy[B](f: (UnsafeRow) ⇒ B)(implicit cmp: Ordering[B]): UnsafeRow

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  67. def min[B >: UnsafeRow](implicit cmp: Ordering[B]): UnsafeRow

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  68. def minBy[B](f: (UnsafeRow) ⇒ B)(implicit cmp: Ordering[B]): UnsafeRow

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  69. def mkString: String

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  70. def mkString(sep: String): String

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  71. def mkString(start: String, sep: String, end: String): String

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  72. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  73. final def next(): UnsafeRow

    Permalink
    Definition Classes
    TungstenAggregationIterator → Iterator
  74. def nonEmpty: Boolean

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  75. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  76. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  77. def outputForEmptyGroupingKeyWithoutInput(): UnsafeRow

    Permalink

    Generate a output row when there is no input and there is no grouping expression.

  78. def padTo[A1 >: UnsafeRow](len: Int, elem: A1): Iterator[A1]

    Permalink
    Definition Classes
    Iterator
  79. def partition(p: (UnsafeRow) ⇒ Boolean): (Iterator[UnsafeRow], Iterator[UnsafeRow])

    Permalink
    Definition Classes
    Iterator
  80. def patch[B >: UnsafeRow](from: Int, patchElems: Iterator[B], replaced: Int): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  81. def product[B >: UnsafeRow](implicit num: Numeric[B]): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  82. def reduce[A1 >: UnsafeRow](op: (A1, A1) ⇒ A1): A1

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  83. def reduceLeft[B >: UnsafeRow](op: (B, UnsafeRow) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce
  84. def reduceLeftOption[B >: UnsafeRow](op: (B, UnsafeRow) ⇒ B): Option[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  85. def reduceOption[A1 >: UnsafeRow](op: (A1, A1) ⇒ A1): Option[A1]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  86. def reduceRight[B >: UnsafeRow](op: (UnsafeRow, B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  87. def reduceRightOption[B >: UnsafeRow](op: (UnsafeRow, B) ⇒ B): Option[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  88. def reversed: List[UnsafeRow]

    Permalink
    Attributes
    protected[this]
    Definition Classes
    TraversableOnce
  89. def sameElements(that: Iterator[_]): Boolean

    Permalink
    Definition Classes
    Iterator
  90. def scanLeft[B](z: B)(op: (B, UnsafeRow) ⇒ B): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  91. def scanRight[B](z: B)(op: (UnsafeRow, B) ⇒ B): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  92. def seq: Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  93. def size: Int

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  94. def slice(from: Int, until: Int): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  95. def sliding[B >: UnsafeRow](size: Int, step: Int): GroupedIterator[B]

    Permalink
    Definition Classes
    Iterator
  96. def span(p: (UnsafeRow) ⇒ Boolean): (Iterator[UnsafeRow], Iterator[UnsafeRow])

    Permalink
    Definition Classes
    Iterator
  97. def start(parentIter: Iterator[InternalRow]): Unit

    Permalink

    Start processing input rows.

    Start processing input rows. Only after this method is called will this iterator be non-empty.

  98. def sum[B >: UnsafeRow](implicit num: Numeric[B]): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  99. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  100. def take(n: Int): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  101. def takeWhile(p: (UnsafeRow) ⇒ Boolean): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  102. def to[Col[_]](implicit cbf: CanBuildFrom[Nothing, UnsafeRow, Col[UnsafeRow]]): Col[UnsafeRow]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  103. def toArray[B >: UnsafeRow](implicit arg0: ClassTag[B]): Array[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  104. def toBuffer[B >: UnsafeRow]: Buffer[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  105. def toIndexedSeq: IndexedSeq[UnsafeRow]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  106. def toIterable: Iterable[UnsafeRow]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  107. def toIterator: Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator → GenTraversableOnce
  108. def toList: List[UnsafeRow]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  109. def toMap[T, U](implicit ev: <:<[UnsafeRow, (T, U)]): Map[T, U]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  110. def toSeq: Seq[UnsafeRow]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  111. def toSet[B >: UnsafeRow]: Set[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  112. def toStream: Stream[UnsafeRow]

    Permalink
    Definition Classes
    Iterator → GenTraversableOnce
  113. def toString(): String

    Permalink
    Definition Classes
    Iterator → AnyRef → Any
  114. def toTraversable: Traversable[UnsafeRow]

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  115. def toVector: Vector[UnsafeRow]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  116. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  117. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  118. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  119. def withFilter(p: (UnsafeRow) ⇒ Boolean): Iterator[UnsafeRow]

    Permalink
    Definition Classes
    Iterator
  120. def zip[B](that: Iterator[B]): Iterator[(UnsafeRow, B)]

    Permalink
    Definition Classes
    Iterator
  121. def zipAll[B, A1 >: UnsafeRow, B1 >: B](that: Iterator[B], thisElem: A1, thatElem: B1): Iterator[(A1, B1)]

    Permalink
    Definition Classes
    Iterator
  122. def zipWithIndex: Iterator[(UnsafeRow, Int)]

    Permalink
    Definition Classes
    Iterator

Inherited from Logging

Inherited from Iterator[UnsafeRow]

Inherited from TraversableOnce[UnsafeRow]

Inherited from GenTraversableOnce[UnsafeRow]

Inherited from AnyRef

Inherited from Any

Ungrouped