Class/Object

org.apache.spark.sql.execution

GroupedIterator

Related Docs: object GroupedIterator | package execution

Permalink

class GroupedIterator extends Iterator[(InternalRow, Iterator[InternalRow])]

Iterates over a presorted set of rows, chunking it up by the grouping expression. Each call to next will return a pair containing the current group and an iterator that will return all the elements of that group. Iterators for each group are lazily constructed by extracting rows from the input iterator. As such, full groups are never materialized by this class.

Example input:

Input: [a, 1], [b, 2], [b, 3]
Grouping: x#1
InputSchema: x#1, y#2

Result:

First call to next():  ([a], Iterator([a, 1])
Second call to next(): ([b], Iterator([b, 2], [b, 3])

Note, the class does not handle the case of an empty input for simplicity of implementation. Use the factory to construct a new instance.

Linear Supertypes
Iterator[(InternalRow, Iterator[InternalRow])], TraversableOnce[(InternalRow, Iterator[InternalRow])], GenTraversableOnce[(InternalRow, Iterator[InternalRow])], AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. GroupedIterator
  2. Iterator
  3. TraversableOnce
  4. GenTraversableOnce
  5. AnyRef
  6. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Type Members

  1. class GroupedIterator[B >: A] extends AbstractIterator[Seq[B]] with Iterator[Seq[B]]

    Permalink
    Definition Classes
    Iterator

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. def ++[B >: (InternalRow, Iterator[InternalRow])](that: ⇒ GenTraversableOnce[B]): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  4. def /:[B](z: B)(op: (B, (InternalRow, Iterator[InternalRow])) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  5. def :\[B](z: B)(op: ((InternalRow, Iterator[InternalRow]), B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  6. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  7. def addString(b: StringBuilder): StringBuilder

    Permalink
    Definition Classes
    TraversableOnce
  8. def addString(b: StringBuilder, sep: String): StringBuilder

    Permalink
    Definition Classes
    TraversableOnce
  9. def addString(b: StringBuilder, start: String, sep: String, end: String): StringBuilder

    Permalink
    Definition Classes
    TraversableOnce
  10. def aggregate[B](z: ⇒ B)(seqop: (B, (InternalRow, Iterator[InternalRow])) ⇒ B, combop: (B, B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  11. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  12. def buffered: BufferedIterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  13. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  14. def collect[B](pf: PartialFunction[(InternalRow, Iterator[InternalRow]), B]): Iterator[B]

    Permalink
    Definition Classes
    Iterator
    Annotations
    @migration
    Migration

    (Changed in version 2.8.0) collect has changed. The previous behavior can be reproduced with toSeq.

  15. def collectFirst[B](pf: PartialFunction[(InternalRow, Iterator[InternalRow]), B]): Option[B]

    Permalink
    Definition Classes
    TraversableOnce
  16. def contains(elem: Any): Boolean

    Permalink
    Definition Classes
    Iterator
  17. def copyToArray[B >: (InternalRow, Iterator[InternalRow])](xs: Array[B], start: Int, len: Int): Unit

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  18. def copyToArray[B >: (InternalRow, Iterator[InternalRow])](xs: Array[B]): Unit

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  19. def copyToArray[B >: (InternalRow, Iterator[InternalRow])](xs: Array[B], start: Int): Unit

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  20. def copyToBuffer[B >: (InternalRow, Iterator[InternalRow])](dest: Buffer[B]): Unit

    Permalink
    Definition Classes
    TraversableOnce
  21. def corresponds[B](that: GenTraversableOnce[B])(p: ((InternalRow, Iterator[InternalRow]), B) ⇒ Boolean): Boolean

    Permalink
    Definition Classes
    Iterator
  22. def count(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Int

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  23. var currentGroup: InternalRow

    Permalink

    Holds a copy of an input row that is in the current group.

  24. var currentIterator: Iterator[InternalRow]

    Permalink
  25. var currentRow: InternalRow

    Permalink

    Holds null or the row that will be returned on next call to next() in the inner iterator.

  26. def drop(n: Int): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  27. def dropWhile(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  28. def duplicate: (Iterator[(InternalRow, Iterator[InternalRow])], Iterator[(InternalRow, Iterator[InternalRow])])

    Permalink
    Definition Classes
    Iterator
  29. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  30. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  31. def exists(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  32. def filter(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  33. def filterNot(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  34. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  35. def find(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Option[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  36. def flatMap[B](f: ((InternalRow, Iterator[InternalRow])) ⇒ GenTraversableOnce[B]): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  37. def fold[A1 >: (InternalRow, Iterator[InternalRow])](z: A1)(op: (A1, A1) ⇒ A1): A1

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  38. def foldLeft[B](z: B)(op: (B, (InternalRow, Iterator[InternalRow])) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  39. def foldRight[B](z: B)(op: ((InternalRow, Iterator[InternalRow]), B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  40. def forall(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  41. def foreach[U](f: ((InternalRow, Iterator[InternalRow])) ⇒ U): Unit

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  42. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  43. def grouped[B >: (InternalRow, Iterator[InternalRow])](size: Int): GroupedIterator[B]

    Permalink
    Definition Classes
    Iterator
  44. def hasDefiniteSize: Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  45. def hasNext: Boolean

    Permalink

    Return true if we already have the next iterator or fetching a new iterator is successful.

    Return true if we already have the next iterator or fetching a new iterator is successful.

    Note that, if we get the iterator by next, we should consume it before call hasNext, because we will consume the input data to skip to next group while fetching a new iterator, thus make the previous iterator empty.

    Definition Classes
    GroupedIterator → Iterator
  46. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  47. def indexOf[B >: (InternalRow, Iterator[InternalRow])](elem: B): Int

    Permalink
    Definition Classes
    Iterator
  48. def indexWhere(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Int

    Permalink
    Definition Classes
    Iterator
  49. def isEmpty: Boolean

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  50. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  51. def isTraversableAgain: Boolean

    Permalink
    Definition Classes
    Iterator → GenTraversableOnce
  52. val keyOrdering: Ordering[InternalRow]

    Permalink
  53. val keyProjection: UnsafeProjection

    Permalink

    Creates a row containing only the key for a given input row.

  54. def length: Int

    Permalink
    Definition Classes
    Iterator
  55. def map[B](f: ((InternalRow, Iterator[InternalRow])) ⇒ B): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  56. def max[B >: (InternalRow, Iterator[InternalRow])](implicit cmp: Ordering[B]): (InternalRow, Iterator[InternalRow])

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  57. def maxBy[B](f: ((InternalRow, Iterator[InternalRow])) ⇒ B)(implicit cmp: Ordering[B]): (InternalRow, Iterator[InternalRow])

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  58. def min[B >: (InternalRow, Iterator[InternalRow])](implicit cmp: Ordering[B]): (InternalRow, Iterator[InternalRow])

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  59. def minBy[B](f: ((InternalRow, Iterator[InternalRow])) ⇒ B)(implicit cmp: Ordering[B]): (InternalRow, Iterator[InternalRow])

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  60. def mkString: String

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  61. def mkString(sep: String): String

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  62. def mkString(start: String, sep: String, end: String): String

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  63. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  64. def next(): (InternalRow, Iterator[InternalRow])

    Permalink
    Definition Classes
    GroupedIterator → Iterator
  65. def nonEmpty: Boolean

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  66. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  67. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  68. def padTo[A1 >: (InternalRow, Iterator[InternalRow])](len: Int, elem: A1): Iterator[A1]

    Permalink
    Definition Classes
    Iterator
  69. def partition(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): (Iterator[(InternalRow, Iterator[InternalRow])], Iterator[(InternalRow, Iterator[InternalRow])])

    Permalink
    Definition Classes
    Iterator
  70. def patch[B >: (InternalRow, Iterator[InternalRow])](from: Int, patchElems: Iterator[B], replaced: Int): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  71. def product[B >: (InternalRow, Iterator[InternalRow])](implicit num: Numeric[B]): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  72. def reduce[A1 >: (InternalRow, Iterator[InternalRow])](op: (A1, A1) ⇒ A1): A1

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  73. def reduceLeft[B >: (InternalRow, Iterator[InternalRow])](op: (B, (InternalRow, Iterator[InternalRow])) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce
  74. def reduceLeftOption[B >: (InternalRow, Iterator[InternalRow])](op: (B, (InternalRow, Iterator[InternalRow])) ⇒ B): Option[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  75. def reduceOption[A1 >: (InternalRow, Iterator[InternalRow])](op: (A1, A1) ⇒ A1): Option[A1]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  76. def reduceRight[B >: (InternalRow, Iterator[InternalRow])](op: ((InternalRow, Iterator[InternalRow]), B) ⇒ B): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  77. def reduceRightOption[B >: (InternalRow, Iterator[InternalRow])](op: ((InternalRow, Iterator[InternalRow]), B) ⇒ B): Option[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  78. def reversed: List[(InternalRow, Iterator[InternalRow])]

    Permalink
    Attributes
    protected[this]
    Definition Classes
    TraversableOnce
  79. def sameElements(that: Iterator[_]): Boolean

    Permalink
    Definition Classes
    Iterator
  80. def scanLeft[B](z: B)(op: (B, (InternalRow, Iterator[InternalRow])) ⇒ B): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  81. def scanRight[B](z: B)(op: ((InternalRow, Iterator[InternalRow]), B) ⇒ B): Iterator[B]

    Permalink
    Definition Classes
    Iterator
  82. def seq: Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  83. def size: Int

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  84. def slice(from: Int, until: Int): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  85. def sliding[B >: (InternalRow, Iterator[InternalRow])](size: Int, step: Int): GroupedIterator[B]

    Permalink
    Definition Classes
    Iterator
  86. val sortOrder: Seq[SortOrder]

    Permalink

    Compares two input rows and returns 0 if they are in the same group.

  87. def span(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): (Iterator[(InternalRow, Iterator[InternalRow])], Iterator[(InternalRow, Iterator[InternalRow])])

    Permalink
    Definition Classes
    Iterator
  88. def sum[B >: (InternalRow, Iterator[InternalRow])](implicit num: Numeric[B]): B

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  89. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  90. def take(n: Int): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  91. def takeWhile(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  92. def to[Col[_]](implicit cbf: CanBuildFrom[Nothing, (InternalRow, Iterator[InternalRow]), Col[(InternalRow, Iterator[InternalRow])]]): Col[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  93. def toArray[B >: (InternalRow, Iterator[InternalRow])](implicit arg0: ClassTag[B]): Array[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  94. def toBuffer[B >: (InternalRow, Iterator[InternalRow])]: Buffer[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  95. def toIndexedSeq: IndexedSeq[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  96. def toIterable: Iterable[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  97. def toIterator: Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator → GenTraversableOnce
  98. def toList: List[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  99. def toMap[T, U](implicit ev: <:<[(InternalRow, Iterator[InternalRow]), (T, U)]): Map[T, U]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  100. def toSeq: Seq[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  101. def toSet[B >: (InternalRow, Iterator[InternalRow])]: Set[B]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  102. def toStream: Stream[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator → GenTraversableOnce
  103. def toString(): String

    Permalink
    Definition Classes
    Iterator → AnyRef → Any
  104. def toTraversable: Traversable[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator → TraversableOnce → GenTraversableOnce
  105. def toVector: Vector[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    TraversableOnce → GenTraversableOnce
  106. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  107. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  108. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  109. def withFilter(p: ((InternalRow, Iterator[InternalRow])) ⇒ Boolean): Iterator[(InternalRow, Iterator[InternalRow])]

    Permalink
    Definition Classes
    Iterator
  110. def zip[B](that: Iterator[B]): Iterator[((InternalRow, Iterator[InternalRow]), B)]

    Permalink
    Definition Classes
    Iterator
  111. def zipAll[B, A1 >: (InternalRow, Iterator[InternalRow]), B1 >: B](that: Iterator[B], thisElem: A1, thatElem: B1): Iterator[(A1, B1)]

    Permalink
    Definition Classes
    Iterator
  112. def zipWithIndex: Iterator[((InternalRow, Iterator[InternalRow]), Int)]

    Permalink
    Definition Classes
    Iterator

Inherited from Iterator[(InternalRow, Iterator[InternalRow])]

Inherited from TraversableOnce[(InternalRow, Iterator[InternalRow])]

Inherited from GenTraversableOnce[(InternalRow, Iterator[InternalRow])]

Inherited from AnyRef

Inherited from Any

Ungrouped