KeyedList

Abstract Value Members

abstract def filterKeys(fn: (K) ⇒ Boolean): KeyedList[K, T]

filter keys on a predicate.
filter keys on a predicate. More efficient than filter if you are only looking at keys

Definition Classes
KeyedListLike
abstract def mapGroup[V](smfn: (K, Iterator[T]) ⇒ Iterator[V]): KeyedList[K, V]

Operate on an Iterator[T] of all the values for each key at one time.
Operate on an Iterator[T] of all the values for each key at one time. Avoid accumulating the whole list in memory if you can. Prefer sum, which is partially executed map-side by default.

Definition Classes
KeyedListLike
abstract def toTypedPipe: TypedPipe[(K, T)]

End of the operations on values.
End of the operations on values. From this point on the keyed structure is lost and another shuffle is generally required to reconstruct it

Definition Classes
KeyedListLike

Concrete Value Members

final def !=(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def !=(arg0: Any): Boolean

Definition Classes
Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def ==(arg0: Any): Boolean

Definition Classes
Any
def aggregate[B, C](agg: Aggregator[T, B, C]): KeyedList[K, C]

Use Algebird Aggregator to do the reduction
Use Algebird Aggregator to do the reduction

Definition Classes
KeyedListLike
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
def count(fn: (T) ⇒ Boolean): KeyedList[K, Long]

Definition Classes
KeyedListLike
def drop(n: Int): KeyedList[K, T]

Selects all elements except first n ones.
Selects all elements except first n ones.

Definition Classes
KeyedListLike
def dropWhile(p: (T) ⇒ Boolean): KeyedList[K, T]

Drops longest prefix of elements that satisfy the given predicate.
Drops longest prefix of elements that satisfy the given predicate.

Definition Classes
KeyedListLike
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def filter(fn: ((K, T)) ⇒ Boolean): KeyedList[K, T]

.
.filter(fn).toTypedPipe == .toTypedPipe.filter(fn) It is generally better to avoid going back to a TypedPipe as long as possible: this minimizes the times we go in and out of cascading/hadoop types.

Definition Classes
KeyedListLike
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
def foldLeft[B](z: B)(fn: (B, T) ⇒ B): KeyedList[K, B]

Definition Classes
KeyedListLike
def forall(fn: (T) ⇒ Boolean): KeyedList[K, Boolean]

Definition Classes
KeyedListLike
def forceToReducers: KeyedList[K, T]

This is just short hand for mapValueStream(identity), it makes sure the planner sees that you want to force a shuffle.
This is just short hand for mapValueStream(identity), it makes sure the planner sees that you want to force a shuffle. For expert tuning

Definition Classes
KeyedListLike
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
def head: KeyedList[K, T]

Use this to get the first value encountered.
Use this to get the first value encountered. prefer this to take(1).

Definition Classes
KeyedListLike
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
def keys: TypedPipe[K]

Definition Classes
KeyedListLike
def mapValueStream[V](smfn: (Iterator[T]) ⇒ Iterator[V]): KeyedList[K, V]

Use this when you don't care about the key for the group, otherwise use mapGroup
Use this when you don't care about the key for the group, otherwise use mapGroup

Definition Classes
KeyedListLike
def mapValues[V](fn: (T) ⇒ V): KeyedList[K, V]

This is a special case of mapValueStream, but can be optimized because it doesn't need all the values for a given key at once.
This is a special case of mapValueStream, but can be optimized because it doesn't need all the values for a given key at once. An unoptimized implementation is: mapValueStream { _.map { fn } } but for Grouped we can avoid resorting to mapValueStream

Definition Classes
KeyedListLike
def max[B >: T](implicit cmp: Ordering[B]): KeyedList[K, T]

Definition Classes
KeyedListLike
def maxBy[B](fn: (T) ⇒ B)(implicit cmp: Ordering[B]): KeyedList[K, T]

Definition Classes
KeyedListLike
def min[B >: T](implicit cmp: Ordering[B]): KeyedList[K, T]

Definition Classes
KeyedListLike
def minBy[B](fn: (T) ⇒ B)(implicit cmp: Ordering[B]): KeyedList[K, T]

Definition Classes
KeyedListLike
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
def product[U >: T](implicit ring: Ring[U]): KeyedList[K, U]

Definition Classes
KeyedListLike
def reduce[U >: T](fn: (U, U) ⇒ U): KeyedList[K, U]

reduce with fn which must be associative and commutative.
reduce with fn which must be associative and commutative. Like the above this can be optimized in some Grouped cases. If you don't have a commutative operator, use reduceLeft

Definition Classes
KeyedListLike
def reduceLeft[U >: T](fn: (U, U) ⇒ U): KeyedList[K, U]

Definition Classes
KeyedListLike
def scanLeft[B](z: B)(fn: (B, T) ⇒ B): KeyedList[K, B]

Definition Classes
KeyedListLike
def size: KeyedList[K, Long]

Definition Classes
KeyedListLike
def sortWithTake[U >: T](k: Int)(lessThan: (U, U) ⇒ Boolean): KeyedList[K, Seq[T]]

Like the above, but with a less than operation for the ordering
Like the above, but with a less than operation for the ordering

Definition Classes
KeyedListLike
def sortedReverseTake(k: Int)(implicit ord: Ordering[_ >: T]): KeyedList[K, Seq[T]]

Take the largest k things according to the implicit ordering.
Take the largest k things according to the implicit ordering. Useful for top-k without having to call ord.reverse

Definition Classes
KeyedListLike
def sortedTake(k: Int)(implicit ord: Ordering[_ >: T]): KeyedList[K, Seq[T]]

This implements bottom-k (smallest k items) on each mapper for each key, then sends those to reducers to get the result.
This implements bottom-k (smallest k items) on each mapper for each key, then sends those to reducers to get the result. This is faster than using .take if k * (number of Keys) is small enough to fit in memory.

Definition Classes
KeyedListLike
def sum[U >: T](implicit sg: Semigroup[U]): KeyedList[K, U]

If there is no ordering, we default to assuming the Semigroup is commutative.
If there is no ordering, we default to assuming the Semigroup is commutative. If you don't want that, define an ordering on the Values, or .forceToReducers.
Semigroups MAY have a faster implementation of sum for iterators, so prefer using sum/sumLeft to reduce

Definition Classes
KeyedListLike
def sumLeft[U >: T](implicit sg: Semigroup[U]): KeyedList[K, U]

Semigroups MAY have a faster implementation of sum for iterators, so prefer using sum/sumLeft to reduce/reduceLeft
Semigroups MAY have a faster implementation of sum for iterators, so prefer using sum/sumLeft to reduce/reduceLeft

Definition Classes
KeyedListLike
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def take(n: Int): KeyedList[K, T]

Selects first n elements.
Selects first n elements. Don't use this if n == 1, head is faster in that case.

Definition Classes
KeyedListLike
def takeWhile(p: (T) ⇒ Boolean): KeyedList[K, T]

Takes longest prefix of elements that satisfy the given predicate.
Takes longest prefix of elements that satisfy the given predicate.

Definition Classes
KeyedListLike
def toList: KeyedList[K, List[T]]

Definition Classes
KeyedListLike
def toSet[U >: T]: KeyedList[K, Set[U]]

Definition Classes
KeyedListLike
def toString(): String

Definition Classes
AnyRef → Any
def values: TypedPipe[T]

Definition Classes
KeyedListLike
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

trait KeyedList[K, +T] extends KeyedListLike[K, T, KeyedList]

Abstract Value Members

abstract def filterKeys(fn: (K) ⇒ Boolean): KeyedList[K, T]

abstract def mapGroup[V](smfn: (K, Iterator[T]) ⇒ Iterator[V]): KeyedList[K, V]

abstract def toTypedPipe: TypedPipe[(K, T)]

Concrete Value Members

final def !=(arg0: AnyRef): Boolean

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: AnyRef): Boolean

final def ==(arg0: Any): Boolean

def aggregate[B, C](agg: Aggregator[T, B, C]): KeyedList[K, C]

final def asInstanceOf[T0]: T0

def clone(): AnyRef

def count(fn: (T) ⇒ Boolean): KeyedList[K, Long]

def drop(n: Int): KeyedList[K, T]

def dropWhile(p: (T) ⇒ Boolean): KeyedList[K, T]

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def filter(fn: ((K, T)) ⇒ Boolean): KeyedList[K, T]

def finalize(): Unit

def foldLeft[B](z: B)(fn: (B, T) ⇒ B): KeyedList[K, B]

def forall(fn: (T) ⇒ Boolean): KeyedList[K, Boolean]

def forceToReducers: KeyedList[K, T]

final def getClass(): Class[_]

def hashCode(): Int

def head: KeyedList[K, T]

final def isInstanceOf[T0]: Boolean

def keys: TypedPipe[K]

def mapValueStream[V](smfn: (Iterator[T]) ⇒ Iterator[V]): KeyedList[K, V]

def mapValues[V](fn: (T) ⇒ V): KeyedList[K, V]

def max[B >: T](implicit cmp: Ordering[B]): KeyedList[K, T]

def maxBy[B](fn: (T) ⇒ B)(implicit cmp: Ordering[B]): KeyedList[K, T]

def min[B >: T](implicit cmp: Ordering[B]): KeyedList[K, T]

def minBy[B](fn: (T) ⇒ B)(implicit cmp: Ordering[B]): KeyedList[K, T]

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

def product[U >: T](implicit ring: Ring[U]): KeyedList[K, U]

def reduce[U >: T](fn: (U, U) ⇒ U): KeyedList[K, U]

def reduceLeft[U >: T](fn: (U, U) ⇒ U): KeyedList[K, U]

def scanLeft[B](z: B)(fn: (B, T) ⇒ B): KeyedList[K, B]

def size: KeyedList[K, Long]

def sortWithTake[U >: T](k: Int)(lessThan: (U, U) ⇒ Boolean): KeyedList[K, Seq[T]]

def sortedReverseTake(k: Int)(implicit ord: Ordering[_ >: T]): KeyedList[K, Seq[T]]

def sortedTake(k: Int)(implicit ord: Ordering[_ >: T]): KeyedList[K, Seq[T]]

def sum[U >: T](implicit sg: Semigroup[U]): KeyedList[K, U]

def sumLeft[U >: T](implicit sg: Semigroup[U]): KeyedList[K, U]

final def synchronized[T0](arg0: ⇒ T0): T0

def take(n: Int): KeyedList[K, T]

def takeWhile(p: (T) ⇒ Boolean): KeyedList[K, T]

def toList: KeyedList[K, List[T]]

def toSet[U >: T]: KeyedList[K, Set[U]]

def toString(): String

def values: TypedPipe[T]

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from KeyedListLike[K, T, KeyedList]

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped