Same as groupAll.
Same as groupAll.aggregate.values
Returns the set of distinct elements in the TypedPipe
Returns the set of distinct elements in the TypedPipe
Keep only items satisfying a predicate
flatten an Iterable
Force a materialization of this pipe prior to the next operation.
Force a materialization of this pipe prior to the next operation. This is useful if you filter almost everything before a hashJoin, for instance.
limit the output to at most count items.
limit the output to at most count items. useful for debugging, but probably that's about it. The number may be less than count, and not sampled particular method
This actually runs all the pure map functions in one Cascading Each This approach is more efficient than untyped scalding because we don't use TupleConverters/Setters after each map.
This actually runs all the pure map functions in one Cascading Each This approach is more efficient than untyped scalding because we don't use TupleConverters/Setters after each map. The output pipe has a single item CTuple with an object of type T in position 0
Reasonably common shortcut for cases of associative/commutative reduction returns a typed pipe with only one element.
Safely write to a Mappable[U].
Safely write to a Mappable[U]. If you want to write to a Source (not mappable) you need to do something like: toPipe(fieldNames).write(dest)
a pipe equivalent to the current pipe.
Represents a phase in a distributed computation on an input data source Wraps a cascading Pipe object, and holds the transformation done up until that point