com.twitter.scalding.reducer_estimation

InputSizeReducerEstimator

class InputSizeReducerEstimator extends ReducerEstimator

Estimator that uses the input size and a fixed "bytesPerReducer" target.

Bytes per reducer can be configured with configuration parameter, defaults to 1 GB.

Linear Supertypes
ReducerEstimator, AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. InputSizeReducerEstimator
  2. ReducerEstimator
  3. AnyRef
  4. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new InputSizeReducerEstimator()

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  8. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  9. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  10. def estimateReducers(info: FlowStrategyInfo): Option[Int]

    Figure out the total size of the input to the current step and set the number of reducers using the "bytesPerReducer" configuration parameter.

    Figure out the total size of the input to the current step and set the number of reducers using the "bytesPerReducer" configuration parameter.

    info

    Holds information about the overall flow (.flow), previously-run steps (.predecessorSteps), and the current step (.step).

    returns

    Number of reducers recommended by the estimator, or None to keep the default.

    Definition Classes
    InputSizeReducerEstimatorReducerEstimator
  11. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  12. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  13. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  14. def inputSizes(step: FlowStep[JobConf]): Option[Seq[(String, Long)]]

    Attributes
    protected
  15. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  16. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Definition Classes
    AnyRef
  18. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  19. def size(f: Hfs, conf: JobConf): Long

    Get the total size of the file(s) specified by the Hfs, which may contain a glob pattern in its path, so we must be ready to handle that case.

    Get the total size of the file(s) specified by the Hfs, which may contain a glob pattern in its path, so we must be ready to handle that case.

    Attributes
    protected
  20. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  21. def toString(): String

    Definition Classes
    AnyRef → Any
  22. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  24. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from ReducerEstimator

Inherited from AnyRef

Inherited from Any

Ungrouped