com.twitter.scalding

TypedDelimited

class TypedDelimited[T] extends FixedPathSource with DelimitedScheme with Mappable[T] with TypedSink[T]

Allows you to set the types, prefer this: If T is a subclass of Product, we assume it is a tuple. If it is not, wrap T in a Tuple1: e.g. TypedTsv[Tuple1[List[Int]]]

Linear Supertypes
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. TypedDelimited
  2. TypedSink
  3. Mappable
  4. TypedSource
  5. DelimitedScheme
  6. FixedPathSource
  7. FileSource
  8. LocalSourceOverride
  9. SchemedSource
  10. Source
  11. Serializable
  12. AnyRef
  13. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new TypedDelimited(p: Seq[String], fields: Fields = cascading.tuple.Fields.ALL, skipHeader: Boolean = false, writeHeader: Boolean = false, separator: String = "\t")(implicit mf: Manifest[T], conv: TupleConverter[T], tset: TupleSetter[T])

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. def checkFlowDefNotNull(implicit flowDef: FlowDef, mode: Mode): Unit

    Attributes
    protected
    Definition Classes
    Source
  8. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  9. def converter[U >: T]: TupleConverter[U]

    Because TupleConverter cannot be covariant, we need to jump through this hoop.

    Because TupleConverter cannot be covariant, we need to jump through this hoop. A typical implementation might be: (implicit conv: TupleConverter[T]) and then:

    override def converter[U >: T] = TupleConverter.asSuperConverter[T, U](conv)

    Definition Classes
    TypedDelimitedTypedSource
  10. def createHdfsReadTap(hdfsMode: Hdfs): Tap[JobConf, _, _]

    Attributes
    protected
    Definition Classes
    FileSource
  11. def createLocalTap(sinkMode: SinkMode): Tap[_, _, _]

    Creates a local tap.

    Creates a local tap.

    sinkMode

    The mode for handling output conflicts.

    Definition Classes
    LocalSourceOverride
  12. def createTap(readOrWrite: AccessMode)(implicit mode: Mode): Tap[_, _, _]

    Subclasses of Source MUST override this method.

    Subclasses of Source MUST override this method. They may call out to TestTapFactory for making Taps suitable for testing.

    Definition Classes
    FileSourceSource
  13. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  14. def equals(that: Any): Boolean

    Definition Classes
    TypedDelimitedFixedPathSource → AnyRef → Any
  15. val fields: Fields

    Definition Classes
    TypedDelimitedDelimitedScheme
  16. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  17. final def flatMapTo[U](out: Fields)(mf: (T) ⇒ TraversableOnce[U])(implicit flowDef: FlowDef, mode: Mode, setter: TupleSetter[U]): Pipe

    If you want to filter, you should use this and output a 0 or 1 length Iterable.

    If you want to filter, you should use this and output a 0 or 1 length Iterable. Filter does not change column names, and we generally expect to change columns here

    Definition Classes
    Mappable
  18. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  19. def goodHdfsPaths(hdfsMode: Hdfs): Iterable[String]

    Attributes
    protected
    Definition Classes
    FileSource
  20. lazy val hashCode: Int

    Definition Classes
    TypedDelimitedFixedPathSource → AnyRef → Any
  21. def hdfsPaths: List[String]

    Definition Classes
    FixedPathSourceFileSource
  22. def hdfsReadPathsAreGood(conf: Configuration): Boolean

    Attributes
    protected
    Definition Classes
    FileSource
  23. def hdfsScheme: Scheme[JobConf, RecordReader[_, _], OutputCollector[_, _], _, _]

    The scheme to use if the source is on hdfs.

    The scheme to use if the source is on hdfs.

    Definition Classes
    DelimitedSchemeSchemedSource
  24. def hdfsWritePath: String

    Definition Classes
    FileSource
  25. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  26. def localPath: String

    A path to use for the local tap.

    A path to use for the local tap.

    Definition Classes
    FixedPathSourceLocalSourceOverride
  27. def localScheme: TextDelimited

    The scheme to use if the source is local.

    The scheme to use if the source is local.

    Definition Classes
    DelimitedSchemeSchemedSource
  28. final def mapTo[U](out: Fields)(mf: (T) ⇒ U)(implicit flowDef: FlowDef, mode: Mode, setter: TupleSetter[U]): Pipe

    Definition Classes
    Mappable
  29. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  30. final def notify(): Unit

    Definition Classes
    AnyRef
  31. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  32. def pathIsGood(p: String, conf: Configuration): Boolean

    Determines if a path is 'valid' for this source.

    Determines if a path is 'valid' for this source. In strict mode all paths must be valid. In non-strict mode, all invalid paths will be filtered out.

    Subclasses can override this to validate paths.

    The default implementation is a quick sanity check to look for missing or empty directories. It is necessary but not sufficient -- there are cases where this will return true but there is in fact missing data.

    TODO: consider writing a more in-depth version of this method in TimePathedSource that looks for TODO: missing days / hours etc.

    Attributes
    protected
    Definition Classes
    FileSource
  33. val quote: String

    Definition Classes
    DelimitedScheme
  34. def read(implicit flowDef: FlowDef, mode: Mode): Pipe

    Definition Classes
    Source
  35. val safe: Boolean

    Definition Classes
    DelimitedScheme
  36. val separator: String

    Definition Classes
    TypedDelimitedDelimitedScheme
  37. def setter[U <: T]: TupleSetter[U]

    Definition Classes
    TypedDelimitedTypedSink
  38. def sinkFields: Fields

    Definition Classes
    TypedSink
  39. val sinkMode: SinkMode

    Definition Classes
    SchemedSource
  40. val skipHeader: Boolean

    Definition Classes
    TypedDelimitedDelimitedScheme
  41. def sourceFields: Fields

    Definition Classes
    TypedSource
  42. val strict: Boolean

    Definition Classes
    DelimitedScheme
  43. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  44. def toIterator(implicit mode: Mode): Iterator[T]

    Allows you to read a Tap on the submit node NOT FOR USE IN THE MAPPERS OR REDUCERS.

    Allows you to read a Tap on the submit node NOT FOR USE IN THE MAPPERS OR REDUCERS. Typical use might be to read in Job.next to determine if another job is needed

    Definition Classes
    Mappable
  45. lazy val toString: String

    Definition Classes
    TypedDelimitedFixedPathSource → AnyRef → Any
  46. def transformForRead(pipe: Pipe): Pipe

    Attributes
    protected
    Definition Classes
    Source
  47. def transformForWrite(pipe: Pipe): Pipe

    Attributes
    protected
    Definition Classes
    Source
  48. def transformInTest: Boolean

    The mock passed in to scalding.

    The mock passed in to scalding.JobTest may be considered as a mock of the Tap or the Source. By default, as of 0.9.0, it is considered as a Mock of the Source. If you set this to true, the mock in TestMode will be considered to be a mock of the Tap (which must be transformed) and not the Source.

    Definition Classes
    Source
  49. val types: Array[Class[_]]

    Definition Classes
    TypedDelimitedDelimitedScheme
  50. def validateTaps(mode: Mode): Unit

    Definition Classes
    FileSourceSource
  51. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  52. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  53. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  54. def writeFrom(pipe: Pipe)(implicit flowDef: FlowDef, mode: Mode): Pipe

    write the pipe and return the input so it can be chained into the next operation

    write the pipe and return the input so it can be chained into the next operation

    Definition Classes
    Source
  55. val writeHeader: Boolean

    Definition Classes
    TypedDelimitedDelimitedScheme

Deprecated Value Members

  1. def readAtSubmitter[T](implicit mode: Mode, conv: TupleConverter[T]): Stream[T]

    Definition Classes
    Source
    Annotations
    @deprecated
    Deprecated

    (Since version 0.9.0) replace with Mappable.toIterator

Inherited from typed.TypedSink[T]

Inherited from Mappable[T]

Inherited from typed.TypedSource[T]

Inherited from DelimitedScheme

Inherited from FixedPathSource

Inherited from FileSource

Inherited from LocalSourceOverride

Inherited from SchemedSource

Inherited from Source

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped