Object

com.vbounyasit.bigdata.example

SampleApplication

Related Doc: package example

Permalink

object SampleApplication extends SparkApplication[Nothing, Nothing]

Linear Supertypes
SparkApplication[Nothing, Nothing], LoggerProvider, ETL[Nothing, Nothing], SparkSessionProvider, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. SampleApplication
  2. SparkApplication
  3. LoggerProvider
  4. ETL
  5. SparkSessionProvider
  6. AnyRef
  7. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. val configDefinition: ConfigDefinition

    Permalink

    The configuration files definition

    The configuration files definition

    Definition Classes
    SampleApplicationSparkApplication
  7. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  9. def executionPlans(implicit spark: SparkSession): Map[String, ExecutionConfig]

    Permalink

    The defined execution plans

    The defined execution plans

    spark

    an implicit spark session

    returns

    A JobName/ExecutionPlan Map

    Definition Classes
    SampleApplicationSparkApplication
  10. def extract(jobName: String, jobSourcesConf: List[JobSource], sourcesConf: SourcesConf, env: String)(implicit spark: SparkSession): Sources

    Permalink

    Extracts data from a provided sources configuration

    Extracts data from a provided sources configuration

    jobName

    The Job name

    jobSourcesConf

    The Job input sources configuration

    sourcesConf

    The different input sources configuration

    env

    The environment in which we want to extract the input sources from

    spark

    An implicit spark session

    returns

    A Map of sourceName/SourcePipeline containing the extracted sources.

    Definition Classes
    SparkApplicationETL
  11. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  12. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  13. final def getSparkSession(sparkParamsConf: SparkParamsConf): SparkSession

    Permalink
    Definition Classes
    SparkSessionProvider
  14. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  15. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  16. def load(dataFrame: DataFrame, database: String, table: String, optionalJobParameters: EmptyOptionalParameters): Unit

    Permalink

    Saves the resulting dataFrame to disk

    Saves the resulting dataFrame to disk

    dataFrame

    The resulting DataFrame

    database

    The output database name

    table

    The output table name (job name)

    optionalJobParameters

    An OptionalJobParameters object containing any custom argument/application files we defined through our application.

    Definition Classes
    SampleApplicationETL
  17. def loadExecutionData(args: Array[String]): ExecutionData[_, _]

    Permalink

    Loads a set of parameters needed for the ETL Operation

    Loads a set of parameters needed for the ETL Operation

    through : config files loading, argument parsing, execution parameters creation, etc...

    args

    The list of arguments to parse

    returns

    An ExecutionData object containing all the required parameters

    Attributes
    protected
    Definition Classes
    SparkApplicationETL
  18. val logger: Logger

    Permalink
    Definition Classes
    LoggerProvider
  19. def main(args: Array[String]): Unit

    Permalink
  20. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  21. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  22. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  23. def runETL[GlobalConfig, GlobalArgument, Config, Argument, ConfigInput, ArgumentInput](executionData: ExecutionData[GlobalConfig, GlobalArgument]): Unit

    Permalink

    The main method containing the logic for running our ETL job

    The main method containing the logic for running our ETL job

    executionData

    The ExecutionData object that will be used

    Definition Classes
    ETL
  24. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  25. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  26. def transform(jobName: String, sources: Sources, executionPlan: ExecutionPlan, outputColumns: Option[Seq[String]], exportDateColumn: Option[String])(implicit spark: SparkSession): DataFrame

    Permalink

    Apply transformations to a given set of sources

    Apply transformations to a given set of sources

    jobName

    The Job name

    sources

    The extracted input sources

    executionPlan

    The execution plan to apply

    exportDateColumn

    An optional date column name to tie the result computation date

    spark

    An implicit spark session

    returns

    The resulting DataFrame

    Definition Classes
    SparkApplicationETL
  27. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  28. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  29. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from SparkApplication[Nothing, Nothing]

Inherited from LoggerProvider

Inherited from ETL[Nothing, Nothing]

Inherited from SparkSessionProvider

Inherited from AnyRef

Inherited from Any

Ungrouped