com.krux.hyperion.aws

AdpHiveActivity

class AdpHiveActivity extends AdpDataPipelineAbstractObject with AdpActivity

ref: http://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-object-hiveactivity.html

Source
AdpActivities.scala
Linear Supertypes
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. AdpHiveActivity
  2. AdpActivity
  3. AdpDataPipelineObject
  4. AdpDataPipelineAbstractObject
  5. AdpObject
  6. AnyRef
  7. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new AdpHiveActivity(id: String, name: Option[String], hiveScript: Option[String], scriptUri: Option[String], scriptVariable: Option[Seq[String]], stage: Option[String], input: Option[AdpRef[AdpDataNode]], output: Option[AdpRef[AdpDataNode]], hadoopQueue: Option[String], preActivityTaskConfig: Option[AdpRef[AdpShellScriptConfig]], postActivityTaskConfig: Option[AdpRef[AdpShellScriptConfig]], workerGroup: Option[String], runsOn: Option[AdpRef[AdpEmrCluster]], dependsOn: Option[Seq[AdpRef[AdpActivity]]], precondition: Option[Seq[AdpRef[AdpPrecondition]]], onFail: Option[Seq[AdpRef[AdpSnsAlarm]]], onSuccess: Option[Seq[AdpRef[AdpSnsAlarm]]], onLateAction: Option[Seq[AdpRef[AdpSnsAlarm]]], attemptTimeout: Option[String], lateAfterTimeout: Option[String], maximumRetries: Option[String], retryDelay: Option[String], failureAndRerunMode: Option[String])

    hiveScript

    The Hive script to run.

    scriptUri

    The location of the Hive script to run. For example, s3://script location.

    scriptVariable

    Specifies script variables for Amazon EMR to pass to Hive while running a script. For example, the following example script variables would pass a SAMPLE and FILTER_DATE variable to Hive: SAMPLE=s3://elasticmapreduce/samples/hive-ads and FILTER_DATE=#{format(@scheduledStartTime,'YYYY-MM-dd')}% This field accepts multiple values and works with both script and scriptUri fields. In addition, scriptVariable functions regardless of stage set to true or false. This field is especially useful to send dynamic values to Hive using AWS Data Pipeline expressions and functions. For more information, see Pipeline Expressions and Functions.

    stage

    Determines whether staging is enabled. Not permitted with Hive 11, so use an Amazon EMR AMI version 3.2.0 or greater.

    input

    The input data source. Data node object reference Yes

    output

    The location for the output. Data node object reference Yes

    runsOn

    The Amazon EMR cluster to run this activity. EmrCluster object reference Yes

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  7. val attemptTimeout: Option[String]

    The timeout time interval for an object attempt.

    The timeout time interval for an object attempt. If an attempt does not complete within the start time plus this time interval, AWS Data Pipeline marks the attempt as failed and your retry settings determine the next steps taken.

    Definition Classes
    AdpHiveActivityAdpActivity
  8. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  9. val dependsOn: Option[Seq[AdpRef[AdpActivity]]]

    One or more references to other Activities that must reach the FINISHED state before this activity will start.

    One or more references to other Activities that must reach the FINISHED state before this activity will start.

    Definition Classes
    AdpHiveActivityAdpActivity
  10. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  11. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  12. val failureAndRerunMode: Option[String]

    Determines whether pipeline object failures and rerun commands cascade through pipeline object dependencies

    Determines whether pipeline object failures and rerun commands cascade through pipeline object dependencies

    Possible values include cascade and none.

    Definition Classes
    AdpHiveActivityAdpActivity
  13. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  14. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  15. val hadoopQueue: Option[String]

  16. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  17. val hiveScript: Option[String]

    The Hive script to run.

  18. val id: String

    The ID of the object, IDs must be unique within a pipeline definition

    The ID of the object, IDs must be unique within a pipeline definition

    Definition Classes
    AdpHiveActivityAdpDataPipelineObjectAdpObject
  19. val input: Option[AdpRef[AdpDataNode]]

    The input data source.

    The input data source. Data node object reference Yes

  20. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  21. val lateAfterTimeout: Option[String]

    The time period in which the object run must start.

    The time period in which the object run must start. If the object does not start within the scheduled start time plus this time interval, it is considered late

    Definition Classes
    AdpHiveActivityAdpActivity
  22. val maximumRetries: Option[String]

    The maximum number of times to retry the action.

    The maximum number of times to retry the action. The default value is 2, which results in 3 tries total (1 original attempt plus 2 retries). The maximum value is 5 (6 total attempts).

    Definition Classes
    AdpHiveActivityAdpActivity
  23. val name: Option[String]

    The optional, user-defined label of the object.

    The optional, user-defined label of the object. If you do not provide a name for an object in a pipeline definition, AWS Data Pipeline automatically duplicates the value of id.

    Definition Classes
    AdpHiveActivityAdpDataPipelineObjectAdpDataPipelineAbstractObject
  24. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  25. final def notify(): Unit

    Definition Classes
    AnyRef
  26. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  27. val onFail: Option[Seq[AdpRef[AdpSnsAlarm]]]

    The SNS alarm to raise when the activity fails.

    The SNS alarm to raise when the activity fails.

    Definition Classes
    AdpHiveActivityAdpActivity
  28. val onLateAction: Option[Seq[AdpRef[AdpSnsAlarm]]]

    The SNS alarm to raise when the activity fails to start on time.

    The SNS alarm to raise when the activity fails to start on time.

    Definition Classes
    AdpHiveActivityAdpActivity
  29. val onSuccess: Option[Seq[AdpRef[AdpSnsAlarm]]]

    The SNS alarm to raise when the activity succeeds.

    The SNS alarm to raise when the activity succeeds.

    Definition Classes
    AdpHiveActivityAdpActivity
  30. val output: Option[AdpRef[AdpDataNode]]

    The location for the output.

    The location for the output. Data node object reference Yes

  31. val postActivityTaskConfig: Option[AdpRef[AdpShellScriptConfig]]

  32. val preActivityTaskConfig: Option[AdpRef[AdpShellScriptConfig]]

  33. val precondition: Option[Seq[AdpRef[AdpPrecondition]]]

    A condition that must be met before the object can run.

    A condition that must be met before the object can run. To specify multiple conditions, add multiple precondition fields. The activity cannot run until all its conditions are met.

    Definition Classes
    AdpHiveActivityAdpActivity
  34. val retryDelay: Option[String]

    The timeout duration between two retry attempts.

    The timeout duration between two retry attempts. The default is 10 minutes.

    Definition Classes
    AdpHiveActivityAdpActivity
  35. val runsOn: Option[AdpRef[AdpEmrCluster]]

    The Amazon EMR cluster to run this activity.

    The Amazon EMR cluster to run this activity. EmrCluster object reference Yes

  36. val scriptUri: Option[String]

    The location of the Hive script to run.

    The location of the Hive script to run. For example, s3://script location.

  37. val scriptVariable: Option[Seq[String]]

    Specifies script variables for Amazon EMR to pass to Hive while running a script.

    Specifies script variables for Amazon EMR to pass to Hive while running a script. For example, the following example script variables would pass a SAMPLE and FILTER_DATE variable to Hive: SAMPLE=s3://elasticmapreduce/samples/hive-ads and FILTER_DATE=#{format(@scheduledStartTime,'YYYY-MM-dd')}% This field accepts multiple values and works with both script and scriptUri fields. In addition, scriptVariable functions regardless of stage set to true or false. This field is especially useful to send dynamic values to Hive using AWS Data Pipeline expressions and functions. For more information, see Pipeline Expressions and Functions.

  38. val stage: Option[String]

    Determines whether staging is enabled.

    Determines whether staging is enabled. Not permitted with Hive 11, so use an Amazon EMR AMI version 3.2.0 or greater.

  39. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  40. def toString(): String

    Definition Classes
    AnyRef → Any
  41. val type: String

    The type of object.

    The type of object. Use one of the predefined AWS Data Pipeline object types.

    Definition Classes
    AdpHiveActivityAdpDataPipelineObject
  42. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  43. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  44. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  45. val workerGroup: Option[String]

    The worker group.

    The worker group. This is used for routing tasks. If you provide a runsOn value and workerGroup exists, workerGroup is ignored.

    Definition Classes
    AdpHiveActivityAdpActivity

Inherited from AdpActivity

Inherited from AdpDataPipelineObject

Inherited from AdpObject

Inherited from AnyRef

Inherited from Any

Ungrouped