AdpEmrActivity

Runs an Amazon EMR cluster.

AWS Data Pipeline uses a different format for steps than Amazon EMR, for example AWS Data Pipeline uses comma-separated arguments after the JAR name in the EmrActivity step field.

input: The input data source.
output: The location for the output
preStepCommand: Shell scripts to be run before any steps are run. To specify multiple scripts, up to 255, add multiple preStepCommand fields.
postStepCommand: Shell scripts to be run after all steps are finished. To specify multiple scripts, up to 255, add multiple postStepCommand fields.
runsOn: The Amazon EMR cluster to run this cluster.
step: One or more steps for the cluster to run. To specify multiple steps, up to 255, add multiple step fields. Use comma-separated arguments after the JAR name; for example, "s3://example-bucket/MyWork.jar,arg1,arg2,arg3".

Source: AdpActivities.scala

Linear Supertypes

Serializable, Serializable, Product, Equals, AdpActivity, AdpDataPipelineObject, AdpDataPipelineAbstractObject, AnyRef, Any

Instance Constructors

new AdpEmrActivity(id: String, name: Option[String], input: Option[AdpRef[AdpDataNode]], output: Option[AdpRef[AdpDataNode]], preStepCommand: Option[Seq[String]], postStepCommand: Option[Seq[String]], runsOn: AdpRef[AdpEmrCluster], step: Seq[String], dependsOn: Option[Seq[AdpRef[AdpActivity]]])

input
The input data source.
output
The location for the output
preStepCommand
Shell scripts to be run before any steps are run. To specify multiple scripts, up to 255, add multiple preStepCommand fields.
postStepCommand
Shell scripts to be run after all steps are finished. To specify multiple scripts, up to 255, add multiple postStepCommand fields.
runsOn
The Amazon EMR cluster to run this cluster.
step
One or more steps for the cluster to run. To specify multiple steps, up to 255, add multiple step fields. Use comma-separated arguments after the JAR name; for example, "s3://example-bucket/MyWork.jar,arg1,arg2,arg3".

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
val dependsOn: Option[Seq[AdpRef[AdpActivity]]]

One or more references to other Activities that must reach the FINISHED state before this activity will start.
One or more references to other Activities that must reach the FINISHED state before this activity will start.

Definition Classes
AdpEmrActivity → AdpActivity
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
val id: String

The ID of the object, IDs must be unique within a pipeline definition
The ID of the object, IDs must be unique within a pipeline definition

Definition Classes
AdpEmrActivity → AdpDataPipelineObject → AdpDataPipelineAbstractObject
val input: Option[AdpRef[AdpDataNode]]

The input data source.
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
val name: Option[String]

The optional, user-defined label of the object.
The optional, user-defined label of the object. If you do not provide a name for an object in a pipeline definition, AWS Data Pipeline automatically duplicates the value of id.

Definition Classes
AdpEmrActivity → AdpDataPipelineObject → AdpDataPipelineAbstractObject
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
val output: Option[AdpRef[AdpDataNode]]

The location for the output
val postStepCommand: Option[Seq[String]]

Shell scripts to be run after all steps are finished.
Shell scripts to be run after all steps are finished. To specify multiple scripts, up to 255, add multiple postStepCommand fields.
val preStepCommand: Option[Seq[String]]

Shell scripts to be run before any steps are run.
Shell scripts to be run before any steps are run. To specify multiple scripts, up to 255, add multiple preStepCommand fields.
val runsOn: AdpRef[AdpEmrCluster]

The Amazon EMR cluster to run this cluster.
The Amazon EMR cluster to run this cluster.

Definition Classes
AdpEmrActivity → AdpActivity
val step: Seq[String]

One or more steps for the cluster to run.
One or more steps for the cluster to run. To specify multiple steps, up to 255, add multiple step fields. Use comma-separated arguments after the JAR name; for example, "s3://example-bucket/MyWork.jar,arg1,arg2,arg3".
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
val type: String

The type of object.
The type of object. Use one of the predefined AWS Data Pipeline object types.

Definition Classes
AdpEmrActivity → AdpDataPipelineObject
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

Related Doc: package aws

Instance Constructors

new AdpEmrActivity(id: String, name: Option[String], input: Option[AdpRef[AdpDataNode]], output: Option[AdpRef[AdpDataNode]], preStepCommand: Option[Seq[String]], postStepCommand: Option[Seq[String]], runsOn: AdpRef[AdpEmrCluster], step: Seq[String], dependsOn: Option[Seq[AdpRef[AdpActivity]]])

Value Members

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: Any): Boolean

final def asInstanceOf[T0]: T0

def clone(): AnyRef

val dependsOn: Option[Seq[AdpRef[AdpActivity]]]

final def eq(arg0: AnyRef): Boolean

def finalize(): Unit

final def getClass(): Class[_]

val id: String

val input: Option[AdpRef[AdpDataNode]]

final def isInstanceOf[T0]: Boolean

val name: Option[String]

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

val output: Option[AdpRef[AdpDataNode]]

val postStepCommand: Option[Seq[String]]

val preStepCommand: Option[Seq[String]]

val runsOn: AdpRef[AdpEmrCluster]

val step: Seq[String]

final def synchronized[T0](arg0: ⇒ T0): T0

val type: String

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from Serializable

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from AdpActivity

Inherited from AdpDataPipelineObject

Inherited from AdpDataPipelineAbstractObject

Inherited from AnyRef

Inherited from Any

Ungrouped